Article citationsMore>>

Sutton, R. S., McAllester, D. A., Singh, S. P., & Mansour, Y. (2000). Policy Gradient Methods for Reinforcement Learning with Function Approximation. In M. I. Jordan, Y. Lecun, & S. A. Solla (Eds.), Advances in Neural Information Processing Systems (pp. 1057-1063). MIT Press.

has been cited by the following article:

Follow SCIRP
Twitter Facebook Linkedin Weibo
Contact us
customer@scirp.org
WhatsApp +86 18163351462(WhatsApp)
Click here to send a message to me 1655362766
Paper Publishing WeChat
Free SCIRP Newsletters
Copyright © 2006-2025 Scientific Research Publishing Inc. All Rights Reserved.
Top