Policy Gradient Methods for Reinforcement Learning with Function Approximation

Part of Advances in Neural Information Processing Systems 12 (NIPS 1999)

Bibtex »Metadata »Paper »

Authors

Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour

Abstract

Abstract Unavailable