Policy Gradient Methods for Reinforcement Learning with Function Approximation

Part of Advances in Neural Information Processing Systems 12 (NIPS 1999)

Bibtex Metadata Paper

Authors

Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour

Abstract

Abstract Unavailable