Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

Part of Advances in Neural Information Processing Systems 14 (NIPS 2001)

Bibtex »Metadata »Paper »

Authors

Gregory Grudic, Lyle Ungar

Abstract

Abstract Unavailable