Logarithmic Online Regret Bounds for Undiscounted Reinforcement Learning

Part of Advances in Neural Information Processing Systems 19 (NIPS 2006)

Authors

Peter Auer, Ronald Ortner

Abstract

Abstract Unavailable