Search Machine Learning Repository: Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Authors: Ronald Ortner and Daniil Ryabko
Conference: Advances in Neural Information Processing Systems 25
Year: 2012
Pages: 1772--1780
[pdf] [BibTeX]

authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).