Search Machine Learning Repository: Online Markov Decision Processes under Bandit Feedback
Authors: Gergely Neu, Andras Antos, András György and Csaba Szepesvári
Conference: Advances in Neural Information Processing Systems 23
Year: 2010
Pages: 1804--1812
[pdf] [BibTeX]

authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).