Search Machine Learning Repository:
Online Markov Decision Processes under Bandit Feedback
Authors: Gergely Neu, Andras Antos, András György and Csaba Szepesvári
Conference: Advances in Neural Information Processing Systems 23
authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).