Search Machine Learning Repository: Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret
Authors: Raman Arora, Ofer Dekel and Ambuj Tewari
Conference: Proceedings of the 29th International Conference on Machine Learning (ICML-12)
Year: 2012
Pages: 1503--1510
[pdf] [BibTeX]

authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).