All publications by Peter Auer



PAC Subset Selection in Stochastic Multi-armed Bandits
Shivaram Kalyanakrishnan, Ambuj Tewari, Peter Auer and Peter Stone
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


PAC-Bayesian Analysis of Contextual Bandits
Yevgeny Seldin, Peter Auer, John S. Shawe-Taylor, Ronald Ortner and François Laviolette
Advances in Neural Information Processing Systems 24, 2011


Workshop summary: On-line learning with limited feedback
Jean-Yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko and Csaba Szepesvári
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Near-optimal Regret Bounds for Reinforcement Learning
Peter Auer, Thomas Jaksch and Ronald Ortner
Advances in Neural Information Processing Systems 21, 2008


Logarithmic Online Regret Bounds for Undiscounted Reinforcement Learning
Peter Auer and Ronald Ortner
Advances in Neural Information Processing Systems 19, 2006


Using Confidence Bounds for Exploitation-Exploration Trade-offs
Peter Auer
Journal of Machine Learning Research, 2002