Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
Authors: Junpei Komiyama, Junya Honda and Hiroshi Nakagawa
Conference: Proceedings of the 32nd International Conference on Machine Learning (ICML-15)
Year: 2015
Pages: 1152-1161
