Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
Authors: Junpei Komiyama, Junya Honda and Hiroshi Nakagawa
Conference: Proceedings of the 32nd International Conference on Machine Learning (ICML-15)