Search Machine Learning Repository:
All publications by Odalric-ambrym Maillard
authors venues years



Latent Bandits.
Odalric-ambrym Maillard and Shie Mannor
Proceedings of the 31st International Conference on Machine Learning (ICML-14), 2014


How hard is my MDP?" The distribution-norm to the rescue"
Odalric-ambrym Maillard, Timothy A. Mann and Shie Mannor
Advances in Neural Information Processing Systems 27, 2014


Optimal Regret Bounds for Selecting the State Representation in Reinforcement Learning
Odalric-ambrym Maillard, Phuong Nguyen, Ronald Ortner and Daniil Ryabko
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


Online allocation and homogeneous partitioning for piecewise constant mean-approximation
Alexandra Carpentier and Odalric-ambrym Maillard
Advances in Neural Information Processing Systems 25, 2012


Hierarchical Optimistic Region Selection driven by Curiosity
Odalric-ambrym Maillard
Advances in Neural Information Processing Systems 25, 2012


Selecting the State-Representation in Reinforcement Learning
Odalric-ambrym Maillard, Daniil Ryabko and Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Sparse Recovery with Brownian Sensing
Alexandra Carpentier, Odalric-ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Adaptive Bandits: Towards the best history-dependent strategy
Odalric-ambrym Maillard and Rémi Munos
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS-11), 2011