Search Machine Learning Repository:
All publications by Rémi Munos
authors venues years



Stochastic Simultaneous Optimistic Optimization
Michal Valko, Alexandra Carpentier and Rémi Munos
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


Toward Optimal Stratification for Stratified Monte-Carlo Integration
Alexandra Carpentier and Rémi Munos
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


Risk-Aversion in Multi-armed Bandits
Amir Sani, Alessandro Lazaric and Rémi Munos
Advances in Neural Information Processing Systems 25, 2012


Bandit Algorithms boost Brain Computer Interfaces for motor-task selection of a brain-controlled button
Joan Fruitet, Alexandra Carpentier, Maureen Clerc and Rémi Munos
Advances in Neural Information Processing Systems 25, 2012


Adaptive Stratified Sampling for Monte-Carlo integration of Differentiable functions
Alexandra Carpentier and Rémi Munos
Advances in Neural Information Processing Systems 25, 2012


On the Sample Complexity of Reinforcement Learning with a Generative Model
Mohammad G. Azar, Bert Kappen and Rémi Munos
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


Bandit Theory meets Compressed Sensing for high dimensional Stochastic Linear Bandit
Alexandra Carpentier and Rémi Munos
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics (AISTATS-12), 2012


Optimistic planning for Markov decision processes
Lucian Busoniu and Rémi Munos
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics (AISTATS-12), 2012


Selecting the State-Representation in Reinforcement Learning
Odalric-ambrym Maillard, Daniil Ryabko and Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Speedy Q-Learning
Mohammad Ghavamzadeh, Hilbert J. Kappen, Mohammad G. Azar and Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Sparse Recovery with Brownian Sensing
Alexandra Carpentier, Odalric-ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Finite Time Analysis of Stratified Sampling for Monte Carlo
Alexandra Carpentier and Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Optimistic Optimization of a Deterministic Function without the Knowledge of its Smoothness
Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Finite-Sample Analysis of Lasso-TD
Mohammad Ghavamzadeh, Alessandro Lazaric, Matthew Hoffman and Rémi Munos
Proceedings of the 28th International Conference on Machine Learning (ICML-11), 2011


Adaptive Bandits: Towards the best history-dependent strategy
Odalric-ambrym Maillard and Rémi Munos
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS-11), 2011


Finite-Sample Analysis of LSTD
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Analysis of a Classification-based Policy Iteration Algorithm
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Error Propagation for Approximate Policy and Value Iteration
Amir-massoud Farahmand, Csaba Szepesvári and Rémi Munos
Advances in Neural Information Processing Systems 23, 2010


Scrambled Objects for Least-Squares Regression
Odalric Maillard and Rémi Munos
Advances in Neural Information Processing Systems 23, 2010


LSTD with Random Projections
Mohammad Ghavamzadeh, Alessandro Lazaric, Odalric Maillard and Rémi Munos
Advances in Neural Information Processing Systems 23, 2010


Workshop summary: On-line learning with limited feedback
Jean-yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko and Csaba Szepesvári
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Compressed Least-Squares Regression
Odalric Maillard and Rémi Munos
Advances in Neural Information Processing Systems 22, 2009


Sensitivity analysis in HMMs with application to likelihood maximization
Pierre-arnaud Coquelin, Romain Deguest and Rémi Munos
Advances in Neural Information Processing Systems 22, 2009


Finite-Time Bounds for Fitted Value Iteration
Rémi Munos and Csaba Szepesvári
Journal of Machine Learning Research, 2008


Particle Filter-based Policy Gradient in POMDPs
Pierre-arnaud Coquelin, Romain Deguest and Rémi Munos
Advances in Neural Information Processing Systems 21, 2008


Online Optimization in X-Armed Bandits
Sébastien Bubeck, Gilles Stoltz, Csaba Szepesvári and Rémi Munos
Advances in Neural Information Processing Systems 21, 2008


Algorithms for Infinitely Many-Armed Bandits
Yizao Wang, Jean-yves Audibert and Rémi Munos
Advances in Neural Information Processing Systems 21, 2008


Fitted Q-iteration in continuous action-space MDPs
András Antos, Csaba Szepesvári and Rémi Munos
Advances in Neural Information Processing Systems 20, 2007


Policy Gradient in Continuous Time
Rémi Munos
Journal of Machine Learning Research, 2006


Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation
Rémi Munos
Journal of Machine Learning Research, 2006


Finite time bounds for sampling based fitted value iteration
Csaba Szepesvári and Rémi Munos
Proceedings of the 22nd International Conference on Machine Learning (ICML-05), 2005


Error Bounds for Approximate Policy Iteration
Rémi Munos
Proceedings of the 20th International Conference on Machine Learning (ICML-03), 2003