Search Machine Learning Repository:
All publications by Alessandro Lazaric
authors venues years



Online Stochastic Optimization under Correlated Bandit Feedback
Mohammad G. Azar, Alessandro Lazaric and Emma Brunskill
Proceedings of the 31st International Conference on Machine Learning (ICML-14), 2014


Best-Arm Identification in Linear Bandits
Marta Soare, Alessandro Lazaric and Remi Munos
Advances in Neural Information Processing Systems 27, 2014


Exploiting easy data in online optimization
Amir Sani, Gergely Neu and Alessandro Lazaric
Advances in Neural Information Processing Systems 27, 2014


Sparse Multi-Task Reinforcement Learning
Daniele Calandriello, Alessandro Lazaric and Marcello Restelli
Advances in Neural Information Processing Systems 27, 2014


Sequential Transfer in Multi-armed Bandit with Finite Set of Models
Mohammad G. Azar, Alessandro Lazaric and Emma Brunskill
Advances in Neural Information Processing Systems 26, 2013


Risk-Aversion in Multi-armed Bandits
Amir Sani, Alessandro Lazaric and Rémi Munos
Advances in Neural Information Processing Systems 25, 2012


Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence
Victor Gabillon, Mohammad Ghavamzadeh and Alessandro Lazaric
Advances in Neural Information Processing Systems 25, 2012


A Dantzig Selector Approach to Temporal Difference Learning
Matthieu Geist, Bruno Scherrer, Alessandro Lazaric and Mohammad Ghavamzadeh
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


Multi-Bandit Best Arm Identification
Victor Gabillon, Mohammad Ghavamzadeh, Alessandro Lazaric and Sébastien Bubeck
Advances in Neural Information Processing Systems 24, 2011


Transfer from Multiple MDPs
Alessandro Lazaric and Marcello Restelli
Advances in Neural Information Processing Systems 24, 2011


Finite-Sample Analysis of Lasso-TD
Mohammad Ghavamzadeh, Alessandro Lazaric, Matthew Hoffman and Rémi Munos
Proceedings of the 28th International Conference on Machine Learning (ICML-11), 2011


Classification-based Policy Iteration with a Critic
Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh and Bruno Scherrer
Proceedings of the 28th International Conference on Machine Learning (ICML-11), 2011


Finite-Sample Analysis of LSTD
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Analysis of a Classification-based Policy Iteration Algorithm
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Bayesian Multi-Task Reinforcement Learning
Alessandro Lazaric and Mohammad Ghavamzadeh
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


LSTD with Random Projections
Mohammad Ghavamzadeh, Alessandro Lazaric, Odalric Maillard and Rémi Munos
Advances in Neural Information Processing Systems 23, 2010


Workshop summary: On-line learning with limited feedback
Jean-yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko and Csaba Szepesvári
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Transfer of samples in batch reinforcement learning
Alessandro Lazaric, Marcello Restelli and Andrea Bonarini
Proceedings of the 25th International Conference on Machine Learning (ICML-08), 2008


Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods
Alessandro Lazaric, Marcello Restelli and Andrea Bonarini
Advances in Neural Information Processing Systems 20, 2007