Search Machine Learning Repository:
All publications by Mohammad Ghavamzadeh
authors venues years



High Confidence Policy Improvement
Philip Thomas, Georgios Theocharous and Mohammad Ghavamzadeh
Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2015


Policy Gradient for Coherent Risk Measures
Aviv Tamar, Yinlam Chow, Mohammad Ghavamzadeh and Shie Mannor
Advances in Neural Information Processing Systems 28, 2015


Algorithms for CVaR Optimization in MDPs
Yinlam Chow and Mohammad Ghavamzadeh
Advances in Neural Information Processing Systems 27, 2014


Cost-sensitive Multiclass Classification Risk Bounds
Bernardo V. Pires, Csaba Szepesvari and Mohammad Ghavamzadeh
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


Approximate Dynamic Programming Finally Performs Well in the Game of Tetris
Victor Gabillon, Mohammad Ghavamzadeh and Bruno Scherrer
Advances in Neural Information Processing Systems 26, 2013


Actor-Critic Algorithms for Risk-Sensitive MDPs
Prashanth L.a. and Mohammad Ghavamzadeh
Advances in Neural Information Processing Systems 26, 2013


A Generalized Kernel Approach to Structured Output Learning
Hachem Kadri, Mohammad Ghavamzadeh and Philippe Preux
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence
Victor Gabillon, Mohammad Ghavamzadeh and Alessandro Lazaric
Advances in Neural Information Processing Systems 25, 2012


Approximate Modified Policy Iteration
Bruno Scherrer, Mohammad Ghavamzadeh, Victor Gabillon and Matthieu Geist
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


A Dantzig Selector Approach to Temporal Difference Learning
Matthieu Geist, Bruno Scherrer, Alessandro Lazaric and Mohammad Ghavamzadeh
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


Speedy Q-Learning
Mohammad Ghavamzadeh, Hilbert J. Kappen, Mohammad G. Azar and Rémi Munos
Advances in Neural Information Processing Systems 24, 2011


Multi-Bandit Best Arm Identification
Victor Gabillon, Mohammad Ghavamzadeh, Alessandro Lazaric and Sébastien Bubeck
Advances in Neural Information Processing Systems 24, 2011


Finite-Sample Analysis of Lasso-TD
Mohammad Ghavamzadeh, Alessandro Lazaric, Matthew Hoffman and Rémi Munos
Proceedings of the 28th International Conference on Machine Learning (ICML-11), 2011


Classification-based Policy Iteration with a Critic
Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh and Bruno Scherrer
Proceedings of the 28th International Conference on Machine Learning (ICML-11), 2011


Finite-Sample Analysis of LSTD
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Analysis of a Classification-based Policy Iteration Algorithm
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Bayesian Multi-Task Reinforcement Learning
Alessandro Lazaric and Mohammad Ghavamzadeh
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


LSTD with Random Projections
Mohammad Ghavamzadeh, Alessandro Lazaric, Odalric Maillard and Rémi Munos
Advances in Neural Information Processing Systems 23, 2010


Regularized Policy Iteration
Amir M. Farahmand, Mohammad Ghavamzadeh, Shie Mannor and Csaba Szepesvári
Advances in Neural Information Processing Systems 21, 2008


Bayesian actor-critic algorithms
Mohammad Ghavamzadeh and Yaakov Engel
Proceedings of the 24th International Conference on Machine Learning (ICML-07), 2007


Incremental Natural Actor-Critic Algorithms
Shalabh Bhatnagar, Mohammad Ghavamzadeh, Mark Lee and Richard S. Sutton
Advances in Neural Information Processing Systems 20, 2007


Hierarchical Average Reward Reinforcement Learning
Mohammad Ghavamzadeh and Sridhar Mahadevan
Journal of Machine Learning Research, 2007


Bayesian Policy Gradient Algorithms
Mohammad Ghavamzadeh and Yaakov Engel
Advances in Neural Information Processing Systems 19, 2006


Hierarchical Policy Gradient Algorithms
Mohammad Ghavamzadeh and Sridhar Mahadevan
Proceedings of the 20th International Conference on Machine Learning (ICML-03), 2003