Search Machine Learning Repository:
All publications by Hamid R. Maei
authors venues years



Toward Off-Policy Learning Control with Function Approximation
Hamid R. Maei, Csaba Szepesvári, Shalabh Bhatnagar and Richard S. Sutton
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Fast gradient-descent methods for temporal-difference learning with linear function approximation
Richard S. Sutton, Hamid R. Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári and Eric Wiewiora
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


A Convergent $O(n)$ Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation
Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 21, 2008