Search Machine Learning Repository:
All publications by Richard S. Sutton
authors venues years



Universal Option Models
Hengshuai Yao, Csaba Szepesvari, Richard S. Sutton, Joseph Modayil and Shalabh Bhatnagar
Advances in Neural Information Processing Systems 27, 2014


Weighted importance sampling for off-policy learning with linear function approximation
A. R. Mahmood, Hado Hasselt and Richard S. Sutton
Advances in Neural Information Processing Systems 27, 2014


Off-Policy Actor-Critic
Thomas Degris, Martha White and Richard S. Sutton
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


Toward Off-Policy Learning Control with Function Approximation
Hamid R. Maei, Csaba Szepesvári, Shalabh Bhatnagar and Richard S. Sutton
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Fast gradient-descent methods for temporal-difference learning with linear function approximation
Richard S. Sutton, Hamid R. Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári and Eric Wiewiora
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


Multi-Step Dyna Planning for Policy Evaluation and Control
Hengshuai Yao, Shalabh Bhatnagar, Dongcui Diao, Richard S. Sutton and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


Sample-based learning and search with permanent and transient memories
David Silver, Richard S. Sutton and Martin Müller
Proceedings of the 25th International Conference on Machine Learning (ICML-08), 2008


A Convergent $O(n)$ Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation
Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 21, 2008


A computational model of hippocampal function in trace conditioning
Elliot A. Ludvig, Richard S. Sutton, Eric Verbeek and E. J. Kehoe
Advances in Neural Information Processing Systems 21, 2008


On the role of tracking in stationary environments
Richard S. Sutton, Anna Koop and David Silver
Proceedings of the 24th International Conference on Machine Learning (ICML-07), 2007


Incremental Natural Actor-Critic Algorithms
Shalabh Bhatnagar, Mohammad Ghavamzadeh, Mark Lee and Richard S. Sutton
Advances in Neural Information Processing Systems 20, 2007


iLSTD: Eligibility Traces and Convergence Analysis
Alborz Geramifard, Michael Bowling, Martin Zinkevich and Richard S. Sutton
Advances in Neural Information Processing Systems 19, 2006


TD(lambda) networks: temporal-difference networks with eligibility traces
Brian Tanner and Richard S. Sutton
Proceedings of the 22nd International Conference on Machine Learning (ICML-05), 2005


Off-policy Learning with Options and Recognizers
Doina Precup, Cosmin Paduraru, Anna Koop, Richard S. Sutton and Satinder P. Singh
Advances in Neural Information Processing Systems 18, 2005


Temporal Abstraction in Temporal-difference Networks
Eddie Rafols, Anna Koop and Richard S. Sutton
Advances in Neural Information Processing Systems 18, 2005


Temporal-Difference Networks
Richard S. Sutton and Brian Tanner
Advances in Neural Information Processing Systems 17, 2004