Search Machine Learning Repository: Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
Authors: Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Conference: Advances in Neural Information Processing Systems 22
Year: 2009
Pages: 1204--1212
[pdf] [BibTeX]

authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).