Search Machine Learning Repository:
Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
Authors: Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Conference: Advances in Neural Information Processing Systems 22
authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).