Title: TD(0) Leads to Better Policies than Approximate Value Iteration
Authors: Benjamin Van Roy
Conference: Advances in Neural Information Processing Systems 18
Year: 2005
Pages: 1377–1384