Search Machine Learning Repository:
TD(0) Leads to Better Policies than Approximate Value Iteration
Authors: Benjamin V. Roy
Conference: Advances in Neural Information Processing Systems 18
authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).