Search Machine Learning Repository:
Multi-Step Dyna Planning for Policy Evaluation and Control
Authors: Hengshuai Yao, Shalabh Bhatnagar, Dongcui Diao, Richard S. Sutton and Csaba Szepesvári
Conference: Advances in Neural Information Processing Systems 22
authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).