Search Machine Learning Repository: Weighted importance sampling for off-policy learning with linear function approximation
Authors: A. R. Mahmood, Hado Hasselt and Richard S. Sutton
Conference: Advances in Neural Information Processing Systems 27
Year: 2014
Pages: 3014--3022
[pdf] [BibTeX]

authors venues years
Suggest Changes to this paper.
Brought to you by the WUSTL Machine Learning Group. We have open faculty positions (tenured and tenure-track).