Weighted importance sampling for off-policy learning with linear function approximation
Authors: A. R. Mahmood, Hado van Hasselt, and Richard S. Sutton
Conference: Advances in Neural Information Processing Systems 27