Search Machine Learning Repository:
All publications by Hado Hasselt
authors venues years



Weighted importance sampling for off-policy learning with linear function approximation
A. R. Mahmood, Hado Hasselt and Richard S. Sutton
Advances in Neural Information Processing Systems 27, 2014