All publications by Shalabh Bhatnagar


Universal Option Models
Hengshuai Yao, Csaba Szepesvári, Richard S. Sutton, Joseph Modayil and Shalabh Bhatnagar
Advances in Neural Information Processing Systems 27, 2014


Toward Off-Policy Learning Control with Function Approximation
Hamid R. Maei, Csaba Szepesvári, Shalabh Bhatnagar and Richard S. Sutton
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Fast gradient-descent methods for temporal-difference learning with linear function approximation
Richard S. Sutton, Hamid R. Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári and Eric Wiewiora
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


Multi-Step Dyna Planning for Policy Evaluation and Control
Hengshuai Yao, Shalabh Bhatnagar, Dongcui Diao, Richard S. Sutton and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


Incremental Natural Actor-Critic Algorithms
Shalabh Bhatnagar, Mohammad Ghavamzadeh, Mark Lee and Richard S. Sutton
Advances in Neural Information Processing Systems 20, 2007


A Simulation-Based Algorithm for Ergodic Control of Markov Chains Conditioned on Rare Events
Shalabh Bhatnagar, Vivek S. Borkar and Madhukar Akarapu
Journal of Machine Learning Research, 2006