Search Machine Learning Repository:
All publications by David Silver
authors venues years



Universal Value Function Approximators
Tom Schaul, Daniel Horgan, Karol Gregor and David Silver
Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2015


Fictitious Self-Play in Extensive-Form Games
Johannes Heinrich, Marc Lanctot and David Silver
Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2015


Learning Continuous Control Policies by Stochastic Value Gradients
Nicolas Heess, Gregory Wayne, David Silver, Tim Lillicrap, Tom Erez and Yuval Tassa
Advances in Neural Information Processing Systems 28, 2015


Deterministic Policy Gradient Algorithms
David Silver, Guy Lever, Nicolas Heess, Thomas Degris, Daan Wierstra and Martin Riedmiller
Proceedings of the 31st International Conference on Machine Learning (ICML-14), 2014


Bayes-Adaptive Simulation-based Search with Value Function Approximation
Arthur Guez, Nicolas Heess, David Silver and Peter Dayan
Advances in Neural Information Processing Systems 27, 2014


Concurrent Reinforcement Learning from Customer Interactions
David Silver, Leonard Newnham, David Barker, Suzanne Weller and Jason Mcfall
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search
Arthur Guez, David Silver and Peter Dayan
Advances in Neural Information Processing Systems 25, 2012


Compositional Planning Using Optimal Option Models
David Silver and Kamil Ciosek
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


Monte-Carlo Planning in Large POMDPs
David Silver and Joel Veness
Advances in Neural Information Processing Systems 23, 2010


Fast gradient-descent methods for temporal-difference learning with linear function approximation
Richard S. Sutton, Hamid R. Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári and Eric Wiewiora
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Monte-Carlo simulation balancing
David Silver and Gerald Tesauro
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


Bootstrapping from Game Tree Search
Joel Veness, David Silver, Alan Blair and William W. Cohen
Advances in Neural Information Processing Systems 22, 2009


Sample-based learning and search with permanent and transient memories
David Silver, Richard S. Sutton and Martin Müller
Proceedings of the 25th International Conference on Machine Learning (ICML-08), 2008


On the role of tracking in stationary environments
Richard S. Sutton, Anna Koop and David Silver
Proceedings of the 24th International Conference on Machine Learning (ICML-07), 2007


Combining online and offline knowledge in UCT
Sylvain Gelly and David Silver
Proceedings of the 24th International Conference on Machine Learning (ICML-07), 2007