Search Machine Learning Repository:
All publications by Pieter Abbeel
authors venues years



Trust Region Policy Optimization
John Schulman, Sergey Levine, Pieter Abbeel, Michael Jordan and Philipp Moritz
Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2015


Alpha-Beta Divergences Discover Micro and Macro Structures in Data
Karthik Narayan, Ali Punjani and Pieter Abbeel
Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2015


Gradient Estimation Using Stochastic Computation Graphs
John Schulman, Nicolas Heess, Theophane Weber and Pieter Abbeel
Advances in Neural Information Processing Systems 28, 2015


Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics
Sergey Levine and Pieter Abbeel
Advances in Neural Information Processing Systems 27, 2014


Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds
Teodor M. Moldovan and Pieter Abbeel
Advances in Neural Information Processing Systems 25, 2012


Safe Exploration in Markov Decision Processes
Teodor M. Moldovan and Pieter Abbeel
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient
Tang Jie and Pieter Abbeel
Advances in Neural Information Processing Systems 23, 2010


Learning for control from multiple demonstrations
Adam Coates, Pieter Abbeel and Andrew Y. Ng
Proceedings of the 25th International Conference on Machine Learning (ICML-08), 2008


Max-margin Classification of Data with Absent Features
Gal Chechik, Geremy Heitz, Gal Elidan, Pieter Abbeel and Daphne Koller
Journal of Machine Learning Research, 2008


Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
J. Z. Kolter, Pieter Abbeel and Andrew Y. Ng
Advances in Neural Information Processing Systems 20, 2007


Using inaccurate models in reinforcement learning
Pieter Abbeel, Morgan Quigley and Andrew Y. Ng
Proceedings of the 23th International Conference on Machine Learning (ICML-06), 2006


An Application of Reinforcement Learning to Aerobatic Helicopter Flight
Pieter Abbeel, Adam Coates, Morgan Quigley and Andrew Y. Ng
Advances in Neural Information Processing Systems 19, 2006


Max-margin classification of incomplete data
Gal Chechik, Geremy Heitz, Gal Elidan, Pieter Abbeel and Daphne Koller
Advances in Neural Information Processing Systems 19, 2006


Learning Factor Graphs in Polynomial Time and Sample Complexity
Pieter Abbeel, Daphne Koller and Andrew Y. Ng
Journal of Machine Learning Research, 2006


Exploration and apprenticeship learning in reinforcement learning
Pieter Abbeel and Andrew Y. Ng
Proceedings of the 22nd International Conference on Machine Learning (ICML-05), 2005


Learning vehicular dynamics, with application to modeling helicopters
Pieter Abbeel, Varun Ganapathi and Andrew Y. Ng
Advances in Neural Information Processing Systems 18, 2005


Apprenticeship learning via inverse reinforcement learning
Pieter Abbeel and Andrew Y. Ng
Proceedings of the 21st International Conference on Machine Learning (ICML-04), 2004


Learning first-order Markov models for control
Pieter Abbeel and Andrew Y. Ng
Advances in Neural Information Processing Systems 17, 2004


Link Prediction in Relational Data
Ben Taskar, Ming-fai Wong, Pieter Abbeel and Daphne Koller
Advances in Neural Information Processing Systems 16, 2003