All publications by Csaba Szepesvári



Deterministic Independent Component Analysis
Ruitong Huang, András György and Csaba Szepesvári
Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2015


Characterizing the Representer Theorem
Yaoliang Yu, Hao Cheng, Dale Schuurmans and Csaba Szepesvári
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning
Arash Afkanpour, András György, Michael Bowling and Csaba Szepesvári
Proceedings of the 30th International Conference on Machine Learning (ICML-13), 2013


Deep Representations and Codes for Image Auto-Annotation
Ryan Kiros and Csaba Szepesvári
Advances in Neural Information Processing Systems 25, 2012


Analysis of Kernel Mean Matching under Covariate Shift
Yaoliang Yu and Csaba Szepesvári
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


An adaptive algorithm for finite stochastic partial monitoring
Gábor Bartók, Navid Zolghadr and Csaba Szepesvári
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


Statistical linear estimation with penalized estimators: an application to reinforcement learning
Bernardo A. Pires and Csaba Szepesvári
Proceedings of the 29th International Conference on Machine Learning (ICML-12), 2012


The adversarial stochastic shortest path problem with unknown transition probabilities
Gergely Neu, András György and Csaba Szepesvári
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics (AISTATS-12), 2012


Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits
Yasin Abbasi-Yadkori, Dávid Pál and Csaba Szepesvári
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics (AISTATS-12), 2012


Improved Algorithms for Linear Stochastic Bandits
Yasin Abbasi-Yadkori, Dávid Pál and Csaba Szepesvári
Advances in Neural Information Processing Systems 24, 2011


Model-based reinforcement learning with nearly tight exploration complexity bounds
István Szita and Csaba Szepesvári
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Toward Off-Policy Learning Control with Function Approximation
Hamid R. Maei, Csaba Szepesvári, Shalabh Bhatnagar and Richard S. Sutton
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Budgeted Distribution Learning of Belief Net Parameters
Liuyang Li, Barnabás Póczos, Csaba Szepesvári and Russell Greiner
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010


Online Markov Decision Processes under Bandit Feedback
Gergely Neu, András Antos, András György and Csaba Szepesvári
Advances in Neural Information Processing Systems 23, 2010


Error Propagation for Approximate Policy and Value Iteration
Amir-massoud Farahmand, Csaba Szepesvári and Rémi Munos
Advances in Neural Information Processing Systems 23, 2010


Parametric Bandits: The Generalized Linear Case
Sarah Filippi, Olivier Cappé, Aurélien Garivier and Csaba Szepesvári
Advances in Neural Information Processing Systems 23, 2010


Estimation of Rényi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs
Dávid Pál, Barnabás Póczos and Csaba Szepesvári
Advances in Neural Information Processing Systems 23, 2010


A Markov-Chain Monte Carlo Approach to Simultaneous Localization and Mapping
Péter Torma, András György and Csaba Szepesvári
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS-10), 2010


REGO: Rank-based Estimation of Rényi Information using Euclidean Graph Optimization
Barnabás Póczos, Sergey Kirshner and Csaba Szepesvári
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS-10), 2010


Fast gradient-descent methods for temporal-difference learning with linear function approximation
Richard S. Sutton, Hamid R. Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári and Eric Wiewiora
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Learning when to stop thinking and do something!
Barnabás Póczos, Yasin Abbasi-Yadkori, Csaba Szepesvári, Russell Greiner and Nathan R. Sturtevant
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Learning to segment from a few well-selected training images
Alireza Farhangfar, Russell Greiner and Csaba Szepesvári
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Workshop summary: On-line learning with limited feedback
Jean-Yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko and Csaba Szepesvári
Proceedings of the 26th International Conference on Machine Learning (ICML-09), 2009


Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


Multi-Step Dyna Planning for Policy Evaluation and Control
Hengshuai Yao, Shalabh Bhatnagar, Dongcui Diao, Richard S. Sutton and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


A General Projection Property for Distribution Families
Yaoliang Yu, Yuxi Li, Dale Schuurmans and Csaba Szepesvári
Advances in Neural Information Processing Systems 22, 2009


Learning Exercise Policies for American Options
Yuxi Li, Csaba Szepesvári and Dale Schuurmans
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS-09), 2009


Finite-Time Bounds for Fitted Value Iteration
Rémi Munos and Csaba Szepesvári
Journal of Machine Learning Research, 2008


Empirical Bernstein stopping
Volodymyr Mnih, Csaba Szepesvári and Jean-Yves Audibert
Proceedings of the 25th International Conference on Machine Learning (ICML-08), 2008


Regularized Policy Iteration
Amir M. Farahmand, Mohammad Ghavamzadeh, Shie Mannor and Csaba Szepesvári
Advances in Neural Information Processing Systems 21, 2008


Online Optimization in X-Armed Bandits
Sébastien Bubeck, Gilles Stoltz, Csaba Szepesvári and Rémi Munos
Advances in Neural Information Processing Systems 21, 2008


A Convergent $O(n)$ Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation
Richard S. Sutton, Hamid R. Maei and Csaba Szepesvári
Advances in Neural Information Processing Systems 21, 2008


Manifold-adaptive dimension estimation
Amir M. Farahmand, Csaba Szepesvári and Jean-Yves Audibert
Proceedings of the 24th International Conference on Machine Learning (ICML-07), 2007


Fitted Q-iteration in continuous action-space MDPs
András Antos, Csaba Szepesvári and Rémi Munos
Advances in Neural Information Processing Systems 20, 2007


Finite time bounds for sampling based fitted value iteration
Csaba Szepesvári and Rémi Munos
Proceedings of the 22nd International Conference on Machine Learning (ICML-05), 2005


Interpolation-based Q-learning
Csaba Szepesvári and William D. Smart
Proceedings of the 21st International Conference on Machine Learning (ICML-04), 2004