Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Authors: Ronald Ortner and Daniil Ryabko
Conference: Advances in Neural Information Processing Systems 25