Search Machine Learning Repository:
All publications by Paul Wagner
authors venues years



Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result
Paul Wagner
Advances in Neural Information Processing Systems 26, 2013


A reinterpretation of the policy oscillation phenomenon in approximate policy iteration
Paul Wagner
Advances in Neural Information Processing Systems 24, 2011