A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Toshinori Kitamura
Ph.D. student

My research interests include reinforcement learning theory and deep learning.
