A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Jan 1, 0001

Go to Project Site

Photo by rawpixel on Unsplash

Reinforcement Learning

Toshinori Kitamura

Postdoc Researcher

My research interest is reinforcement learning theory.