A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Reinforcement Learning

Toshinori Kitamura
Ph.D. student
My research interests include reinforcement learning theory and deep learning.

Related
Japanese Translation of "CMPUT 653 Theoretical Foundations of Reinforcement Learning"
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
PyTorch-RL-IL (rlil)
ShinRL
Variance Weighted MDVI