A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Publication
arXiv preprint arXiv:2401.17780