Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation

Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno, Wataru Kumagai, Kazumi Kasaura, Kenta Hoshino, Yohei Hosoe, Yutaka Matsuo

January 2025

Type

Publication

Neural Information Processing Systems (NeuralIPS) Spotlight

Add the full text or supplementary notes for the publication here using Markdown formatting.