Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation
Toshinori Kitamura,
Arnob Ghosh,
Tadashi Kozuno,
Wataru Kumagai,
Kazumi Kasaura,
Kenta Hoshino,
Yohei Hosoe,
Yutaka Matsuo
January 2025
Publication
Neural Information Processing Systems (NeuralIPS) Spotlight
Add the full text or supplementary notes for the publication here using Markdown formatting.