Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura,
Tadashi Kozuno,
Wataru Kumagai,
Kenta Hoshino,
Yohei Hosoe,
Kazumi Kasaura,
Masashi Hamaya,
Paavo Parmas,
Yutaka Matsuo
January 2024