Toshinori Kitamura
Toshinori Kitamura
Home
Experience
Posts
Projects
Teaching
Publications
CV
Light
Dark
Automatic
1
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura
,
Tadashi Kozuno
,
Wataru Kumagai
,
Kenta Hoshino
,
Yohei Hosoe
,
Kazumi Kasaura
,
Masashi Hamaya
,
Paavo Parmas
,
Yutaka Matsuo
Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation
Add the full text or supplementary notes for the publication here using Markdown formatting.
Toshinori Kitamura
,
Arnob Ghosh
,
Tadashi Kozuno
,
Wataru Kumagai
,
Kazumi Kasaura
,
Kenta Hoshino
,
Yohei Hosoe
,
Yutaka Matsuo
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
,
Tadashi Kozuno
,
Masahiro Kato
,
Yuki Ichihara
,
Soichiro Nishimori
,
Akiyoshi Sannai
,
Sho Sonoda
,
Wataru Kumagai
,
Yutaka Matsuo
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
,
Tadashi Kozuno
,
Yunhao Tang
,
Nino Vieillard
,
Michal Valko
,
Wenhao Yang
,
Jincheng Mei
,
Pierre Ménard
,
Mohammad Gheshlaghi Azar
,
Rémi Munos
,
others
Cautious Actor-Critic
Lingwei Zhu
,
Toshinori Kitamura
,
Matsubara Takamitsu
Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning
Toshinori Kitamura
,
Lingwei Zhu
,
Takamitsu Matsubara
Cite
×