Toshinori Kitamura
Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning
Lingwei Zhu, Toshinori Kitamura, Takamitsu Matsubara
January 2021
Type: Conference paper
Publication: arXiv preprint arXiv:2107.05798