Toshinori Kitamura,
Tadashi Kozuno,
Yunhao Tang,
Nino Vieillard,
Michal Valko,
Wenhao Yang,
Jincheng Mei,
Pierre Ménard,
Mohammad Gheshlaghi Azar,
Rémi Munos,
et al.
(2023).
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.
International Conference on Machine Learning (ICML).
Tadashi Kozuno,
Wenhao Yang,
Nino Vieillard,
Toshinori Kitamura,
Yunhao Tang,
Jincheng Mei,
Pierre Ménard,
Mohammad Gheshlaghi Azar,
Michal Valko,
Rémi Munos,
et al.
(2022).
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal.
arXiv preprint arXiv:2205.14211.
Lingwei Zhu,
Toshinori Kitamura,
Takamitsu Matsubara
(2021).
Cautious Actor-Critic.
Asian Conference on Machine Learning (ACML).