Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura,
Tadashi Kozuno,
Yunhao Tang,
Nino Vieillard,
Michal Valko,
Wenhao Yang,
Jincheng Mei,
Pierre Ménard,
Mohammad Gheshlaghi Azar,
Rémi Munos,
others
January 2023