Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence

From MaRDI portal
Revision as of 06:48, 10 July 2024 by Import240710060729 (talk | contribs) (Created automatically from import240710060729)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:6161312

DOI10.1137/21M1456789arXiv2105.11066MaRDI QIDQ6161312

Jason D. Lee, Yuxin Chen, Yuejie Chi, Shicong Cen, Unnamed Author, Unnamed Author

Publication date: 27 June 2023

Published in: SIAM Journal on Optimization (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2105.11066






Related Items (5)




Cites Work




This page was built for publication: Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence