Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning

From MaRDI portal

Revision as of 16:14, 25 April 2024 by Import240425040427 (talk | contribs) (Created automatically from import240425040427)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:6039766

Jump to:navigation, search

DOI10.1093/imaiai/iaac034zbMath1522.68473arXiv2110.04645OpenAlexW3206149081MaRDI QIDQ6039766

Yuejie Chi, Laixi Shi, Yuxin Chen, Unnamed Author

Publication date: 23 May 2023

Published in: Information and Inference: A Journal of the IMA (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2110.04645

zbMATH Keywords

variance reduction Q-learning upper confidence bounds lower confidence bounds memory efficiency model-free RL

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40) Online algorithms; streaming algorithms (68W27) Computational aspects of data analysis and big data (68T09)

This page was built for publication: Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:6039766&oldid=33878199"