Reinforcement Learning, Bit by Bit

Publication: 6139546

DOI: 10.1561/2200000097
zbMATH Open: 1525.68120
arXiv: 2103.04047
OpenAlex: W4383982036
MaRDI QID: Q6139546
FDO: Q6139546


Authors: Benjamin Van Roy, Vikranth R. Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen


Publication date: 19 December 2023

Published in: Foundations and Trends® in Machine Learning

Abstract: Reinforcement learning agents have demonstrated remarkable achievements in simulated environments. Data efficiency poses an impediment to carrying this success over to real environments. The design of data-efficient agents calls for a deeper understanding of information acquisition and representation. We discuss concepts and regret analysis that together offer principled guidance. This line of thinking sheds light on questions of what information to seek, how to seek that information, and what information to retain. To illustrate concepts, we design simple agents that build on them and present computational results that highlight data efficiency.


Full work available at URL: https://arxiv.org/abs/2103.04047






Cited In (3)




