Reinforcement Learning, Bit by Bit

DOI10.1561/2200000097MaRDI QIDQ6139546zbMATH OpenOpenAlexFDO

Authors Benjamin Van Roy, Vikranth R. Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen

Publication date 19 December 2023

Published in Foundations and Trends® in Machine Learning (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/2103.04047

Learning and adaptive systems in artificial intelligence (68T05) Research exposition (monographs, survey articles) pertaining to computer science (68-02)

Abstract: Reinforcement learning agents have demonstrated remarkable achievements in simulated environments. Data efficiency poses an impediment to carrying this success over to real environments. The design of data-efficient agents calls for a deeper understanding of information acquisition and representation. We discuss concepts and regret analysis that together offer principled guidance. This line of thinking sheds light on questions of what information to seek, how to seek that information, and what information to retain. To illustrate concepts, we design simple agents that build on them and present computational results that highlight data efficiency.

Recommendations

Cites work

Cited in

(6)

This page was built for publication: Reinforcement Learning, Bit by Bit

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6139546)