Provably efficient offline reinforcement learning with trajectory-wise reward
From MaRDI portal
Publication:6670141
DOI10.1109/TIT.2024.3427141MaRDI QIDQ6670141FDOQ6670141
Authors: Tengyu Xu, Yue Wang, Shaofeng Zou, Yingbin Liang
Publication date: 23 January 2025
Published in: IEEE Transactions on Information Theory (Search for Journal in Brave)
This page was built for publication: Provably efficient offline reinforcement learning with trajectory-wise reward
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6670141)