Projected state-action balancing weights for offline reinforcement learning

From MaRDI portal
Publication:6183753

DOI10.1214/23-aos2302arXiv2109.04640MaRDI QIDQ6183753

Raymond K. W. Wong, Zhengling Qi, Jiayi Wang

Publication date: 4 January 2024

Published in: The Annals of Statistics (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2109.04640






Cites Work