Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling

From MaRDI portal
Publication:2051259

DOI10.1007/s10994-020-05912-5OpenAlexW3118861484MaRDI QIDQ2051259

Nathaniel Korda, L. A. Prashanth, Rémi Munos

Publication date: 24 November 2021

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1306.2557



Related Items



Cites Work