A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation

From MaRDI portal
Publication:5003727

DOI10.1287/opre.2020.2024zbMath1472.90150arXiv1806.02450OpenAlexW2963616027MaRDI QIDQ5003727

Daniel J. Russo, Raghav Singal, Jalaj Bhandari

Publication date: 29 July 2021

Published in: Operations Research (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1806.02450



Related Items



Cites Work