Analytical mean squared error curves for temporal difference learning
Publication:1266172
DOI: 10.1023/A:1007495401240
zbMath: 0901.68168
OpenAlex: W1716849269
MaRDI QID: Q1266172
Peter Dayan, Satinder Pal Singh
Publication date: 7 September 1998
Published in: Machine Learning
Full work available at URL: https://doi.org/10.1023/a:1007495401240
Related Items (4)
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
The optimal unbiased value estimator and its relation to LSTD, TD and MC
Temporal-difference search in Computer Go
Q(λ) with Off-Policy Corrections