Least squares policy evaluation algorithms with linear function approximation
Publication:1870310
DOI: 10.1023/A:1022192903948, zbMath: 1030.93061, MaRDI QID: Q1870310
Dimitri P. Bertsekas, Angelia Nedić
Publication date: 11 May 2003
Published in: Discrete Event Dynamic Systems
simulation; martingale; convergence results; temporal difference; stepsize; linear function approximation; least-square methods; \(\text{LSTD}(\lambda)\) algorithm; discrete-time stationary Markov chain; infinite-horizon dynamic programming; policy evaluation algorithms
Discrete-time control/observation systems (93C55); Least squares and related methods for stochastic control systems (93E24); Markov chains (discrete-time Markov processes on discrete state spaces) (60J10); Linearizations (93B18); Optimal stochastic control (93E20)