Hybrid least-squares algorithms for approximate policy evaluation
From MaRDI portal
Publication:1959511
DOI: 10.1007/s10994-009-5128-4
zbMath: 1470.68124
Wikidata: Q115146324
Scholia: Q115146324
MaRDI QID: Q1959511
Authors: Marek Petrik, Jeff Johns, Sridhar Mahadevan
Publication date: 7 October 2010
Published in: Machine Learning
Full work available at URL: https://doi.org/10.1007/s10994-009-5128-4
68T05: Learning and adaptive systems in artificial intelligence
90C40: Markov and semi-Markov decision processes