Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming

Publication:2884305

DOI: 10.1287/moor.1110.0532 · zbMath: 1243.90231 · OpenAlex: W2148864095 · MaRDI QID: Q2884305

Huizhen Yu, Dimitri P. Bertsekas

Publication date: 24 May 2012

Published in: Mathematics of Operations Research

Full work available at URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.294.8483



