Recursive adaptive control of Markov decision processes with the average reward criterion

From MaRDI portal
Publication:2276895

DOI10.1007/BF01442397zbMath0723.90085OpenAlexW2093026711MaRDI QIDQ2276895

Rolando Cavazos-Cadena, Onésimo Hernández-Lerma

Publication date: 1991

Published in: Applied Mathematics and Optimization (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/bf01442397




Related Items (2)



Cites Work


This page was built for publication: Recursive adaptive control of Markov decision processes with the average reward criterion