Recursive adaptive control of Markov decision processes with the average reward criterion

From MaRDI portal

Publication:2276895

Jump to:navigation, search

DOI10.1007/BF01442397zbMath0723.90085OpenAlexW2093026711MaRDI QIDQ2276895

Rolando Cavazos-Cadena, Onésimo Hernández-Lerma

Publication date: 1991

Published in: Applied Mathematics and Optimization (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/bf01442397

zbMATH Keywords

Borel state and action spaces additive-noise systems average return criterion recursive adaptive nonstationary value iteration policy unknown noise distribution

Mathematics Subject Classification ID

Stochastic learning and adaptive control (93E35) Markov and semi-Markov decision processes (90C40)

Related Items (2)

Recurrence conditions for Markov decision processes with Borel state space: A survey ⋮ Recursive adaptive control of Markov decision processes with the average reward criterion

Cites Work

This page was built for publication: Recursive adaptive control of Markov decision processes with the average reward criterion

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2276895&oldid=14839275"