Markov control process with the expected total cost criterion: Optimality, stability, and transient models
From MaRDI portal
Publication:1973302
DOI10.1023/A:1006368714127zbMath0964.93086OpenAlexW1920513863MaRDI QIDQ1973302
Guadalupe Carrasco, Rubén Pérez-Hernández, Onésimo Hernández-Lerma
Publication date: 29 August 2000
Published in: Acta Applicandae Mathematicae (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1023/a:1006368714127
dynamic programmingLagrange stabilitypolicy iterationstability with probability oneexpected total costdiscrete-time Markov control processestransient control models
Related Items (5)
Stability Estimation of Transient Markov Decision Processes ⋮ Markov decision processes with iterated coherent risk measures ⋮ An analysis of transient Markov decision processes ⋮ A finite exact algorithm to solve a dice game ⋮ Robust shortest path planning and semicontractive dynamic programming
This page was built for publication: Markov control process with the expected total cost criterion: Optimality, stability, and transient models