Average cost Markov decision processes: Optimality conditions

From MaRDI portal

Publication:1176301

Jump to:navigation, search

DOI10.1016/0022-247X(91)90244-TzbMath0739.90072MaRDI QIDQ1176301

Jean-Bernard Lasserre, Jean Claude Hennet, Onésimo Hernández-Lerma

Publication date: 25 June 1992

Published in: Journal of Mathematical Analysis and Applications (Search for Journal in Brave)

zbMATH Keywords

duality theorem opportunity cost ergodicity conditions strong average optimality long run expected average cost criterion

Mathematics Subject Classification ID

Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40)

Related Items

A partial history of the early development of continuous-time nonlinear stochastic systems theory ⋮ Recurrence conditions for Markov decision processes with Borel state space: A survey ⋮ Value iteration in average cost Markov control processes on Borel spaces ⋮ The LP approach in average reward MDPs with multiple cost constraints: The countable state case ⋮ On strong average optimality of Markov decision processes with unbounded costs ⋮ Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes ⋮ Numerical comparison of controls and verification of optimality for stochastic control problems

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1176301&oldid=12010194"