Value iteration in a class of average controlled Markov chains with unbounded costs: necessary and sufficient conditions for pointwise convergence
From MaRDI portal
Publication:3122864
DOI10.2307/3214980zbMath0869.93054OpenAlexW4255849103MaRDI QIDQ3122864
Emmanuel Fernández-Gaucherand, Rolando Cavazos-Cadena
Publication date: 5 March 1997
Published in: Journal of Applied Probability (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.2307/3214980
Related Items
Continuous-time controlled Markov chains. ⋮ Approximation of average cost optimal policies for general Markov decision processes with unbounded costs ⋮ A pause control approach to the value iteration scheme in average Markov decision processes ⋮ A note on the convergence rate of the value iteration scheme in controlled Markov chains ⋮ STRONG AVERAGE OPTIMALITY FOR CONTROLLED NONHOMOGENEOUS MARKOV CHAINS*