A new condition for the existence of optimal stationary policies in average cost Markov decision processes (Q1076617)
From MaRDI portal
Latest revision as of 13:32, 17 June 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | A new condition for the existence of optimal stationary policies in average cost Markov decision processes | scientific article | |
Statements
A new condition for the existence of optimal stationary policies in average cost Markov decision processes (English)
1986
The author considers discrete-time, countable-state Markov decision processes with finite decision sets and bounded costs. He obtains conditions under which a (possibly unbounded) solution to the average cost equation for the optimal value exists and yields an optimal stationary policy. In the special case in which every stationary policy induces an ergodic Markov chain, he obtains a new form of the optimality equation and gives a sufficient condition for the existence of an optimal stationary policy. The results are illustrated by examples.
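For orientation, the average cost optimality equation is usually written as below. This is the standard textbook statement, not necessarily the form or notation used in the paper. The symbols $S$ (countable state space), $A(i)$ (finite decision set in state $i$), $c(i,a)$ (bounded one-step cost), $p_{ij}(a)$ (transition probability), $g$ (optimal average cost) and $h$ (relative value function) are assumptions introduced here for illustration.

```latex
% Standard average cost optimality equation (notation assumed, not taken from the paper)
g + h(i) \;=\; \min_{a \in A(i)} \Bigl\{ c(i,a) + \sum_{j \in S} p_{ij}(a)\, h(j) \Bigr\},
\qquad i \in S .
```

A stationary policy that selects a minimizing action in each state is the natural candidate for an optimal policy. When $h$ is unbounded, that conclusion needs additional conditions of the kind the review describes.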
stochastic dynamic programming
discrete time, countable state Markov decision processes
finite decision sets
bounded costs
optimal stationary policy
ergodic Markov chain
optimality equation