A new condition for the existence of optimal stationary policies in average cost Markov decision processes (Q1076617)

From MaRDI portal
Revision as of 13:32, 17 June 2024 by ReferenceBot (talk | contribs) (‎Changed an Item)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)





scientific article
Language Label Description Also known as
English
A new condition for the existence of optimal stationary policies in average cost Markov decision processes
scientific article

    Statements

    A new condition for the existence of optimal stationary policies in average cost Markov decision processes (English)
    0 references
    0 references
    1986
    0 references
    The author considers a discrete time, countable state Markov decision processes with finite decision sets and bounded costs. He obtains conditions under which (possibly) unbounded solution to the average cost equation for the optimal value exists and yields an optimal stationary policy. In the special case in which every stationary policy induces an ergodic Markov chain, he obtains a new form for the optimality equation and gives a sufficient condition for the existence of an optimal stationary policy. The results are illustrated by some examples.
    0 references
    0 references
    stochastic dynamic programming
    0 references
    discrete time, countable state Markov decision processes
    0 references
    finite decision sets
    0 references
    bounded costs
    0 references
    optimal stationary policy
    0 references
    ergodic Markov chain
    0 references
    optimality equation
    0 references

    Identifiers