Finite state approximation algorithms for average cost denumerable state Markov decision processes (Q1066819)

From MaRDI portal
Revision as of 10:08, 30 July 2024 by Openalex240730090724 (talk | contribs) (Set OpenAlex properties.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
Finite state approximation algorithms for average cost denumerable state Markov decision processes
scientific article

    Statements

    Finite state approximation algorithms for average cost denumerable state Markov decision processes (English)
    0 references
    0 references
    0 references
    1985
    0 references
    This paper describes six algorithms for finding approximate solutions to infinite state space Markov decision processes under the average cost criteria. Three of the algorithms are variants on value iteration and three are variants on policy iteration. The convergence of these algorithms is guaranteed by a scrambling-type recurrency condition, (which ensures the average cost is independent of the starting state), and ''tail'' conditions (which allows states in the tail to be ignored). Computational results on the various algorithms are given.
    0 references
    approximate solutions
    0 references
    infinite state space Markov decision processes
    0 references
    average cost criteria
    0 references
    value iteration
    0 references
    policy iteration
    0 references
    convergence
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers