Singulary perturbed Markov control problem: Limiting average cost (Q1174700)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Singulary perturbed Markov control problem: Limiting average cost
scientific article

    Statements

    Singulary perturbed Markov control problem: Limiting average cost (English)
    0 references
    0 references
    0 references
    25 June 1992
    0 references
    The paper deals with a perturbed Markov decision process with discrete time and finite state and action sets. The criterion is average rewards per unit time. The authors assume that the space may be decomposed into several non-overlapping subsets and each of these subsets is an ergodic class for all Markov chains generated by stationary strategies. The perturbation vectors depend on states and actions. It is assumed that, if a perturbation parameter is small, any perturbed Markov chain, generated by a stationary strategy, is irreducible. The authors prove that an optimal solution to the limit perturbed problem, when the perturbation parameter tends to 0, can be approximated by an optimal solution to the perturbed problem when the parameter is small. They formulate a nonlinear program in the space of limit state-action frequencies. The solution of this problem determines an optimal limit strategy for a perturbed process.
    0 references
    perturbed Markov decision process
    0 references
    discrete time
    0 references
    finite state and action sets
    0 references
    average rewards
    0 references
    optimal limit strategy
    0 references

    Identifiers