Singularly perturbed Markov control problem: Limiting average cost (Q1174700)

From MaRDI portal

scientific article

Language	Label	Description	Also known as
English	Singularly perturbed Markov control problem: Limiting average cost	scientific article

Statements

scholarly article

Singularly perturbed Markov control problem: Limiting average cost (English)

Tomasz R. Bielecki

Annals of Operations Research

publication date

25 June 1992

The paper deals with a perturbed Markov decision process with discrete time and finite state and action sets. The criterion is the average reward per unit time. The authors assume that the state space can be decomposed into several non-overlapping subsets, each of which is an ergodic class for every Markov chain generated by a stationary strategy. The perturbation vectors depend on states and actions. It is assumed that, if the perturbation parameter is small, every perturbed Markov chain generated by a stationary strategy is irreducible. The authors prove that an optimal solution to the limit perturbed problem, obtained as the perturbation parameter tends to 0, can be approximated by an optimal solution to the perturbed problem when the parameter is small. They formulate a nonlinear program in the space of limit state-action frequencies. The solution of this program determines an optimal limit strategy for the perturbed process.
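
The setting described in the review can be illustrated with a small numerical sketch. It is not the authors' construction and does not implement their nonlinear program; it only shows, for one fixed stationary strategy, how the stationary distribution of a perturbed chain behaves as the perturbation parameter tends to 0. The transition matrix `P0`, the perturbation `D`, and all function names are hypothetical values chosen for illustration. The unperturbed chain has two ergodic classes, the perturbed chain `P0 + eps * D` is irreducible for small `eps > 0`, and the limit of its stationary distribution is a mixture of the class-wise stationary distributions, with weights taken from a two-state aggregated chain (a standard aggregation argument, assumed here rather than taken from the paper).

```python
from fractions import Fraction as F

def stationary(P):
    """Solve pi P = pi, sum(pi) = 1 exactly by Gauss-Jordan elimination."""
    n = len(P)
    # Row i encodes sum_j P[j][i] * pi_j - pi_i = 0; the last column is the right-hand side.
    A = [[P[j][i] - (1 if i == j else 0) for j in range(n)] + [F(0)] for i in range(n)]
    A[-1] = [F(1)] * n + [F(1)]  # replace one balance equation by the normalisation
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c] != 0)  # pivot row
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [x / piv for x in A[c]]
        for r in range(n):
            if r != c:
                f = A[r][c]
                A[r] = [x - f * y for x, y in zip(A[r], A[c])]
    return [A[i][n] for i in range(n)]

# Hypothetical unperturbed chain with two ergodic classes {0, 1} and {2, 3}.
P0 = [[F(1, 2), F(1, 2), F(0), F(0)],
      [F(1, 4), F(3, 4), F(0), F(0)],
      [F(0), F(0), F(1, 2), F(1, 2)],
      [F(0), F(0), F(1, 2), F(1, 2)]]
# Hypothetical perturbation: rows sum to zero and link the two classes.
D = [[F(-1), F(0), F(1), F(0)],
     [F(0), F(0), F(0), F(0)],
     [F(0), F(0), F(0), F(0)],
     [F(0), F(2), F(0), F(-2)]]
CLASSES = [[0, 1], [2, 3]]

def perturbed(eps):
    """Transition matrix P0 + eps * D (stochastic here for 0 <= eps <= 1/4)."""
    return [[p + eps * d for p, d in zip(rp, rd)] for rp, rd in zip(P0, D)]

def predicted_limit():
    """Limit of the stationary distribution as eps -> 0, via aggregation."""
    # Stationary distribution of each ergodic class of the unperturbed chain.
    mu = [stationary([[P0[i][j] for j in c] for i in c]) for c in CLASSES]

    def rate(a, b):
        # Aggregated first-order rate of moving from class a to class b.
        return sum(mu[a][k] * D[i][j]
                   for k, i in enumerate(CLASSES[a]) for j in CLASSES[b])

    r01, r10 = rate(0, 1), rate(1, 0)
    w = [r10 / (r01 + r10), r01 / (r01 + r10)]  # stationary law of the 2-class chain
    return [w[a] * m for a in range(2) for m in mu[a]]

if __name__ == "__main__":
    for eps in (F(1, 10), F(1, 100), F(1, 1000)):
        print(float(eps), [round(float(x), 4) for x in stationary(perturbed(eps))])
    print("limit", [float(x) for x in predicted_limit()])
```

At `eps = 0` the chain has infinitely many stationary distributions (any mixture of the two class-wise ones), which is why the limit as `eps` tends to 0 has to be identified separately rather than read off from the unperturbed chain.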

zbMATH Keywords

perturbed Markov decision process

discrete time

finite state and action sets

average rewards

optimal limit strategy

Eugene A. Feinberg

MaRDI profile type

MaRDI publication profile

Discrete Dynamic Programming

Hierarchical aggregation of linear systems with multiple time scales

A Reduction Process for Perturbed Markov Chains

Optimal control of Markov chains admitting strong and weak interactions

Perturbation theory for unbounded Markov reward processes with applications to queueing

Perturbation theory for Markov reward processes with applications to queueing systems

Applications of Singular Perturbation Techniques to Control Problems

A singular perturbation approach to modeling and control of Markov chains

Multiple time scale decomposition of discrete time Markov chains

Perturbation theory and finite Markov chains

full work available at URL

https://doi.org/10.1007/bf02055579

Identifiers

zbMATH Open document ID

DOI

10.1007/BF02055579

Mathematics Subject Classification ID

zbMATH DE Number
