Continuous-time controlled Markov chains. (Q1872339): Difference between revisions
From MaRDI portal
ReferenceBot (talk | contribs) Changed an Item |
Normalize DOI. |
||
Property / DOI | |||
Property / DOI: 10.1214/aoap/1042765671 / rank | |||
Property / DOI | |||
Property / DOI: 10.1214/AOAP/1042765671 / rank | |||
Normal rank |
Latest revision as of 10:54, 16 December 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Continuous-time controlled Markov chains. |
scientific article |
Statements
Continuous-time controlled Markov chains. (English)
0 references
6 May 2003
0 references
The authors consider continuous-time Markov decision processes with a denumerable state space and Borel action space where the cost and the transition rates are allowed to be unbounded. In the class of deterministic stationary policies they first give very weak conditions under which the existence of \(\varepsilon\)-optimal \((\varepsilon\geq 0)\) policies is proved using the construction of a minimum \(Q\)-process. In the class of randomized Markov policies they secondly present a new necessary and sufficient condition for a nonhomogeneous \(Q\)-process to be regular and under this condition the existence of \(\varepsilon\)-optimal stationary policies is proved. In the case of nonregular \(Q\)-process, they further use Kolmogorov's forward equation for a nonhomogeneous minimum \(Q\)-process to prove the existence of \(\varepsilon\)-optimal stationary policies.
0 references
continuous-time
0 references
Markov decision process
0 references
optimal stationary policies
0 references
controlled \(Q\)-process
0 references
0 references
0 references