Constrained continuous-time Markov decision processes with average criteria (Q2483010)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Constrained continuous-time Markov decision processes with average criteria |
scientific article; zbMATH DE number 5270715
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Constrained continuous-time Markov decision processes with average criteria |
scientific article; zbMATH DE number 5270715 |
Statements
Constrained continuous-time Markov decision processes with average criteria (English)
0 references
5 May 2008
0 references
Constrained continuous-time Markov decision processes with a denumerable state space and unbounded reward/cost and transition rates are studied. The criterion to be maximized is the expected average reward, and a constraint is imposed on an expected average cost. The authors give suitable conditions that ensure the existence of a constrained-optimal policy. Moreover, they show that the constrained-optimal policy randomizes between two stationary policies differing in at most one state. A controlled queueing system is used to illustrate the obtained conditions.
0 references
continuous-time Markov decision process
0 references
unbounded reward/cost and transition rates
0 references
average criteria
0 references
constrained-optimal policy
0 references
0 references
0 references
0 references
0 references
0.94468766
0 references
0.94035214
0 references
0.93646955
0 references
0.93560874
0 references
0.9318974
0 references
0.9308855
0 references
0.9305469
0 references
0.9302256
0 references