Continuous time Markov decision programming with average reward criterion and unbounded reward rate (Q1179405)

From MaRDI portal
scientific article

    Statements

    Continuous time Markov decision programming with average reward criterion and unbounded reward rate (English)
    26 June 1992
    Continuous-time Markov decision problems with unbounded reward rates are studied for countable state sets and compact metric action sets. The transition law is described by a controlled conservative transition rate matrix. The average expected reward is to be maximized over a class of (time-dependent) deterministic Markov strategies whose resulting transition probabilities are continuous in time. Additional assumptions are given under which stationary optimal policies exist. The essential arguments are based on an embedded finite-state Markov decision chain with bounded rewards.
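To make the average reward criterion concrete, here is a minimal Python sketch for a much simpler setting than the one the article treats: a finite state set, bounded reward rates, and a fixed stationary policy whose chain is irreducible. It is not the article's construction. The function names, the two example policies "A" and "B", and all numbers are hypothetical. For each policy, the sketch checks that the rate matrix is conservative, finds the stationary distribution by uniformization, and reports the long-run average reward as the stationary expectation of the reward rate.

```python
def is_conservative(Q, tol=1e-12):
    """Check that off-diagonal rates are nonnegative and each row sums to zero."""
    for i, row in enumerate(Q):
        if any(q < 0 for j, q in enumerate(row) if j != i):
            return False
        if abs(sum(row)) > tol:
            return False
    return True


def average_reward(Q, r, iterations=10_000):
    """Long-run average reward under a fixed stationary policy.

    Assumes a finite, irreducible chain with rate matrix Q and reward rates r
    (illustrative assumptions, far narrower than the article's setting).
    """
    if not is_conservative(Q):
        raise ValueError("rate matrix is not conservative")
    n = len(Q)
    max_exit_rate = max(-Q[i][i] for i in range(n))
    if max_exit_rate <= 0:
        raise ValueError("chain has no transitions, so it is not irreducible")
    # Uniformization: P = I + Q / lam is a stochastic matrix with the same
    # stationary distribution as Q. Taking lam strictly above the largest
    # exit rate keeps every diagonal entry positive, so P is aperiodic.
    lam = 2.0 * max_exit_rate
    P = [[(1.0 if i == j else 0.0) + Q[i][j] / lam for j in range(n)]
         for i in range(n)]
    # Power iteration on the row vector pi, starting from the uniform distribution.
    pi = [1.0 / n] * n
    for _ in range(iterations):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    # Average reward = sum over states of (stationary probability * reward rate).
    return sum(p * x for p, x in zip(pi, r))


# Two hypothetical stationary policies on a two-state example.
gain_a = average_reward([[-2.0, 2.0], [1.0, -1.0]], [3.0, 0.0])
gain_b = average_reward([[-1.0, 1.0], [1.0, -1.0]], [2.5, 0.0])
best = "A" if gain_a >= gain_b else "B"
print(gain_a, gain_b, best)
```

Comparing the two values is a brute-force stand-in for the maximization in the criterion. It only works because the example has finitely many stationary policies, which is not the case for the compact action sets considered in the article.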
    continuous time
    unbounded reward
    countable state sets
    compact metric action sets
    average expected reward

    Identifiers