Discounted and average Markov decision processes with unbounded rewards: New conditions (Q1206951)

!

WARNING

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use the normal view instead:

Discounted and average Markov decision processes with unbounded rewards: New conditions

scientific article; zbMATH DE number 150681

Language	Label	Description	Also known as
default for all languages	No label defined
English	Discounted and average Markov decision processes with unbounded rewards: New conditions	scientific article; zbMATH DE number 150681

Statements

instance of

scholarly article

0 references

title

Discounted and average Markov decision processes with unbounded rewards: New conditions (English)

0 references

author

Qiying Hu

0 references

published in

Journal of Mathematical Analysis and Applications

0 references

publication date

1 April 1993

0 references

review text

For Markov decision processes (MDP) with discrete time and countable state space, conditions are given under which the known optimality equation for the discounted and for the average case, resp., has a unique resp. a bounded solution and which also regard unbounded rewards. The conditions are weaker than those given by \textit{J. M. Harrison} [Ann. Math. Statist. 43, No. 2, 636-644 (1972; Zbl 0262.90064)], resp. by \textit{S. M. Ross} [Ann. Math. Statist. 39, No. 2, 412-423 (1968; Zbl 0157.505)]. Because these new conditions are difficult to verify, further sufficient, recursively operating conditions are given. In the last part of the paper, continuous-time MDP are investigated by these methods and reduced to discrete-time ones. The paper contains a rich material so that only essentially new lemmas and theorems are proved. Also not all used symbols are explicitely defined. However, many remarks permit the reader to make connections to previously known facts.

0 references

zbMATH Keywords

discounted case

0 references

discrete time

0 references

countable state space

0 references

average case

0 references

MaRDI profile type

MaRDI publication profile