Markov decision processes on Borel spaces with total cost and random horizon (Q467433)

scientific article; zbMATH DE number 6363579


Statements

instance of

scholarly article


title

Markov decision processes on Borel spaces with total cost and random horizon (English)


author

Hugo Cruz-Suárez


Rocio Ilhuicatzi-Roldán


Raúl Montes-de-Oca


published in

Journal of Optimization Theory and Applications


publication date

3 November 2014


review text

The paper deals with Markov decision processes (MDPs) on Borel spaces with possibly unbounded costs. It is motivated by the treatment of the discounted optimal control problem in the book by M. L. Puterman [Markov decision processes: discrete stochastic dynamic programming. New York, NY: John Wiley & Sons (1994; Zbl 0829.90134)]. There it is proved that the discounted control problem can be treated as a control problem whose horizon is a random variable that follows a geometric distribution and is independent of the process. The results of the paper are obtained by a dynamic programming approach. They make it possible to handle discounted control problems with a varying-time discount factor, which may also depend on the state of the system and on the corresponding action. To illustrate the theory, a version of the linear-quadratic model with a random horizon and a logarithmic consumption-investment model are presented.
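
The equivalence between discounting and a geometric random horizon mentioned in the review can be sketched as a short derivation. The notation here is an assumption made for illustration and is not taken from the paper: \tau is the random horizon, \alpha in (0,1) is the discount factor, c(x_t, a_t) is the one-stage cost, the costs are taken to be nonnegative so that sum and expectation can be interchanged, and the indexing convention (horizon starting at 0) may differ from the one used by Puterman or by the authors.

```latex
% Assumed convention: \tau takes values 0, 1, 2, ... and is independent of (x_t, a_t).
\[
  P(\tau = n) = (1-\alpha)\,\alpha^{n}, \quad n = 0, 1, 2, \dots
  \qquad\Longrightarrow\qquad
  P(\tau \ge t) = \sum_{n \ge t} (1-\alpha)\,\alpha^{n} = \alpha^{t}.
\]
% Total expected cost up to the random horizon (interchange justified by c >= 0):
\begin{align*}
  E\Bigl[\sum_{t=0}^{\tau} c(x_t, a_t)\Bigr]
    &= E\Bigl[\sum_{t=0}^{\infty} \mathbf{1}\{\tau \ge t\}\, c(x_t, a_t)\Bigr] \\
    &= \sum_{t=0}^{\infty} P(\tau \ge t)\, E\bigl[c(x_t, a_t)\bigr] \\
    &= \sum_{t=0}^{\infty} \alpha^{t}\, E\bigl[c(x_t, a_t)\bigr].
\end{align*}
```

Under the same assumptions, a horizon with a general distribution gives the weights P(\tau \ge t) in place of \alpha^{t}, which is one plausible reading of how a discount factor that varies with time arises from the random-horizon formulation.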


reviewed by

Wiesław Kotarski


zbMATH Keywords

Markov decision process


total cost


varying-time discount factor


dynamic programming equation