On risk-sensitive piecewise deterministic Markov decision processes (Q2187326): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Added link to MaRDI item.
links / mardi / namelinks / mardi / name
 

Revision as of 01:23, 2 February 2024

scientific article
Language Label Description Also known as
English
On risk-sensitive piecewise deterministic Markov decision processes
scientific article

    Statements

    On risk-sensitive piecewise deterministic Markov decision processes (English)
    0 references
    0 references
    0 references
    2 June 2020
    0 references
    This paper is devoted to a risk-sensitive piecewise deterministic Markov process (PDMDP) in Borel state and an action space with nonnegative cost rate. The transition and cost rates are assumed to be weakly integrable along the drift and the exponential utility of the total cost has to be minimized. The authors show that the value function is a solution to the optimality equation, justify the value iteration algorithm and prove the existence of the deterministic stationary policy. It should be stressed that a PDMDP, not systematically earlier studied in the literature, is an extention of a continuous-time Markov decision process (CTMDP), where between consecutive jumps, the process evolves according to a deterministic Markov process. The obtained results are further applied to improve the known results for finite horizon undisconected and infinite horizon disconected risk-sensitive CTMDP presented in: [\textit{M. K. Ghosh} and \textit{S. Saha}, Stochastics 86, No. 4, 655--675 (2014; Zbl 1337.49046)] and [\textit{Q. Wei}, Math. Methods Oper. Res. 84, No. 3, 461--487 (2016; Zbl 1354.93179)].
    0 references
    continuous-time Markov decision processes
    0 references
    piecewise deterministic Markov decision processes
    0 references
    exponential utility
    0 references
    dynamic programming
    0 references

    Identifiers