Optimal threshold probability in undiscounted Markov decision processes with a target set. (Q1427885): Difference between revisions

From MaRDI portal
ReferenceBot (talk | contribs)
Changed an Item
Set OpenAlex properties.
 
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/s0096-3003(03)00158-9 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2083611438 / rank
 
Normal rank

Latest revision as of 11:10, 30 July 2024

scientific article
Language Label Description Also known as
English
Optimal threshold probability in undiscounted Markov decision processes with a target set.
scientific article

    Statements

    Optimal threshold probability in undiscounted Markov decision processes with a target set. (English)
    0 references
    0 references
    14 March 2004
    0 references
    The paper is about minimizing risk models with a threshold criterion \(P_i^\pi(Z \leq r)\) where \(Z=\sum_{k=1}^{\tau-1} Y_k\) and \(\tau\) is a first passage time to a target set. The problem is modeled as undiscounted Markov decision process with discrete time space and countable state, action, and reward space. The main results are that the optimal value function is a unique solution to an optimality equation and that an optimal right continuous stationary policy exists. Some value iteration methods and a policy space iteration method are presented.
    0 references
    minimizing risk model
    0 references
    opimal value function
    0 references
    stationary policy
    0 references

    Identifiers