Optimal threshold probability in undiscounted Markov decision processes with a target set. (Q1427885)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Optimal threshold probability in undiscounted Markov decision processes with a target set.
scientific article

    Statements

    Optimal threshold probability in undiscounted Markov decision processes with a target set. (English)
    0 references
    0 references
    14 March 2004
    0 references
    The paper is about minimizing risk models with a threshold criterion \(P_i^\pi(Z \leq r)\) where \(Z=\sum_{k=1}^{\tau-1} Y_k\) and \(\tau\) is a first passage time to a target set. The problem is modeled as undiscounted Markov decision process with discrete time space and countable state, action, and reward space. The main results are that the optimal value function is a unique solution to an optimality equation and that an optimal right continuous stationary policy exists. Some value iteration methods and a policy space iteration method are presented.
    0 references
    minimizing risk model
    0 references
    opimal value function
    0 references
    stationary policy
    0 references

    Identifiers