Minimising a threshold probability in discounted Markov decision processes (Q2367616)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Minimising a threshold probability in discounted Markov decision processes
scientific article

    Statements

    Minimising a threshold probability in discounted Markov decision processes (English)
    0 references
    26 August 1993
    0 references
    The paper deals with a discrete time Markov decision process with finite state and action sets. The goal is to find a policy which maximizes the probability that the total discounted reward is greater than a given level. The author derives the optimality equation, proves the existence of an optimal stationary policy, and describes policy and value iteration algorithms for this problem.
    0 references
    threshold probability
    0 references
    discrete time Markov decision process
    0 references
    finite state and action sets
    0 references
    total discounted reward
    0 references
    optimality equation
    0 references
    existence of an optimal stationary policy
    0 references
    policy and value iteration algorithms
    0 references
    0 references

    Identifiers