A variance minimization problem for a Markov decision process (Q1091952)

From MaRDI portal
Revision as of 09:51, 18 June 2024 by ReferenceBot (talk | contribs) (‎Changed an Item)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
A variance minimization problem for a Markov decision process
scientific article

    Statements

    A variance minimization problem for a Markov decision process (English)
    0 references
    0 references
    1987
    0 references
    This paper deals with a discrete time Markov decision process with finite states and finite actions. The author investigates the problem to determine an optimal random policy that minimizes the variance of reward, with some constraint on the average reward. Introducing a parametric Markov decision process, he gives a procedure to find this optimal policy.
    0 references
    discrete time Markov decision process
    0 references
    finite states
    0 references
    finite actions
    0 references
    optimal random policy
    0 references
    variance of reward
    0 references
    parametric Markov decision process
    0 references
    0 references

    Identifiers