On Markovian decision programming with recursive reward functions (Q2640462)

From MaRDI portal
scientific article

    Statements

    On Markovian decision programming with recursive reward functions (English)
    1990
    The paper deals with a discrete-time Markov decision model with an infinite horizon and recursive reward functions. This model was considered earlier by \textit{N. Furukawa} and \textit{S. Iwamoto} [Bull. Math. Statist. 15, No. 3/4, 79-91 (1973; Zbl 0304.90117)]. The state and action sets are finite or countable. Under some additional conditions on the model, the authors establish the optimality principle and the optimality equation. They prove the existence of Markov optimal and \(\epsilon\)-optimal policies for nonhomogeneous models, and of stationary optimal and \(\epsilon\)-optimal policies for homogeneous models. They also consider a policy iteration algorithm for homogeneous models.
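For orientation only, here is a minimal sketch of policy iteration for a finite homogeneous model. It assumes the additive discounted criterion (one-step reward plus `beta` times the expected continuation value), which is used here as the simplest instance of a recursive reward; it is not the authors' algorithm or their general setting. The function name `policy_iteration`, the array layout of `P` and `r`, and the two-state example are all hypothetical.

```python
import numpy as np


def policy_iteration(P, r, beta=0.9, max_iter=1000):
    """Policy iteration for a finite, homogeneous, discounted model (assumed setting).

    P: array of shape (A, S, S), P[a, s, t] = probability of moving s -> t under action a.
    r: array of shape (S, A), one-step reward for taking action a in state s.
    beta: discount factor in [0, 1), an assumption of this sketch.
    Returns a stationary policy (one action index per state) and its value vector.
    """
    n_actions, n_states, _ = P.shape
    states = np.arange(n_states)
    policy = np.zeros(n_states, dtype=int)
    v = np.zeros(n_states)
    for _ in range(max_iter):
        # Policy evaluation: solve (I - beta * P_pi) v = r_pi for the current policy.
        P_pi = P[policy, states, :]
        r_pi = r[states, policy]
        v = np.linalg.solve(np.eye(n_states) - beta * P_pi, r_pi)
        # Policy improvement: act greedily with respect to the evaluated values.
        q = r + beta * np.einsum("ast,t->sa", P, v)
        new_policy = np.argmax(q, axis=1)
        if np.array_equal(new_policy, policy):
            break  # the policy satisfies the optimality equation of this special case
        policy = new_policy
    return policy, v


# Hypothetical two-state example: state 1 pays reward 1 forever,
# and action 1 in state 0 moves to state 1.
P_example = np.array([
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay in the current state
    [[0.0, 1.0], [0.0, 1.0]],  # action 1: go to (or stay in) state 1
])
r_example = np.array([[0.0, 0.0], [1.0, 1.0]])
policy_example, v_example = policy_iteration(P_example, r_example, beta=0.9)
```

In this example the greedy policy stabilizes after one improvement step; the general recursive reward functions of the paper would replace the expression `r + beta * ...` with a more general aggregation, which this sketch does not attempt.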
    discrete time Markov decision model
    infinite horizon
    recursive reward functions
    optimality principle
    optimality equation
    \(\epsilon\)-optimal policies

    Identifiers