On Markovian decision programming with recursive reward functions (Q2640462)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On Markovian decision programming with recursive reward functions |
scientific article |
Statements
On Markovian decision programming with recursive reward functions (English)
0 references
1990
0 references
The paper deals with a discrete time Markov decision model with infinite horizon and recursive reward functions. This model was considered earlier by \textit{N. Furukawa} and \textit{S. Iwamoto} [Bull. Math. Statist. 15, No.3/4, 79-91 (1973; Zbl 0304.90117)]. The state and action sets are countable or finite. The authors establish the optimality principle and optimality equation when the model satisfies some additional conditions. They prove the existence of Markov optimal and \(\epsilon\)-optimal policies for nonhomogeneous models. For homogeneous models the existence of stationary optimal and \(\epsilon\)-optimal policies is proved. The authors also consider a policy iteration algorithm for homogeneous models.
0 references
discrete time Markov decision model
0 references
infinite horizon
0 references
recursive reward functions
0 references
optimality principle
0 references
optimality equation
0 references
\(\epsilon \) - optimal policies
0 references