Necessary conditions for the optimality equation in average-reward Markov decision processes (Q1115358)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Necessary conditions for the optimality equation in average-reward Markov decision processes |
scientific article |
Statements
Necessary conditions for the optimality equation in average-reward Markov decision processes (English)
0 references
1989
0 references
The paper considers average reward Markov decision processes with denumerable state space, compact action space and bounded rewards. One condition for the optimality equation to have a bounded solution with constant gain rate is that the return times to a state x(f) are uniformly bounded over all stationary policies f (here condition A). This condition cannot be expected to be necessary since the special reward function is not involved. But it is shown to be necessary for the above property to hold uniformly over all bounded reward functions and all subproblems with restricted action sets (with some relation between the bounds for the rewards and for the solution). A similar question for finite state spaces was discussed by \textit{P. J. Schweitzer} [R.A.I.R.O. 19, 71-86 (1985; Zbl 0571.90095)].
0 references
existence of solutions
0 references
average reward
0 references
denumerable state space
0 references
compact action space
0 references
bounded rewards
0 references
optimality equation
0 references
0 references
0 references