Policy iteration for average cost Markov control processes on Borel spaces (Q1357514)
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Policy iteration for average cost Markov control processes on Borel spaces | scientific article | |
Statements
Policy iteration for average cost Markov control processes on Borel spaces (English)
13 October 1997
Howard's algorithm for the average cost problem of discrete-time Markov control processes (MCPs) with Borel state and action spaces and possibly unbounded costs is studied. Two classes of MCPs on Borel spaces are presented for which the policy iteration algorithm (PIA) converges: (i) restricted-growth unbounded costs, compact control constraint sets and strong ergodicity; (ii) strictly unbounded costs and non-compact control constraint sets. Conditions are given under which the PIA converges to a solution of the average cost optimality equation, thus yielding the optimal cost and an optimal stationary control policy. An example illustrates the result.
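For orientation, the following is a minimal sketch of Howard's policy iteration for the average cost criterion in the much simpler finite-state, finite-action case. It is not the Borel-space construction studied in the article. It assumes that every stationary policy induces a unichain transition matrix, so that fixing h(0) = 0 makes the policy evaluation equations uniquely solvable. The function names, the array layout `P[x, a, y]` and `c[x, a]`, and the two-state example data are all hypothetical.

```python
import numpy as np


def evaluate_policy(P, c, policy):
    """Solve rho + h(x) = c(x, f(x)) + sum_y P(y | x, f(x)) h(y) with h(0) = 0.

    Assumes the chain induced by `policy` is unichain (illustrative assumption).
    """
    n = P.shape[0]
    idx = np.arange(n)
    P_f = P[idx, policy]          # transition matrix under the policy, shape (n, n)
    c_f = c[idx, policy]          # one-stage cost under the policy, shape (n,)
    A = np.eye(n) - P_f
    A[:, 0] = 1.0                 # h(0) is fixed at 0, so column 0 carries rho instead
    sol = np.linalg.solve(A, c_f)
    rho = sol[0]
    h = sol.copy()
    h[0] = 0.0
    return rho, h


def policy_iteration(P, c, max_iter=100, tol=1e-10):
    """Alternate policy evaluation and improvement until the policy is stable."""
    n = P.shape[0]
    idx = np.arange(n)
    policy = np.zeros(n, dtype=int)   # arbitrary initial stationary policy
    for _ in range(max_iter):
        rho, h = evaluate_policy(P, c, policy)
        q = c + P @ h                 # q[x, a] = c(x, a) + sum_y P(y | x, a) h(y)
        new_policy = q.argmin(axis=1)
        # Keep the current action when it is already minimizing, to avoid cycling on ties.
        keep = q[idx, policy] <= q[idx, new_policy] + tol
        new_policy[keep] = policy[keep]
        if np.array_equal(new_policy, policy):
            return policy, rho, h
        policy = new_policy
    raise RuntimeError("policy iteration did not stabilize within max_iter")


# Hypothetical two-state, two-action example.
P = np.array([[[0.5, 0.5], [0.9, 0.1]],
              [[0.5, 0.5], [0.9, 0.1]]])
c = np.array([[1.0, 1.5],
              [10.0, 10.5]])
policy, rho, h = policy_iteration(P, c)
print(policy, rho, h)
```

In this example the iteration stops at the policy that selects action 1 in both states, with average cost 2.4 and relative value h = (0, 9). The returned pair (rho, h) satisfies the finite-state average cost optimality equation rho + h(x) = min_a [c(x, a) + sum_y P(y | x, a) h(y)].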
Howard's algorithm
average cost problem
discrete time Markov control processes
policy iteration algorithm