Policy iteration for average cost Markov control processes on Borel spaces (Q1357514)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Policy iteration for average cost Markov control processes on Borel spaces
scientific article

    Statements

    Policy iteration for average cost Markov control processes on Borel spaces (English)
    0 references
    13 October 1997
    0 references
    Howard's algorithm for the average cost problem of discrete time Markov control processes (MPC) with Borel state and action spaces, and possibly unbounded cost is studied. Two classes of MPC's on Borel spaces are presented for which the policy iteration algorithm (PIA) converges. (i) restricted growth unbounded cost, compact control constraint sets and strong ergodicity, (ii) strictly unbounded cost, non-compact control constraint sets. Conditions are given under which the PIA converges to a solution of the average cost optimality equation, thus giving the optimal cost and an optimal stationary control policy. An example illustrates the result.
    0 references
    0 references
    Howard's algorithm
    0 references
    average cost problem
    0 references
    discrete time Markov control processes
    0 references
    policy iteration algorithm
    0 references
    0 references