A semimartingale characterization of average optimal stationary policies for Markov decision processes (Q871336)

From MaRDI portal
scientific article
Language Label Description Also known as
English
A semimartingale characterization of average optimal stationary policies for Markov decision processes
scientific article

    Statements

    A semimartingale characterization of average optimal stationary policies for Markov decision processes (English)
    0 references
    0 references
    0 references
    19 March 2007
    0 references
    Summary: This paper deals with discrete-time Markov decision processes with Borel state and action spaces. The criterion to be minimized is the average expected costs, and the costs may have neither upper nor lower bounds. In our former paper [J. Appl. Probab. 43, No. 2, 318--334 (2006; Zbl 1121.90122)], weaker conditions are proposed to ensure the existence of average optimal stationary policies. In this paper, we further study some properties of optimal policies. Under these weaker conditions, we not only obtain two necessary and sufficient conditions for optimal policies, but also give a ``semimartingale characterization'' of an average optimal stationary policy.
    0 references

    Identifiers