A semimartingale characterization of average optimal stationary policies for Markov decision processes (Q871336)

A semimartingale characterization of average optimal stationary policies for Markov decision processes

scientific article; zbMATH DE number 5134583

Language	Label	Description	Also known as
default for all languages	No label defined
English	A semimartingale characterization of average optimal stationary policies for Markov decision processes	scientific article; zbMATH DE number 5134583

Statements

instance of

scholarly article

0 references

title

A semimartingale characterization of average optimal stationary policies for Markov decision processes (English)

0 references

Journal of Applied Mathematics and Stochastic Analysis

0 references

publication date

19 March 2007

0 references

full work available at URL

https://eudml.org/doc/54838

0 references

review text

Summary: This paper deals with discrete-time Markov decision processes with Borel state and action spaces. The criterion to be minimized is the expected average cost, and the costs may be unbounded both from above and from below. In our former paper [J. Appl. Probab. 43, No. 2, 318–334 (2006; Zbl 1121.90122)], weaker conditions were proposed to ensure the existence of average optimal stationary policies. In this paper, we further study some properties of optimal policies. Under these weaker conditions, we not only obtain two necessary and sufficient conditions for optimal policies, but also give a "semimartingale characterization" of an average optimal stationary policy.

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey

0 references

Finite state Markovian decision processes

0 references

Q3862198

0 references

Average cost Markov control processes with weighted norms: existence of canonical policies