A convex analytic approach to Markov decision processes (Q1093563): Difference between revisions

This paper develops a new framework for the study of Markov decision processes in which the control problem is viewed as an optimization problem on the set of canonically induced measures on the trajectory space of the joint state and control process. This set is shown to be compact convex. One then associates with each of the usual cost criteria (infinite horizon discounted cost, finite horizon, control up to an exit time) a naturally defined occupation measure such that the cost is an integral of some function with respect to this measure. These measures are shown to form a compact convex set whose extreme points are characterized. Classical results about existence of optimal strategies are recovered from this and several applications to multicriteria and constrained optimization problems are briefly indicated.

0 references

zbMATH Keywords

canonically induced measures

0 references

optimization in measure space

0 references

occupation measure

0 references

existence of optimal strategies

0 references

multicriteria

0 references

constrained optimization

0 references

Identifiers

zbMATH Open document ID

0628.90090

0 references

DOI

10.1007/BF00353877

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1093563

Revision as of 19:13, 12 July 2023 Importer (talk \| contribs) Bots 7,080,615 edits ‎Created a new Item	Revision as of 01:21, 31 January 2024 Import240129110113 (talk \| contribs) Bots 7,163,963 edits Added link to MaRDI item. Newer edit →
links / mardi / name	links / mardi / name
		Publication:1093563