A convex analytic approach to Markov decision processes (Q1093563)

From MaRDI portal
Property / MaRDI profile type: MaRDI publication profile
Property / cites work: Q5560061
Property / cites work: Dynamic programming and stochastic control
Property / cites work: On Minimum Cost Per Unit Time Control of Markov Chains
Property / cites work: Q5508589
Property / full work available at URL: https://doi.org/10.1007/bf00353877
Property / OpenAlex ID: W2004640191

Latest revision as of 09:54, 30 July 2024

Language: English
Label: A convex analytic approach to Markov decision processes
Description: scientific article

    Statements

    A convex analytic approach to Markov decision processes (English)
    1988
    This paper develops a new framework for the study of Markov decision processes in which the control problem is viewed as an optimization problem over the set of canonically induced measures on the trajectory space of the joint state and control process. This set is shown to be compact and convex. With each of the usual cost criteria (infinite-horizon discounted cost, finite horizon, control up to an exit time) one then associates a naturally defined occupation measure such that the cost is the integral of some function with respect to this measure. These occupation measures are shown to form a compact convex set whose extreme points are characterized. Classical results on the existence of optimal strategies are recovered from this, and several applications to multicriteria and constrained optimization problems are briefly indicated.
    canonically induced measures
    optimization in measure space
    occupation measure
    existence of optimal strategies
    multicriteria
    constrained optimization
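
The occupation-measure idea summarized in the abstract can be illustrated on a toy finite model. The sketch below is not taken from the paper: the two-state, two-action MDP, the policy, and every number in it are hypothetical, and the normalization by (1 - beta) is one common convention assumed here. It computes the discounted occupation measure of a stationary randomized policy, checks that the discounted cost equals the integral (here a finite sum) of the one-step cost against that measure, and checks the linear balance constraints that make the set of such measures convex in the finite case.

```python
# Hypothetical two-state, two-action MDP; all numbers are illustrative assumptions.
BETA = 0.9                       # discount factor
NU = [0.5, 0.5]                  # initial state distribution
# P[s][a][t] = probability of moving from state s to state t under action a
P = [[[0.8, 0.2], [0.1, 0.9]],
     [[0.5, 0.5], [0.3, 0.7]]]
# C[s][a] = one-step cost of taking action a in state s
C = [[1.0, 2.0], [0.5, 3.0]]
# PI[s][a] = probability that the stationary randomized policy picks a in s
PI = [[0.6, 0.4], [0.2, 0.8]]


def occupation_measure(horizon=2000):
    """Normalized discounted occupation measure mu[s][a], via a truncated series."""
    n_s, n_a = len(P), len(P[0])
    mu = [[0.0] * n_a for _ in range(n_s)]
    dist = NU[:]                 # law of the state at the current time step
    weight = 1.0 - BETA          # equals (1 - BETA) * BETA**k at step k
    for _ in range(horizon):
        nxt = [0.0] * n_s
        for s in range(n_s):
            for a in range(n_a):
                mass = dist[s] * PI[s][a]
                mu[s][a] += weight * mass
                for t in range(n_s):
                    nxt[t] += mass * P[s][a][t]
        dist = nxt
        weight *= BETA
    return mu


def discounted_cost(iterations=2000):
    """Expected discounted cost, by iterating V = c_pi + BETA * P_pi V from V = 0."""
    n_s, n_a = len(P), len(P[0])
    v = [0.0] * n_s
    for _ in range(iterations):
        v = [sum(PI[s][a] * (C[s][a]
                             + BETA * sum(P[s][a][t] * v[t] for t in range(n_s)))
                 for a in range(n_a))
             for s in range(n_s)]
    return sum(NU[s] * v[s] for s in range(n_s))


def cost_from_measure(mu):
    """Discounted cost written as a sum of the one-step cost against mu."""
    total = sum(mu[s][a] * C[s][a]
                for s in range(len(C)) for a in range(len(C[0])))
    return total / (1.0 - BETA)


def flow_residual(mu):
    """Largest violation of the linear balance constraints satisfied by mu."""
    n_s, n_a = len(P), len(P[0])
    worst = 0.0
    for t in range(n_s):
        inflow = sum(mu[s][a] * P[s][a][t]
                     for s in range(n_s) for a in range(n_a))
        balance = sum(mu[t]) - (1.0 - BETA) * NU[t] - BETA * inflow
        worst = max(worst, abs(balance))
    return worst


if __name__ == "__main__":
    mu = occupation_measure()
    print("occupation measure:", mu)
    print("cost via measure:  ", cost_from_measure(mu))
    print("cost via recursion:", discounted_cost())
    print("balance residual:  ", flow_residual(mu))
```

In this finite setting the balance constraints are linear in mu, so minimizing the cost over admissible measures becomes a linear program, and extra linear constraints on mu would be one way to express the constrained problems mentioned in the abstract. The paper itself works with measures on a trajectory space, which this toy sketch does not attempt to reproduce.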

    Identifiers