Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes (Q1814435): Difference between revisions

From MaRDI portal
Created claim: Wikidata QID (P12): Q60167570, #quickstatements; #temporary_batch_1712190744730
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Optimal control of Markov processes with incomplete state information / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3795523 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic optimal control. The discrete time case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Control of Markov Chains with Long-Run Average Cost Criterion: The Dynamic Programming Equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Necessary conditions for the optimality equation in average-reward Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4389518 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive Markov control processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recurrence conditions for Markov decision processes with Borel state space: A survey / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average cost Markov decision processes: Optimality conditions / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3487241 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Infinite-Horizon Undiscounted Control of Finite Probabilistic Systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Arbitrary State Markovian Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3683893 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs / rank
 
Normal rank

Latest revision as of 10:40, 15 May 2024

scientific article
Language Label Description Also known as
English
Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes
scientific article

    Statements

    Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes (English)
    0 references
    25 June 1992
    0 references
    Considering Markov decision processes with Borel state and action spaces, the paper deals with the `` average cost optimality equation'' and presents necessary conditions for the existence of a bounded solution to this equation. Roughly spoken, the long-run expected average cost incurred by policy \(\mu\) is given by \[ J(x,\mu):=\limsup_{n\to\infty}{1\over n+1} E^ \mu_ x\left[ \sum_{t=0}^ n c(X_ t,U_ t)\right] \] for state \(X\), control \(U\), cost function \(c\) and initial state \(x\). The interest in finding conditions that guarantee solutions to the average cost optimality equation derives from a result showing that a bounded solution of that equation leads to, e.g., optimal stationary policies for the decision process. The objective of the paper is to exhibit some necessary conditions that complement known sufficient conditions. The authors stress the fact that from their results it can be appreciated more clearly how restrictive it is to require bounded solutions to the average cost optimality equation, which in turn motivates further studies dealing with unbounded solutions.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    Markov decision processes
    0 references
    average cost optimality equation
    0 references
    0 references
    0 references
    0 references