Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes (Q1814435): Difference between revisions

Considering Markov decision processes with Borel state and action spaces, the paper deals with the "average cost optimality equation" and presents necessary conditions for the existence of a bounded solution to this equation. Roughly speaking, the long-run expected average cost incurred by a policy \(\mu\) is given by \[ J(x,\mu):=\limsup_{n\to\infty}\frac{1}{n+1} E^{\mu}_{x}\left[ \sum_{t=0}^{n} c(X_t,U_t)\right] \] for state process \(X_t\), control process \(U_t\), cost function \(c\) and initial state \(x\). The interest in finding conditions that guarantee solutions to the average cost optimality equation derives from a result showing that a bounded solution of that equation yields, among other things, optimal stationary policies for the decision process. The objective of the paper is to exhibit some necessary conditions that complement known sufficient conditions. The authors stress that their results make it clearer how restrictive it is to require bounded solutions to the average cost optimality equation, which in turn motivates further studies dealing with unbounded solutions.
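
As a finite-state illustration of the average cost criterion described above (not the Borel-space setting of the paper), the following minimal sketch evaluates the finite-horizon average \(\frac{1}{n+1} E_x[\sum_{t=0}^{n} c(X_t)]\) for a chain under a fixed stationary policy. The transition matrix `P`, the cost vector `c`, and the function name `average_cost` are hypothetical and chosen only for illustration.

```python
# Hypothetical two-state chain induced by a fixed stationary policy.
# P[i][j] is the transition probability from state i to state j;
# c[i] is the one-step cost incurred in state i. These numbers are
# illustrative assumptions and do not come from the paper.
P = [[0.9, 0.1],
     [0.5, 0.5]]
c = [1.0, 3.0]


def average_cost(P, c, x, n):
    """Return (1/(n+1)) * E_x[sum_{t=0}^{n} c(X_t)], computed exactly."""
    size = len(P)
    # Distribution of X_0: a point mass at the initial state x.
    dist = [1.0 if i == x else 0.0 for i in range(size)]
    total = 0.0
    for _ in range(n + 1):
        # Add the expected cost at the current time step.
        total += sum(d * ci for d, ci in zip(dist, c))
        # Propagate the distribution one step: dist <- dist * P.
        dist = [sum(dist[i] * P[i][j] for i in range(size)) for j in range(size)]
    return total / (n + 1)


if __name__ == "__main__":
    for x in range(len(P)):
        print(x, average_cost(P, c, x, 10_000))
```

For this particular chain the stationary distribution is (5/6, 1/6), so the averages approach 4/3 from either initial state as \(n\) grows; the limit not depending on \(x\) is the kind of behaviour that a solution of the average cost optimality equation encodes.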

zbMATH Keywords

Markov decision processes

average cost optimality equation

full work available at URL

https://doi.org/10.1016/0167-6911(90)90067-5

cites work

Optimal control of Markov processes with incomplete state information

Q3795523

Stochastic optimal control. The discrete time case

Control of Markov Chains with Long-Run Average Cost Criterion: The Dynamic Programming Equations

Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains

Necessary conditions for the optimality equation in average-reward Markov decision processes

On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes

Q4389518

Adaptive Markov control processes

Recurrence conditions for Markov decision processes with Borel state space: A survey

Average cost Markov decision processes: Optimality conditions

The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin

Q3487241

Optimal Infinite-Horizon Undiscounted Control of Finite Probabilistic Systems

Arbitrary State Markovian Decision Processes

Q3683893

Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs

Identifiers

zbMATH Open document ID

0742.93079

DOI

10.1016/0167-6911(90)90067-5

Mathematics Subject Classification ID

Sitelinks

Mathematics(1 entry)

mardi Publication:1814435

@@ Property / cites work @@
+Optimal control of Markov processes with incomplete state information
+Normal rank
@@ Property / cites work @@
+Q3795523
@@ Property / cites work: Q3795523 / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic optimal control. The discrete time case
+Normal rank
@@ Property / cites work @@
+Control of Markov Chains with Long-Run Average Cost Criterion: The Dynamic Programming Equations
+Normal rank
@@ Property / cites work @@
+Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains
+Normal rank
@@ Property / cites work @@
+Necessary conditions for the optimality equation in average-reward Markov decision processes
+Normal rank
@@ Property / cites work @@
+On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q4389518
@@ Property / cites work: Q4389518 / rank @@
+Normal rank
@@ Property / cites work @@
+Adaptive Markov control processes
@@ Property / cites work: Adaptive Markov control processes / rank @@
+Normal rank
@@ Property / cites work @@
+Recurrence conditions for Markov decision processes with Borel state space: A survey
+Normal rank
@@ Property / cites work @@
+Average cost Markov decision processes: Optimality conditions
+Normal rank
@@ Property / cites work @@
+The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin
+Normal rank
@@ Property / cites work @@
+Q3487241
@@ Property / cites work: Q3487241 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal Infinite-Horizon Undiscounted Control of Finite Probabilistic Systems
+Normal rank
@@ Property / cites work @@
+Arbitrary State Markovian Decision Processes
@@ Property / cites work: Arbitrary State Markovian Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Q3683893
@@ Property / cites work: Q3683893 / rank @@
+Normal rank
@@ Property / cites work @@
+Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
+Normal rank