Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes (Q1814435): Difference between revisions

Considering Markov decision processes with Borel state and action spaces, the paper deals with the `` average cost optimality equation'' and presents necessary conditions for the existence of a bounded solution to this equation. Roughly spoken, the long-run expected average cost incurred by policy \(\mu\) is given by \[ J(x,\mu):=\limsup_{n\to\infty}{1\over n+1} E^ \mu_ x\left[ \sum_{t=0}^ n c(X_ t,U_ t)\right] \] for state \(X\), control \(U\), cost function \(c\) and initial state \(x\). The interest in finding conditions that guarantee solutions to the average cost optimality equation derives from a result showing that a bounded solution of that equation leads to, e.g., optimal stationary policies for the decision process. The objective of the paper is to exhibit some necessary conditions that complement known sufficient conditions. The authors stress the fact that from their results it can be appreciated more clearly how restrictive it is to require bounded solutions to the average cost optimality equation, which in turn motivates further studies dealing with unbounded solutions.

0 references

zbMATH Keywords

Markov decision processes

0 references

average cost optimality equation

0 references

reviewed by

E. Bertsch

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1016/0167-6911(90)90067-5

0 references

Identifiers

zbMATH Open document ID

0742.93079

0 references

DOI

10.1016/0167-6911(90)90067-5

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1814435

Revision as of 20:55, 19 March 2024 Openalex240319060354 (talk \| contribs) 1,841,457 edits Set OpenAlex properties. ← Older edit	Revision as of 02:02, 4 April 2024 Daniel (talk \| contribs) Bureaucrats, Interface administrators, private, Suppressors, Administrators 621,624 edits ‎Created claim: Wikidata QID (P12): Q60167570, #quickstatements; #temporary_batch_1712190744730 Tag: QuickStatements [1.0.4] Newer edit →
	Property / Wikidata QID
		Q60167570
	Property / Wikidata QID: Q60167570 / rank
		Normal rank