Mean, variance and probabilistic criteria in finite Markov decision processes: A review (Q1821706): Difference between revisions

This paper surveys work that uses nonstandard Markov decision process criteria, i.e., criteria that do not simply optimize the expected return per unit time or the expected discounted return. It covers infinite-horizon nondiscounted formulations, infinite-horizon discounted formulations, and finite-horizon formulations. For problems that can be formulated solely in terms of the probabilities of being in each state and taking each action, policy equivalence results are given which allow policies to be restricted to the class of Markov policies or to randomizations of deterministic Markov policies. For problems that cannot be stated in such terms over the primitive state set I, formulations involving a redefinition of the states are examined.

Mathematics Subject Classification ID

90C40

zbMATH DE Number

3999706

zbMATH Keywords

survey

nonstandard Markov decision process criteria

infinite-horizon nondiscounted formulations

discounted formulations

finite-horizon

mean

variance

probabilistic criteria

Sitelinks

Mathematics (1 entry)

mardi Publication:1821706

Revision as of 10:46, 1 February 2024, by Import240129110113 (bot): Added link to MaRDI item.
Revision as of 09:16, 10 February 2024, by RedirectionBot (bot): Removed claim: author (P16): Item:Q181183
Property / author
	~~Douglas J. White~~
Property / author: Douglas J. White / rank
	~~Normal rank~~