Approximate receding horizon approach for Markov decision processes: average reward case (Q1414220)
From MaRDI portal
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Approximate receding horizon approach for Markov decision processes: average reward case | scientific article | |
Statements
Approximate receding horizon approach for Markov decision processes: average reward case (English)
20 November 2003
The authors consider an approximation scheme for solving Markov decision processes (MDPs) with countable state space, finite action space, and bounded rewards. The scheme uses an approximate solution of a fixed finite-horizon sub-MDP of a given infinite-horizon MDP to construct a stationary policy, which the authors call ''approximate receding horizon control''. They analyze the performance of approximate receding horizon control under suitable conditions, study two examples, provide a simple proof of a policy improvement result for countable state spaces, and discuss practical implementations of these schemes via simulation.
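To make the construction concrete, here is a minimal sketch in Python/NumPy of (exact, rather than approximate) receding horizon control for a small finite MDP: the ''H''-horizon sub-MDP is solved by backward induction, and the stationary policy takes, in every state, the first action of an ''H''-step optimal plan. The names ``P``, ``R``, ``H`` and the toy numbers are hypothetical illustrations; the paper's setting (countable state space, approximate finite-horizon solutions) is more general.

<pre>
import numpy as np

def finite_horizon_value(P, R, H):
    """Optimal H-step value function by backward induction.

    P: (A, S, S) array, P[a, s, s2] = transition probability.
    R: (S, A) array of bounded one-step rewards.
    """
    V = np.zeros(P.shape[1])
    for _ in range(H):
        Q = R + (P @ V).T      # Q[s, a] = R[s, a] + E[V(next state) | s, a]
        V = Q.max(axis=1)
    return V

def receding_horizon_policy(P, R, H):
    """Stationary policy: in every state, act greedily with respect to
    the (H-1)-step value function, i.e. solve the H-horizon sub-MDP and
    keep only the first action of the resulting plan."""
    V = finite_horizon_value(P, R, H - 1)
    Q = R + (P @ V).T
    return Q.argmax(axis=1)    # pi[s] = first action of an H-step optimal plan

# Toy 2-state, 2-action MDP (hypothetical numbers for illustration).
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.5, 0.5], [0.6, 0.4]]])
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])
print(receding_horizon_policy(P, R, H=10))
</pre>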
Markov decision process
receding horizon control
infinite-horizon average reward
policy improvement
ergodicity