A version of the Euler equation in discounted Markov decision processes (Q1952742): Difference between revisions

The paper deals with optimal control problems in discrete time and with an infinite horizon. The problems are considered by means of the Markov decision processes theory. The optimal control problems are proposed to be solved with the dynamic programming technique. The optimal solution is characterized by a functional equation known as the dynamic programming equation. In simple cases, the value iteration procedure is used to approximate the optimal value function. However, this method does not work in more complicated functional forms. The essential tool used in the paper is the Euler equation, this equation is established and solved (in some cases empirically). The authors present an iterative method of finding the solution of the Euler equation in terms of the value iteration function. Under certain conditions, the validity of the Euler equation can be guaranteed. Using the maximizers' convergence of the optimal policy, the optimal control problem is solved. A linear quadratic problem illustrates the theory.

0 references

reviewed by

Q749165

0 references

zbMATH Keywords

optimal control

0 references

Markov decision process

0 references

dynamic programming

0 references

Euler equation

0 references

optimal value function

0 references

value iteration function

0 references

Identifiers

zbMATH Open document ID

1272.49045

0 references

DOI

10.1155/2012/103698

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

0 references

Revision as of 17:55, 29 July 2023 Importer (talk \| contribs) Bots 7,038,868 edits ‎Created a new Item	Revision as of 12:22, 28 December 2023 Daniel (talk \| contribs) Bureaucrats, Interface administrators, private, Suppressors, Administrators 586,128 edits ‎Created claim: Wikidata QID (P12): Q58905696, #quickstatements; #temporary_batch_1703762337552 Tag: QuickStatements [1.0.4] Newer edit →
	Property / Wikidata QID
		Q58905696
	Property / Wikidata QID: Q58905696 / rank
		Normal rank

A version of the Euler equation in discounted Markov decision processes (Q1952742): Difference between revisions

Revision as of 12:22, 28 December 2023

Statements

Identifiers

Sitelinks

Mathematics(0 entries)