A version of the Euler equation in discounted Markov decision processes (Q1952742): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
Set OpenAlex properties.
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1155/2012/103698 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2016648086 / rank
 
Normal rank

Revision as of 03:02, 20 March 2024

scientific article
Language Label Description Also known as
English
A version of the Euler equation in discounted Markov decision processes
scientific article

    Statements

    A version of the Euler equation in discounted Markov decision processes (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    3 June 2013
    0 references
    The paper deals with optimal control problems in discrete time and with an infinite horizon. The problems are considered by means of the Markov decision processes theory. The optimal control problems are proposed to be solved with the dynamic programming technique. The optimal solution is characterized by a functional equation known as the dynamic programming equation. In simple cases, the value iteration procedure is used to approximate the optimal value function. However, this method does not work in more complicated functional forms. The essential tool used in the paper is the Euler equation, this equation is established and solved (in some cases empirically). The authors present an iterative method of finding the solution of the Euler equation in terms of the value iteration function. Under certain conditions, the validity of the Euler equation can be guaranteed. Using the maximizers' convergence of the optimal policy, the optimal control problem is solved. A linear quadratic problem illustrates the theory.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    optimal control
    0 references
    Markov decision process
    0 references
    dynamic programming
    0 references
    Euler equation
    0 references
    optimal value function
    0 references
    value iteration function
    0 references
    0 references
    0 references