A note on optimization formulations of Markov decision processes (Q2129661)

From MaRDI portal





scientific article
Language Label Description Also known as
default for all languages
No label defined
    English
    A note on optimization formulations of Markov decision processes
    scientific article

      Statements

      A note on optimization formulations of Markov decision processes (English)
      0 references
      0 references
      0 references
      22 April 2022
      0 references
      The paper summarizes the primal, primal-dual and dual problems for discounted standard Markov decision processes, discounted regularized Markov decision processes, undiscounted standard Markov decision processes and undiscounted regularized Markov decision processes. Moreover, the paper shows the equivalence between the dual problem and policy gradient as well as the equivalence between the primal problem and Bellman equation for the above four Markov decision processes. These optimization formulations are helpful for the theoretical study of Markov decision processes algorithms.
      0 references
      Markov decision processes
      0 references
      optimization
      0 references
      linear programming
      0 references
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references