A note on optimization formulations of Markov decision processes (Q2129661)

scientific article; zbMATH DE number 7512733

Language	Label	Description	Also known as
default for all languages	No label defined
English	A note on optimization formulations of Markov decision processes	scientific article; zbMATH DE number 7512733

Statements

instance of

scholarly article

0 references

title

A note on optimization formulations of Markov decision processes (English)

0 references

0 references

0 references

Communications in Mathematical Sciences

0 references

publication date

22 April 2022

0 references

full work available at URL

https://arxiv.org/abs/2012.09417

0 references

review text

The paper summarizes the primal, primal-dual and dual problems for discounted standard Markov decision processes, discounted regularized Markov decision processes, undiscounted standard Markov decision processes and undiscounted regularized Markov decision processes. Moreover, the paper shows the equivalence between the dual problem and policy gradient as well as the equivalence between the primal problem and Bellman equation for the above four Markov decision processes. These optimization formulations are helpful for the theoretical study of Markov decision processes algorithms.

0 references

zbMATH Keywords

Markov decision processes

0 references

optimization

0 references

linear programming

0 references

reviewed by

Yan-Hong Song

0 references

MaRDI profile type