A note on optimization formulations of Markov decision processes (Q2129661)

From MaRDI portal
scientific article
Language Label Description Also known as
English
A note on optimization formulations of Markov decision processes
scientific article

    Statements

    A note on optimization formulations of Markov decision processes (English)
    0 references
    0 references
    0 references
    22 April 2022
    0 references
    The paper summarizes the primal, primal-dual and dual problems for discounted standard Markov decision processes, discounted regularized Markov decision processes, undiscounted standard Markov decision processes and undiscounted regularized Markov decision processes. Moreover, the paper shows the equivalence between the dual problem and policy gradient as well as the equivalence between the primal problem and Bellman equation for the above four Markov decision processes. These optimization formulations are helpful for the theoretical study of Markov decision processes algorithms.
    0 references
    Markov decision processes
    0 references
    optimization
    0 references
    linear programming
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references