Learning parametric policies and transition probability models of Markov decision processes from data (Q2220059)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Learning parametric policies and transition probability models of Markov decision processes from data
scientific article

    Statements

    Learning parametric policies and transition probability models of Markov decision processes from data (English)
    0 references
    0 references
    0 references
    21 January 2021
    0 references
    policy learning
    0 references
    learning transition dynamics
    0 references
    Markov decision processes
    0 references
    regularization
    0 references
    maximum likelihood estimation
    0 references

    Identifiers