Block Policy Mirror Descent (Q6093281)

From MaRDI portal
scientific article; zbMATH DE number 7734884
Language Label Description Also known as
English
Block Policy Mirror Descent
scientific article; zbMATH DE number 7734884

    Statements

    Block Policy Mirror Descent (English)
    0 references
    6 September 2023
    0 references
    Markov decision process
    0 references
    reinforcement learning
    0 references
    policy gradient
    0 references
    mirror descent
    0 references
    block coordinate decent
    0 references
    iteration and sample complexity
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references