Analysis and improvement of policy gradient estimation (Q448295)

From MaRDI portal
Revision as of 05:16, 30 January 2024 by Import240129110155 (talk | contribs) (Added link to MaRDI item.)
scientific article
Language Label Description Also known as
English
Analysis and improvement of policy gradient estimation
scientific article

    Statements

    Analysis and improvement of policy gradient estimation (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    30 August 2012
    0 references
    0 references
    reinforcement learning
    0 references
    policy gradients
    0 references
    policy gradients with parameter-based exploration
    0 references
    variance reduction
    0 references