Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning (Q5189863)

scientific article; zbMATH DE number 5680295
Language: English
Label: Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Description: scientific article; zbMATH DE number 5680295
Also known as: (none listed)

    Statements

    Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning (English)
    11 March 2010

    Identifiers