Policy-based optimization: single-step policy gradient method seen as an evolution strategy (Q6365194)

From MaRDI portal
Revision as of 10:11, 10 July 2024 by Import240710060729 (talk | contribs) (Added link to MaRDI item.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article; zbMATH DE number 900478023
Language Label Description Also known as
English
Policy-based optimization: single-step policy gradient method seen as an evolution strategy
scientific article; zbMATH DE number 900478023

    Statements

    13 April 2021
    0 references
    math.OC
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references