Policy-based optimization: single-step policy gradient method seen as an evolution strategy (Q6365194)

From MaRDI portal
scientific article; zbMATH DE number 900478023
Language Label Description Also known as
English
Policy-based optimization: single-step policy gradient method seen as an evolution strategy
scientific article; zbMATH DE number 900478023

    Statements

    13 April 2021
    0 references
    0 references
    math.OC
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references