Policy-based optimization: single-step policy gradient method seen as an evolution strategy (Q6365194)
From MaRDI portal
scientific article; zbMATH DE number 900478023
Language | Label | Description | Also known as |
---|---|---|---|
English | Policy-based optimization: single-step policy gradient method seen as an evolution strategy |
scientific article; zbMATH DE number 900478023 |
Statements
13 April 2021
0 references
math.OC
0 references