Actor-critic algorithms based on symmetric perturbation sampling (Q2992408)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Actor-critic algorithms based on symmetric perturbation sampling |
scientific article; zbMATH DE number 6611258
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Actor-critic algorithms based on symmetric perturbation sampling |
scientific article; zbMATH DE number 6611258 |
Statements
10 August 2016
0 references
actor-critic method
0 references
symmetric perturbation sampling
0 references
continuous space
0 references
reinforcement learning
0 references
基于对称扰动采样的Actor-critic 算法 (English)
0 references
0.7666417360305786
0 references
0.7296214699745178
0 references
0.7287346720695496
0 references
0.724462628364563
0 references
0.718273401260376
0 references