Natural actor-critic algorithms (Q1049136)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Natural actor-critic algorithms |
scientific article; zbMATH DE number 5655161
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Natural actor-critic algorithms |
scientific article; zbMATH DE number 5655161 |
Statements
Natural actor-critic algorithms (English)
0 references
8 January 2010
0 references
actor-critic reinforcement learning algorithms
0 references
policy-gradient methods
0 references
approximate dynamic programming
0 references
function approximation
0 references
two-timescale stochastic approximation
0 references
temporal difference learning
0 references
natural gradient
0 references
0 references
0 references
0 references
0 references
0.863670289516449
0 references
0.8556258678436279
0 references
0.8158515095710754
0 references
0.7965922355651855
0 references
0.7919870018959045
0 references