Error controlled actor-critic (Q6205028)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Error controlled actor-critic |
scientific article; zbMATH DE number 7830738
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Error controlled actor-critic |
scientific article; zbMATH DE number 7830738 |
Statements
Error controlled actor-critic (English)
0 references
11 April 2024
0 references
reinforcement learning
0 references
actor-critic
0 references
approximation error
0 references
overestimation
0 references
KL-divergence
0 references
0 references
0 references
0 references
0.729452908039093
0 references
0.7288190722465515
0 references
0.727982223033905
0 references
0.727603018283844
0 references
0.7211597561836243
0 references