TD-regularized actor-critic methods (Q2320580)
From MaRDI portal
!
WARNING
This is the item page for this Wikibase entity, intended for internal use and editing purposes.
Please use the normal view instead:
scientific article; zbMATH DE number 7097478
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | TD-regularized actor-critic methods |
scientific article; zbMATH DE number 7097478 |
Statements
TD-regularized actor-critic methods (English)
0 references
23 August 2019
0 references
reinforcement learning
0 references
actor-critic
0 references
temporal difference
0 references
0.7173351049423218
0 references
0.7132067084312439
0 references
0.7124791145324707
0 references
0.6806134581565857
0 references
0.6800230741500854
0 references