The convergence of \(TD(\lambda)\) for general \(\lambda\) (Q1812934)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: The convergence of TD() for general |
scientific article; zbMATH DE number 946
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | The convergence of \(TD(\lambda)\) for general \(\lambda\) |
scientific article; zbMATH DE number 946 |
Statements
The convergence of \(TD(\lambda)\) for general \(\lambda\) (English)
0 references
11 August 1992
0 references
reinforcement learning
0 references
temporal differences
0 references
asynchronous dynamic programming
0 references
0 references
0.8403036594390869
0 references
0.840119481086731
0 references
0.8313186168670654
0 references
0.8265856504440308
0 references
0.8168659806251526
0 references