The optimal unbiased value estimator and its relation to LSTD, TD and MC (Q415609)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: The optimal unbiased value estimator and its relation to LSTD, TD and MC |
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | The optimal unbiased value estimator and its relation to LSTD, TD and MC |
scientific article |
Statements
The optimal unbiased value estimator and its relation to LSTD, TD and MC (English)
0 references
8 May 2012
0 references
optimal unbiased value estimator
0 references
maximum likelihood value estimator
0 references
sufficient statistics
0 references
Lehmann-Scheffe theorem
0 references
Monte Carlo estimation (MC)
0 references
temporal difference learning (TD)
0 references
least-squares temporal difference learning (LSTD)
0 references
0.7316581606864929
0 references
0.7253820896148682
0 references
0.7230085730552673
0 references
0.7214592099189758
0 references
0.7192857265472412
0 references