Policy evaluation with temporal differences: a survey and comparison (Q2934010)
From MaRDI portal
scientific article; zbMATH DE number 6378096
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Policy evaluation with temporal differences: a survey and comparison | scientific article; zbMATH DE number 6378096 | |
Statements
8 December 2014
differences
policy evaluation
value function estimation
reinforcement learning
0.7850978970527649
0.7684051990509033
0.7684051990509033
0.7653366327285767
0.7638019919395447