Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage (Q5882386)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage |
scientific article; zbMATH DE number 7663649
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage |
scientific article; zbMATH DE number 7663649 |
Statements
Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage (English)
0 references
15 March 2023
0 references
dynamic programming
0 references
approximate dynamic programming
0 references
approximate policy iteration
0 references
Bellman error minimization
0 references
direct policy search
0 references
energy storage
0 references
0 references
0 references
0 references
0 references
0 references
0 references
0 references
0.7290368676185608
0 references
0.7289639711380005
0 references
0.7266700863838196
0 references
0.7251091003417969
0 references