Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (Q1886590)
From MaRDI portal
Revision as of 13:07, 1 February 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function | scientific article | |
Statements
Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (English)
18 November 2004
Internal prediction
Reliability
Model-free reinforcement learning
TD learning
Discount rate
Exploration-exploitation balance
Temperature parameter
Meta-learning