Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (Q1886590): Difference between revisions
From MaRDI portal
Created claim: Wikidata QID (P12): Q40489238, #quickstatements; #temporary_batch_1707216511891
Set profile property.
Property / MaRDI profile type
Property / MaRDI profile type: MaRDI publication profile / rank: Normal rank
Revision as of 05:07, 5 March 2024
Language | Label | Description | Also known as |
---|---|---|---|
English | Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function | scientific article | |
Statements
Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (English)
18 November 2004
Internal prediction
Reliability
Model-free reinforcement learning
TD learning
Discount rate
Exploration-exploitation balance
Temperature parameter
Meta-learning