Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (Q1886590)

From MaRDI portal

Revision as of 13:07, 1 February 2024

scientific article
Language Label Description Also known as
English
Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function
scientific article

    Statements

    Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (English)
    18 November 2004
    Internal prediction
    Reliability
    Model-free reinforcement learning
    TD learning
    Discount rate
    Exploration-exploitation balance
    Temperature parameter
    Meta-learning