Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (Q1886590)

scientific article

    Statements

    Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function (English)
    18 November 2004
    Internal prediction
    Reliability
    Model-free reinforcement learning
    TD learning
    Discount rate
    Exploration-exploitation balance
    Temperature parameter
    Meta-learning