On the existence of fixed points for approximate value iteration and temporal-difference learning (Q1586803): Difference between revisions

From MaRDI portal
RedirectionBot (talk | contribs)
Removed claim: author (P16): Item:Q399882
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / author
 
Property / author: Benjamin van Roy / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Functional Approximations and Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: The convergence of \(TD(\lambda)\) for general \(\lambda\) / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4886156 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3997575 / rank
 
Normal rank

Latest revision as of 08:59, 3 June 2024

scientific article
Language Label Description Also known as
English
On the existence of fixed points for approximate value iteration and temporal-difference learning
scientific article

    Statements

    On the existence of fixed points for approximate value iteration and temporal-difference learning (English)
    0 references
    0 references
    0 references
    19 February 2001
    0 references
    dynamic programming
    0 references
    neurodynamic programming
    0 references
    reinforcement learning
    0 references
    temporal-difference learning
    0 references
    value iteration
    0 references

    Identifiers