The convergence of \(TD(\lambda)\) for general \(\lambda\) (Q1812934): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: A New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC) / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3292915 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Boolean complete neural model of adaptive behavior / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3996720 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5615053 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3809291 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5342712 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3724211 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An adaptive optimal controller for discrete-time Markov environments / rank
 
Normal rank

Latest revision as of 15:53, 14 May 2024

scientific article
Language Label Description Also known as
English
The convergence of \(TD(\lambda)\) for general \(\lambda\)
scientific article

    Statements

    The convergence of \(TD(\lambda)\) for general \(\lambda\) (English)
    0 references
    0 references
    11 August 1992
    0 references
    reinforcement learning
    0 references
    temporal differences
    0 references
    asynchronous dynamic programming
    0 references

    Identifiers