Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes (Q5898263)

From MaRDI portal
Revision as of 10:37, 25 June 2024 by ReferenceBot (talk | contribs) (‎Changed an Item)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article; zbMATH DE number 5075625
Language Label Description Also known as
English
Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes
scientific article; zbMATH DE number 5075625

    Statements

    Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes (English)
    0 references
    22 November 2006
    0 references
    temporal-difference learning
    0 references
    neuro-dynamic programming
    0 references
    reinforcement learning
    0 references
    stochastic approximation
    0 references
    Markov chains
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references