Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis (Q5162625)

From MaRDI portal
Revision as of 08:47, 30 July 2024 by Openalex240730090724 (talk | contribs) (Set OpenAlex properties.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article; zbMATH DE number 7419556
Language Label Description Also known as
English
Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis
scientific article; zbMATH DE number 7419556

    Statements

    Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    3 November 2021
    0 references
    temporal difference learning
    0 references
    Polyak-Ruppert averaging
    0 references
    variance reduction
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references