Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning (Q6148353): Difference between revisions

From MaRDI portal
ReferenceBot (talk | contribs)
Changed an Item
Normalize DOI.
 
Property / DOI
 
Property / DOI: 10.1137/22m1499261 / rank
Normal rank
 
Property / DOI
 
Property / DOI: 10.1137/22M1499261 / rank
 
Normal rank

Latest revision as of 18:54, 30 December 2024

scientific article; zbMATH DE number 7786787
Language Label Description Also known as
English
Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning
scientific article; zbMATH DE number 7786787

    Statements

    Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning (English)
    0 references
    0 references
    0 references
    0 references
    11 January 2024
    0 references
    reinforcement learning
    0 references
    \(Q\)-learning
    0 references
    linear function approximation
    0 references
    finite-sample analysis
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references