Balanced Q-learning: combining the influence of optimistic and pessimistic targets (Q6067050): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Set OpenAlex properties.
 
(One intermediate revision by one other user not shown)
Property / OpenAlex ID
 
Property / OpenAlex ID: W3211287969 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 10:43, 30 July 2024

scientific article; zbMATH DE number 7777848
Language Label Description Also known as
English
Balanced Q-learning: combining the influence of optimistic and pessimistic targets
scientific article; zbMATH DE number 7777848

    Statements

    Balanced Q-learning: combining the influence of optimistic and pessimistic targets (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    14 December 2023
    0 references
    0 references
    reinforcement learning
    0 references
    maximization bias
    0 references
    \(Q\)-learning target
    0 references
    optimistic updates
    0 references
    pessimistic updates
    0 references
    0 references