Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning (Q6039766): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Set OpenAlex properties.
 
(One intermediate revision by one other user not shown)
Property / OpenAlex ID
 
Property / OpenAlex ID: W3206149081 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 08:45, 30 July 2024

scientific article; zbMATH DE number 7688060
Language Label Description Also known as
English
Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning
scientific article; zbMATH DE number 7688060

    Statements

    Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    23 May 2023
    0 references
    model-free RL
    0 references
    memory efficiency
    0 references
    variance reduction
    0 references
    Q-learning
    0 references
    upper confidence bounds
    0 references
    lower confidence bounds
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references