A minimax and asymptotically optimal algorithm for stochastic bandits (Q4645646)

From MaRDI portal
Revision as of 19:26, 19 April 2024 by Importer (talk | contribs) (‎Changed an Item)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)





scientific article; zbMATH DE number 6999898
Language Label Description Also known as
English
A minimax and asymptotically optimal algorithm for stochastic bandits
scientific article; zbMATH DE number 6999898

    Statements

    0 references
    0 references
    10 January 2019
    0 references
    stochastic multi-armed bandits
    0 references
    regret analysis
    0 references
    upper confidence bound (UCB)
    0 references
    minimax optimality
    0 references
    asymptotic optimality
    0 references
    stat.ML
    0 references
    cs.LG
    0 references
    math.ST
    0 references
    stat.TH
    0 references

    Identifiers