A minimax and asymptotically optimal algorithm for stochastic bandits (Q4645646)

From MaRDI portal
scientific article; zbMATH DE number 6999898
Language Label Description Also known as
English
A minimax and asymptotically optimal algorithm for stochastic bandits
scientific article; zbMATH DE number 6999898

    Statements

    0 references
    0 references
    10 January 2019
    0 references
    stochastic multi-armed bandits
    0 references
    regret analysis
    0 references
    upper confidence bound (UCB)
    0 references
    minimax optimality
    0 references
    asymptotic optimality
    0 references
    stat.ML
    0 references
    cs.LG
    0 references
    math.ST
    0 references
    stat.TH
    0 references

    Identifiers