A minimax and asymptotically optimal algorithm for stochastic bandits (Q4645646)

scientific article; zbMATH DE number 6999898

Statements

instance of

0 references

0 references

0 references

10 January 2019

full work available at URL

https://arxiv.org/abs/1702.07211

http://proceedings.mlr.press/v76/m%C3%A9nard17a.html

zbMATH Keywords

stochastic multi-armed bandits

regret analysis

upper confidence bound (UCB)

minimax optimality

asymptotic optimality

MaRDI profile type

Publication

arXiv classification

stat.ML

cs.LG

math.ST

stat.TH

Identifiers

zbMATH Open document ID

1407.62046

Mathematics Subject Classification ID

Sitelinks

Mathematics (1 entry)

mardi Publication:4645646