UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem (Q653803)

From MaRDI portal

Revision as of 23:56, 9 December 2024 by Import241208061232 (talk | contribs) (Normalize DOI.)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem	scientific article

Statements

scholarly article

0 references

UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem (English)

0 references

0 references

0 references

Periodica Mathematica Hungarica

0 references

publication date

19 December 2011

0 references

A modification of the UCB algorithm developed for the stochastic multi-armed bandit problem by \textit{P. Auer, N. Cesa-Bianchi} and \textit{P. Fischer} [Mach. Learn. 47, No. 2--3, 235--256 (2002; Zbl 1012.68093)] is presented, which leads to an improvement of the regret bounds of the original UCB algorithm. Both cases of known as well as unknown horizon T are considered.

0 references

zbMATH Keywords

stochastic multi-armed bandit problem

0 references

UCB algorithm

0 references

regret bounds

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1007/s10998-010-3055-6

0 references

Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem

0 references

Exploration-exploitation tradeoff using variance estimates in multi-armed bandits

0 references

Finite-time analysis of the multiarmed bandit problem

0 references

The Nonstochastic Multiarmed Bandit Problem

0 references

0 references

Probability Inequalities for Sums of Bounded Random Variables

0 references

Asymptotically efficient adaptive allocation rules

0 references

0 references

Identifiers

zbMATH Open document ID

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

10.1007/S10998-010-3055-6

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:653803

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q653803&oldid=38434071"