Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (Q892592)

scientific article; zbMATH DE number 6511718

Language	Label	Description	Also known as
default for all languages	No label defined
English	Combining multiple strategies for multiarmed bandit problems and asymptotic optimality	scientific article; zbMATH DE number 6511718

Statements

instance of

scholarly article

0 references

title

Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (English)

0 references

published in

Journal of Control Science and Engineering

0 references

publication date

19 November 2015

0 references

review text

Summary: This brief paper provides a simple algorithm that, at each time step, selects a strategy from a given set of strategies for stochastic multiarmed bandit problems and plays the arm chosen by that strategy. The algorithm follows the idea of the probabilistic \(\epsilon_t\)-switching in the \(\epsilon_t\)-greedy strategy and is asymptotically optimal in the sense that the selected strategy converges to the best in the set, under some conditions on the strategies in the set and on the sequence \(\epsilon_t\).

0 references
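The switching idea in the summary above can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the schedule \(\epsilon_t = \min(1, c/t)\), the sample-mean score for each strategy, the Bernoulli arms, and all function names (`combine_strategies`, `always`, `uniform_random`, `make_bernoulli_pull`) are assumptions made only for illustration. At each time step the sketch picks a strategy uniformly at random with probability \(\epsilon_t\), and otherwise picks the strategy with the highest average reward observed so far; the selected strategy then chooses the arm to play.

```python
import random


def epsilon_schedule(t, c=5.0):
    """Assumed exploration schedule: eps_t = min(1, c / t) for t >= 1."""
    return min(1.0, c / t)


def make_bernoulli_pull(probs):
    """Hypothetical environment: arm i pays 1.0 with probability probs[i]."""
    def pull(arm, rng):
        return 1.0 if rng.random() < probs[arm] else 0.0
    return pull


def always(arm):
    """Hypothetical strategy that always plays the same arm."""
    return lambda t, rng: arm


def uniform_random(num_arms):
    """Hypothetical strategy that plays an arm uniformly at random."""
    return lambda t, rng: rng.randrange(num_arms)


def combine_strategies(strategies, pull, horizon, seed=0, c=5.0):
    """Select one strategy per time step by eps_t-switching.

    Returns the list of selected strategy indices, the sample-mean reward
    credited to each strategy, and how often each strategy was selected.
    """
    rng = random.Random(seed)
    n = len(strategies)
    counts = [0] * n
    means = [0.0] * n
    chosen = []
    for t in range(1, horizon + 1):
        if rng.random() < epsilon_schedule(t, c):
            k = rng.randrange(n)  # explore: uniformly random strategy
        else:
            k = max(range(n), key=lambda i: means[i])  # exploit: best so far
        arm = strategies[k](t, rng)  # the selected strategy picks the arm
        reward = pull(arm, rng)
        counts[k] += 1
        means[k] += (reward - means[k]) / counts[k]  # incremental mean
        chosen.append(k)
    return chosen, means, counts


# Usage: three strategies over three Bernoulli arms (values are made up).
probs = [0.2, 0.5, 0.8]
strategies = [always(0), uniform_random(3), always(2)]
chosen, means, counts = combine_strategies(
    strategies, make_bernoulli_pull(probs), horizon=5000
)
print(counts, [round(m, 3) for m in means])
```

In this sketch only the selected strategy's score is updated at each step, and the strategies themselves are stateless; learning strategies would additionally need the observed reward fed back to them, which is omitted here for brevity.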

zbMATH Keywords

multiarmed bandit problems

0 references

asymptotic optimality

0 references

multiple strategies

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL