Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (Q892592)
From MaRDI portal
Full work available at URL: https://doi.org/10.1155/2015/264953
OpenAlex ID: W2010356817
Revision as of 20:19, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Combining multiple strategies for multiarmed bandit problems and asymptotic optimality | scientific article | |
Statements
Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (English)
19 November 2015
Summary: This brief paper presents a simple algorithm for stochastic multiarmed bandit problems that, at each time step, selects one strategy from a given set of strategies and plays the arm chosen by that strategy. The algorithm follows the idea of probabilistic \(\epsilon_t\)-switching from the \(\epsilon_t\)-greedy strategy and is asymptotically optimal in the sense that the selected strategy converges to the best strategy in the set, under some conditions on the strategies in the set and on the sequence \(\epsilon_t\).
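The switching idea in the summary can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the summary does not specify the exploration rule, the performance estimate, or the schedule, so uniform random exploration, the empirical mean reward per strategy, and \(\epsilon_t = \min(1, c/t)\) are all assumptions made here for illustration. The names `epsilon_schedule`, `always_arm`, and `combine_strategies` are hypothetical.

```python
import random


def epsilon_schedule(t, c=1.0):
    """Assumed decaying schedule eps_t = min(1, c / t); not taken from the paper."""
    return min(1.0, c / t)


def always_arm(k):
    """Toy strategy (hypothetical): always plays arm k."""
    return lambda rng: k


def combine_strategies(strategies, arm_means, horizon, seed=0):
    """Select one strategy per step by eps_t-switching and play its arm.

    strategies: callables taking an RNG and returning an arm index.
    arm_means:  Bernoulli success probability of each arm.
    Returns how many times each strategy was selected.
    """
    rng = random.Random(seed)
    n = len(strategies)
    totals = [0.0] * n  # cumulative reward earned while following each strategy
    counts = [0] * n    # number of steps on which each strategy was selected

    for t in range(1, horizon + 1):
        untried = [i for i in range(n) if counts[i] == 0]
        if untried:
            # Try every strategy once so that each empirical mean is defined.
            s = untried[0]
        elif rng.random() < epsilon_schedule(t):
            # With probability eps_t, switch to a uniformly random strategy.
            s = rng.randrange(n)
        else:
            # Otherwise follow the strategy with the best empirical mean reward.
            s = max(range(n), key=lambda i: totals[i] / counts[i])

        arm = strategies[s](rng)  # the selected strategy chooses the arm
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        totals[s] += reward
        counts[s] += 1

    return counts


if __name__ == "__main__":
    picks = combine_strategies([always_arm(0), always_arm(1)], [0.2, 0.8], 2000)
    print(picks)
```

In this sketch, exploration becomes rarer as \(\epsilon_t\) decays, so selection concentrates on whichever strategy has the highest observed average reward. Whether that strategy is actually the best one depends on conditions of the kind the summary mentions, which this toy version does not check.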
multiarmed bandit problems
asymptotic optimality
multiple strategies