Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (Q892592)

From MaRDI portal





scientific article; zbMATH DE number 6511718
Language Label Description Also known as
default for all languages
No label defined
    English
    Combining multiple strategies for multiarmed bandit problems and asymptotic optimality
    scientific article; zbMATH DE number 6511718

      Statements

      Combining multiple strategies for multiarmed bandit problems and asymptotic optimality (English)
      0 references
      0 references
      0 references
      19 November 2015
      0 references
      Summary: This brief paper provides a simple algorithm that selects a strategy at each time in a given set of multiple strategies for stochastic multiarmed bandit problems, thereby playing the arm by the chosen strategy at each time. The algorithm follows the idea of the probabilistic \(\epsilon_t\)-switching in the \(\epsilon_t\)-greedy strategy and is asymptotically optimal in the sense that the selected strategy converges to the best in the set under some conditions on the strategies in the set and the sequence of \(\epsilon_t\).
      0 references
      multiarmed bandit problems
      0 references
      asymptotic optimality
      0 references
      multiple strategies
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references