Discrete multiarmed bandits and multiparameter processes (Q1317211)

From MaRDI portal
scientific article

    Statements

    Discrete multiarmed bandits and multiparameter processes (English)
    21 April 1994
    The author reformulates the multiarmed bandit problem in discrete time as an optimal stochastic control problem for a multiparameter process. Within this framework, the dynamic allocation index, the so-called Gittins index, becomes a multiparameter process, and it is shown how it leads to optimal solutions. The main advantage of such an approach is that it provides a convenient and elegant representation of switching strategies by using the notion of optimal increasing paths or strategies over a partially ordered set.
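As background on the index mentioned above, here is a minimal sketch that is not taken from the article. It assumes the classical discounted setting, in which a single arm is a finite-state Markov chain and the Gittins index of a state can be computed through the restart-in-state formulation (Katehakis and Veinott). The function name gittins_indices, the transition matrix P, the reward vector r, and the discount factor beta are all illustrative assumptions; the article's multiparameter-process framework is more general and is not reproduced here.

```python
import numpy as np


def gittins_indices(P, r, beta, tol=1e-10, max_iter=100_000):
    """Gittins index of every state of one arm (illustrative sketch).

    Assumptions (not from the article): the arm is a finite-state Markov
    chain with transition matrix P, per-state reward r, and discount
    factor 0 < beta < 1.
    """
    P = np.asarray(P, dtype=float)
    r = np.asarray(r, dtype=float)
    n = len(r)
    indices = np.empty(n)
    for i in range(n):
        # Restart problem for state i: in every state, either continue
        # from the current state or restart the arm in state i.
        V = np.zeros(n)
        for _ in range(max_iter):
            cont = r + beta * (P @ V)           # value of continuing
            V_new = np.maximum(cont, cont[i])   # continue or restart in i
            done = np.max(np.abs(V_new - V)) < tol
            V = V_new
            if done:
                break
        # The index is the restart value at i, scaled to a per-step reward.
        indices[i] = (1.0 - beta) * V[i]
    return indices
```

For example, with P = [[0, 1], [0, 1]], r = [0, 1] and beta = 0.9 (state 0 pays nothing and moves to an absorbing state that pays 1), the sketch returns indices of approximately 0.9 and 1.0. Under the classical result, playing at each step an arm whose current state has the largest index is optimal.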
    multiarmed bandit problem
    dynamic allocation index
    Gittins index
    switching strategies

    Identifiers