Discrete multiarmed bandits and multiparameter processes (Q1317211)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Discrete multiarmed bandits and multiparameter processes |
scientific article |
Statements
Discrete multiarmed bandits and multiparameter processes (English)
0 references
21 April 1994
0 references
The author reformulates the multiarmed bandit problem in discrete time as an optimal stochastic control problem for a multiparameter process. Within this framework, the dynamic allocation index, the so-called Gittins index, becomes a multiparameter process, and it is shown how it leads to optimal solutions. The main advantage of such an approach is that it provides a convenient and elegant representation of switching strategies by using the notion of optimal increasing paths or strategies over a partially ordered set.
0 references
multiarmed bandit problem
0 references
dynamic allocation index
0 references
Gittins index
0 references
switching strategies
0 references