Randomization in the two-armed bandit problem (Q750006)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Randomization in the two-armed bandit problem |
scientific article |
Statements
Randomization in the two-armed bandit problem (English)
0 references
1990
0 references
This paper gives an elementary proof of the existence of optimal solutions to a general form of the continuous-time two-armed bandit. The formulation is the same as that used by \textit{G. Mazziotto} and \textit{A. Millet} [Stochastics 22, 251-288 (1987; Zbl 0643.60040)]; however the topological embedding of the set of randomized optimal increasing paths is new and enables a resolution of the problem that requires only straightforward topological arguments. Also, one of the conditions in Mazziotto and Millet's paper can be removed, yielding a stronger result.
0 references
randomization
0 references
existence of optimal solutions
0 references
continuous-time two-armed bandit
0 references