Randomization in the two-armed bandit problem (Q750006)

From MaRDI portal
scientific article

    Statements

    Randomization in the two-armed bandit problem (English)
    1990
    This paper gives an elementary proof of the existence of optimal solutions to a general form of the continuous-time two-armed bandit problem. The formulation is the same as that used by G. Mazziotto and A. Millet [Stochastics 22, 251-288 (1987; Zbl 0643.60040)]; however, the topological embedding of the set of randomized optimal increasing paths is new and allows the problem to be resolved using only straightforward topological arguments. In addition, one of the conditions in Mazziotto and Millet's paper can be removed, which yields a stronger result.
    randomization
    existence of optimal solutions
    continuous-time two-armed bandit

    Identifiers