The time until the final zero crossing of random sums with application to nonparametric bandit theory (Q1335239)

From MaRDI portal
scientific article

    Statements

    The time until the final zero crossing of random sums with application to nonparametric bandit theory (English)
    28 September 1994
    Motivated by problems in machine learning and, more fundamentally, by non-Bayesian nonparametric problems in the sequential design of experiments, the present work addresses the task of obtaining probability bounds for the number of times suboptimal bandits are chosen in a nonterminating sequence of experiments. To the author's knowledge, only the growth of the expected number of incorrect choices had been examined previously. The derivation is founded, in part, on new contributions to the theory of zero crossings for sums of biased, independent, identically distributed random variables.
    nonparametric bandit theory
    random sums
    sums of biased independent identically distributed random variables
    non-Bayesian nonparametric problems
    probability bounds
    zero crossings

    Identifiers