The time until the final zero crossing of random sums with application to nonparametric bandit theory (Q1335239)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | The time until the final zero crossing of random sums with application to nonparametric bandit theory |
scientific article |
Statements
The time until the final zero crossing of random sums with application to nonparametric bandit theory (English)
0 references
28 September 1994
0 references
Motivated by problems in machine learning and more fundamentally by non- Bayesian nonparametric problems in sequential design of experiments, the present work deals with the task of attaining probability bounds for the number of times suboptimal bandits are chosen in a nonterminating sequence of experiments. To the author's knowledge, previously only the growth of the expectation of incorrect choices has been examined. The derivation is founded, in part, on new contributions to the theory of zero crossings for sums of biased, independent, identically distributed random variables.
0 references
nonparametric bandit theory
0 references
random sums
0 references
sums of biased independent identically distributed random variables
0 references
non-Bayesian nonparametric problems
0 references
probability bounds
0 references
zero crossings
0 references
0 references