Nonasymptotic sequential tests for overlapping hypotheses applied to near-optimal arm identification in bandit models

DOI10.1080/07474946.2021.1847965MaRDI QIDQ4987192zbMATH OpenOpenAlexFDO

Authors Emilie Kaufmann, Aurélien Garivier

Publication date 29 April 2021

Published in Sequential Analysis (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1905.03495

zbMATH Keywords

generalized likelihood ratio test multi-armed bandits sequential testing best arm identification

Mathematics Subject Classification ID

Parametric hypothesis testing (62F03) Sequential statistical analysis (62L10)

Abstract: In this paper, we study sequential testing problems with emph{overlapping} hypotheses. We first focus on the simple problem of assessing if the mean

m u

of a Gaussian distribution is smaller or larger than a fixed

e p s i l o n > 0

; if

m u i n (- e p s i l o n, e p s i l o n)

, both answers are considered to be correct. Then, we consider PAC-best arm identification in a bandit model: given

K

probability distributions on

m a t h b b R

with means

m u_{1}, d o t s, m u_{K}

, we derive the asymptotic complexity of identifying, with risk at most

d e l t a

, an index

I i n 1, d o t s, K

such that

m u_{I} g e q m a x_{i} m u_{i} - e p s i l o n

. We provide non-asymptotic bounds on the error of a parallel General Likelihood Ratio Test, which can also be used for more general testing problems. We further propose lower bound on the number of observation needed to identify a correct hypothesis. Those lower bounds rely on information-theoretic arguments, and specifically on two versions of a change of measure lemma (a high-level form, and a low-level form) whose relative merits are discussed.

Recommendations

Cites work

Cited in

(3)

This page was built for publication: Nonasymptotic sequential tests for overlapping hypotheses applied to near-optimal arm identification in bandit models

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4987192)