Boundary crossing probabilities for general exponential families

From MaRDI portal
Publication:722599

DOI10.3103/S1066530718010015zbMATH Open1395.62248arXiv1705.08814WikidataQ115488668 ScholiaQ115488668MaRDI QIDQ722599FDOQ722599


Authors: Odalric-Ambrym Maillard Edit this on Wikidata


Publication date: 27 July 2018

Published in: Mathematical Methods of Statistics (Search for Journal in Brave)

Abstract: We consider parametric exponential families of dimension K on the real line. We study a variant of extit{boundary crossing probabilities} coming from the multi-armed bandit literature, in the case when the real-valued distributions form an exponential family of dimension K. Formally, our result is a concentration inequality that bounds the probability that mathcalBpsi(hathetan,hetastar)geqf(t/n)/n, where hetastar is the parameter of an unknown target distribution, hathetan is the empirical parameter estimate built from n observations, psi is the log-partition function of the exponential family and mathcalBpsi is the corresponding Bregman divergence. From the perspective of stochastic multi-armed bandits, we pay special attention to the case when the boundary function f is logarithmic, as it is enables to analyze the regret of the state-of-the-art KLUCB and KLUCBp strategies, whose analysis was left open in such generality. Indeed, previous results only hold for the case when K=1, while we provide results for arbitrary finite dimension K, thus considerably extending the existing results. Perhaps surprisingly, we highlight that the proof techniques to achieve these strong results already existed three decades ago in the work of T.L. Lai, and were apparently forgotten in the bandit community. We provide a modern rewriting of these beautiful techniques that we believe are useful beyond the application to stochastic multi-armed bandits.


Full work available at URL: https://arxiv.org/abs/1705.08814




Recommendations




Cites Work


Cited In (3)





This page was built for publication: Boundary crossing probabilities for general exponential families

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q722599)