Adaptive-treed bandits

DOI10.3150/14-BEJ644MaRDI QIDQ888482zbMATH OpenFDO

Publication date 30 October 2015

Published in Bernoulli (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1302.2489, https://projecteuclid.org/euclid.bj/1438777594

bandits on taxonomies continuum-armed bandits noisy global optimisation tree-armed bandits zooming dimension

Asymptotic properties of nonparametric inference (62G20) Sequential statistical design (62L05) Nonconvex programming, global optimization (90C26)

Abstract: We describe a novel algorithm for noisy global optimisation and continuum-armed bandits, with good convergence properties over any continuous reward function having finitely many polynomial maxima. Over such functions, our algorithm achieves square-root regret in bandits, and inverse-square-root error in optimisation, without prior information. Our algorithm works by reducing these problems to tree-armed bandits, and we also provide new results in this setting. We show it is possible to adaptively combine multiple trees so as to minimise the regret, and also give near-matching lower bounds on the regret in terms of the zooming dimension.

Recommendations

Cites work

Cited in

(9)

This page was built for publication: Adaptive-treed bandits

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q888482)