Gaussian process bandits with adaptive discretization
From MaRDI portal
Abstract: In this paper, the problem of maximizing a black-box function is studied in the Bayesian framework with a Gaussian Process (GP) prior. In particular, a new algorithm for this problem is proposed, and high probability bounds on its simple and cumulative regret are established. The query point selection rule in most existing methods involves an exhaustive search over an increasingly fine sequence of uniform discretizations of . The proposed algorithm, in contrast, adaptively refines which leads to a lower computational complexity, particularly when is a subset of a high dimensional Euclidean space. In addition to the computational gains, sufficient conditions are identified under which the regret bounds of the new algorithm improve upon the known results. Finally an extension of the algorithm to the case of contextual bandits is proposed, and high probability bounds on the contextual regret are presented.
Recommendations
Cites work
- scientific article; zbMATH DE number 6433488 (Why is no real title available?)
- scientific article; zbMATH DE number 5485582 (Why is no real title available?)
- Chaining, interpolation, and convexity
- Contextual bandits with similarity information
- Convergence rates of efficient global optimization algorithms
- First Passage Time for a Particular Gaussian Process
- From bandits to Monte-Carlo tree search: the optimistic principle applied to optimization and planning
- Gaussian processes for machine learning.
- Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting
- Learning to optimize via posterior sampling
- Multi-fidelity Gaussian process bandit optimisation
- Prediction, Learning, and Games
- Pure exploration in finitely-armed and continuous-armed bandits
- The multi-armed bandit problem with covariates
- \(X\)-armed bandits
Cited in
(11)- Optimization on Manifolds via Graph Gaussian Processes
- Adaptive-treed bandits
- Multi-fidelity Gaussian process bandit optimisation
- A PAC algorithm in relative precision for bandit problem with costly sampling
- Bayesian optimization with partially specified queries
- No-regret Bayesian optimization with unknown hyperparameters
- scientific article; zbMATH DE number 6433488 (Why is no real title available?)
- High-dimensional Bayesian optimization with projections using quantile Gaussian processes
- Contextual ranking and selection with Gaussian processes and optimal computing budget allocation
- Stochastic zeroth order descent with structured directions
- Gaussian process modelling of dependencies in multi-armed bandit problems
This page was built for publication: Gaussian process bandits with adaptive discretization
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1711556)