Nonparametric bandit methods
From MaRDI portal
Recommendations
- Bayesian nonparametric bandits
- Machine learning and nonparametric bandit theory
- The Nonstochastic Multiarmed Bandit Problem
- The multi-armed bandit problem: an efficient nonparametric solution
- Bandit algorithms
- A non-parametric solution to the multi-armed bandit problem with covariates
- Bandit convex optimization in non-stationary environments
- Nonstationary bandits with habituation and recovery dynamics
- Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
Cites work
- scientific article; zbMATH DE number 3168214 (Why is no real title available?)
- scientific article; zbMATH DE number 3856244 (Why is no real title available?)
- scientific article; zbMATH DE number 4078557 (Why is no real title available?)
- scientific article; zbMATH DE number 3310500 (Why is no real title available?)
- scientific article; zbMATH DE number 3418509 (Why is no real title available?)
- A statistical foundation for machine learning, with application to Go- Moku
- Adaptive treatment allocation and the multi-armed bandit problem
- Asymptotically efficient adaptive allocation rules
- Bayesian nonparametric bandits
- Genetic Algorithms and the Optimal Allocation of Trials
- Probability Inequalities for Sums of Independent Random Variables
- Random Search in the Presence of Noise, with Application to Machine Learning
- Randomised allocation of treatments in sequential trials
- Some One-Sided Theorems on the Tail Distribution of Sample Sums with Applications to the Last Time and Largest Excess of Boundary Crossings
- Some aspects of the sequential design of experiments
- The uniform convergence of nearest neighbor regression function estimators and their application in optimization
Cited in
(20)- The time until the final zero crossing of random sums with application to nonparametric bandit theory
- Negatively Correlated Bandits
- Machine learning and nonparametric bandit theory
- A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems
- Pure exploration in infinitely-armed bandit models with fixed-confidence
- Nonasymptotic sequential tests for overlapping hypotheses applied to near-optimal arm identification in bandit models
- A decision model and methodology for the AIDS epidemic
- Nonparametric Bayesian multiarmed bandits for single-cell experiment design
- Machine learning for optimal blackjack counting strategies
- An asymptotically optimal policy for finite support models in the multiarmed bandit problem
- Bandit and covariate processes, with finite or non-denumerable set of arms
- A non-parametric solution to the multi-armed bandit problem with covariates
- Sequential design with applications to the trim-loss problem
- Linearly parameterized bandits
- Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion
- Finite-time lower bounds for the two-armed bandit problem
- How Fast Is the Bandit?
- Learning vs earning trade-off with missing or censored observations: the two-armed Bayesian nonparametric beta-Stacy bandit problem
- MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS
- The multi-armed bandit problem: an efficient nonparametric solution
This page was built for publication: Nonparametric bandit methods
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q806690)