Machine learning and nonparametric bandit theory
From MaRDI portal
Publication:4850249
DOI: 10.1109/9.400491
zbMATH Open: 0883.62090
OpenAlex: W2143552356
MaRDI QID: Q4850249
FDO: Q4850249
Authors: Tze Leung Lai, S. Yakowitz
Publication date: 9 October 1995
Published in: IEEE Transactions on Automatic Control
Full work available at URL: https://semanticscholar.org/paper/fe611ab14edf4c19fa90e03da42589b0e9c5d5ec
Recommendations
- Nonparametric bandit methods
- Bayesian nonparametric bandits
- Mechanisms with learning for stochastic multi-armed bandit problems
- Optimal learning and experimentation in bandit problems.
- Machine learning. Theory and applications
- Machine Learning
- Machine learning. A probabilistic perspective
- A learning algorithm for the finite-time two-armed bandit problem
- scientific article; zbMATH DE number 1332320
- The Nonstochastic Multiarmed Bandit Problem
Nonparametric inference (62G99); Optimal stopping in statistics (62L15); Stochastic learning and adaptive control (93E35)
Cited In (15)
- Sequential design with applications to the trim-loss problem
- Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
- Methods and theory for off-line machine learning
- Arbitrary side observations in bandit problems
- Nonparametric bandit methods
- Bandit and covariate processes, with finite or non-denumerable set of arms
- Gaussian process modelling of dependencies in multi-armed bandit problems
- MULTI-ARMED BANDITS WITH COVARIATES: THEORY AND APPLICATIONS
- A linear response bandit problem
- Convergence of least squares learning in self-referential discontinuous stochastic models.
- Active learning in heteroscedastic noise
- The time until the final zero crossing of random sums with application to nonparametric bandit theory
- Tuning Bandit Algorithms in Stochastic Environments
- Learning unknown service rates in queues: a multiarmed bandit approach
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits