A bandit process with delayed responses
From MaRDI portal
Publication:1573129
DOI: 10.1016/S0167-7152(00)00011-0
zbMATH Open: 0959.62067
OpenAlex: W1987332701
Wikidata: Q127489012
Scholia: Q127489012
MaRDI QID: Q1573129
FDO: Q1573129
Authors: Xikui Wang
Publication date: 2 May 2001
Published in: Statistics & Probability Letters
Full work available at URL: https://doi.org/10.1016/s0167-7152(00)00011-0
Recommendations
- The two-armed bandit with delayed responses
- scientific article; zbMATH DE number 4056829
- One-armed bandit models with continuous and delayed responses
- Bernoulli multi-armed bandit problem under delayed feedback
- Indexability of bandit problems with response delays
- Delay and cooperation in nonstochastic bandits
- Bandit problems with Lévy processes
- Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards
Mathematics Subject Classification: Sequential statistical design (62L05); Applications of mathematical programming (90C90); Dynamic programming (90C39); Optimal stopping in statistics (62L15)
Cited In (8)
- New adaptive designs for delayed response models
- The two-armed bandit with delayed responses
- One-armed bandit models with continuous and delayed responses
- Asymptotic properties of bandit processes with geometric responses.
- Title not available
- Clinical trials with exponential survival times
- Generalisations of a Bayesian decision-theoretic randomisation procedure and the impact of delayed responses
- One-armed bandit process with a covariate