Multi-armed bandit experiments in the online service economy
From MaRDI portal
Publication:6574679
Cites work
- scientific article; zbMATH DE number 4087408
- scientific article; zbMATH DE number 3638998
- scientific article; zbMATH DE number 2172949
- scientific article; zbMATH DE number 6276176
- Asymptotically efficient adaptive allocation rules
- Bandit problems with infinitely many arms
- Finite-time analysis of the multiarmed bandit problem
- Learning to optimize via posterior sampling
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- Some aspects of the sequential design of experiments
- Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine
- Thompson sampling: an asymptotically optimal finite-time analysis
Cited in (5)