Adaptive treatment allocation and the multi-armed bandit problem (Q1102059): Difference between revisions

Latest revision as of 02:55, 20 March 2024

scientific article

Language	Label	Description	Also known as
English	Adaptive treatment allocation and the multi-armed bandit problem	scientific article

Statements

instance of

scholarly article

0 references

title

Adaptive treatment allocation and the multi-armed bandit problem (English)

0 references

published in

The Annals of Statistics

0 references

publication date

1987

0 references

review text

There are \(k\) distinct statistical populations, each specified by a univariate density function characterized by a parameter of unknown value. The question concerns how \(x_1, x_2, \ldots, x_N\) should be sampled sequentially from the \(k\) populations in order to maximize (in some sense) the mean value of their sum. A class of simple allocation rules based on upper confidence bounds for the population parameters is proposed. These rules are shown to exhibit asymptotic optimality in both a Bayesian and a frequentist sense. A simulation study provides evidence that the rules perform well even for moderate values of \(N\).

0 references
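
The idea of allocating by upper confidence bounds, as described in the review text, can be illustrated with a minimal sketch. It uses a UCB1-style index (sample mean plus an exploration bonus), which is a later and simpler rule and not the allocation rule analyzed in the article. The Bernoulli populations, the horizon, the seed, and the names `ucb_allocate` and `bernoulli` are assumptions made only for illustration.

```python
import math
import random


def ucb_allocate(arms, horizon, seed=0):
    """Sample sequentially from `arms` using a UCB1-style index.

    `arms` is a list of callables; each takes a random.Random instance and
    returns a reward in [0, 1]. Returns (pull counts per arm, total reward).
    This is an illustrative index rule, not the rule from the article.
    """
    rng = random.Random(seed)
    k = len(arms)
    counts = [0] * k    # number of samples drawn from each population
    sums = [0.0] * k    # sum of rewards observed from each population
    total = 0.0

    for t in range(1, horizon + 1):
        if t <= k:
            # Sample every population once so each index is defined.
            arm = t - 1
        else:
            # Pick the population with the largest upper confidence bound:
            # sample mean plus a bonus that shrinks as the arm is sampled.
            arm = max(
                range(k),
                key=lambda j: sums[j] / counts[j]
                + math.sqrt(2.0 * math.log(t) / counts[j]),
            )
        reward = arms[arm](rng)
        counts[arm] += 1
        sums[arm] += reward
        total += reward

    return counts, total


def bernoulli(p):
    """Hypothetical population: reward 1 with probability p, else 0."""
    return lambda rng: 1.0 if rng.random() < p else 0.0


# Three assumed populations with success probabilities 0.2, 0.5, 0.8.
counts, total = ucb_allocate(
    [bernoulli(0.2), bernoulli(0.5), bernoulli(0.8)], horizon=2000
)
print(counts, total)
```

In this sketch the population with the highest success probability should end up receiving most of the samples, while the exploration bonus keeps the others from being abandoned too early.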

zbMATH Keywords

adaptive treatment allocation

0 references

multi-armed bandit problem

0 references

boundary crossing

0 references

adaptive control

0 references

dynamic allocation

0 references

upper confidence bounds

0 references

asymptotic optimality

0 references

simulation study

0 references

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1214/aos/1176350495

0 references

Identifiers

zbMATH Open document ID

0643.62054

0 references

DOI

10.1214/aos/1176350495

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1102059

@@ Property / author @@
-Tze Leung Lai
@@ Property / author: Tze Leung Lai / rank @@
-Normal rank
@@ Property / reviewed by @@
-Kevin D. Glazebrook
@@ Property / reviewed by: Kevin D. Glazebrook / rank @@
-Normal rank
@@ Property / author @@
+Tze Leung Lai
@@ Property / author: Tze Leung Lai / rank @@
+Normal rank
@@ Property / reviewed by @@
+Kevin D. Glazebrook
@@ Property / reviewed by: Kevin D. Glazebrook / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1214/aos/1176350495
@@ Property / full work available at URL: https://doi.org/10.1214/aos/1176350495 / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W1973885534
@@ Property / OpenAlex ID: W1973885534 / rank @@
+Normal rank