Some reward–penalty rules for the multi-armed bandit problem which are asymptotically optimal (Q4743532)

scientific article; zbMATH DE number 3798793

Language	Label	Description	Also known as
default for all languages	No label defined
English	Some reward–penalty rules for the multi-armed bandit problem which are asymptotically optimal	scientific article; zbMATH DE number 3798793

Statements

instance of

scholarly article

0 references

title

Some reward–penalty rules for the multi-armed bandit problem which are asymptotically optimal (English)

0 references

published in

Advances in Applied Probability

0 references

publication date

1983

0 references

zbMATH Keywords

multirmed bandit problem

0 references

randomised allocation indices

0 references

Gittins index

0 references

mathematical learning

0 references

author

Kevin D. Glazebrook

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.2307/1426995

0 references

Identifiers

zbMATH Open document ID

0506.60067

0 references

DOI

10.2307/1426995

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:4743532