Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion (Q5485352): Difference between revisions

From MaRDI portal

Jump to:navigation, search

Latest revision as of 19:45, 24 June 2024

scientific article; zbMATH DE number 5050616

Language	Label	Description	Also known as
English	Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion	scientific article; zbMATH DE number 5050616

Statements

scholarly article

0 references

Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion (English)

0 references

Stephen J. Herschkorn

0 references

0 references

Sheldon M. Ross

0 references

Probability in the Engineering and Informational Sciences

0 references

publication date

30 August 2006

0 references

MaRDI profile type

MaRDI publication profile

0 references

Nonparametric bandit methods

0 references

Some problems of optimal sampling strategy

0 references

Identifiers

zbMATH Open document ID

0 references

10.1017/S0269964800004149

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

zbMATH DE Number

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:5485352

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q5485352&oldid=34887410"