Tuning Bandit Algorithms in Stochastic Environments (Q3520056): Difference between revisions

From MaRDI portal

Jump to:navigation, search

Latest revision as of 13:59, 28 June 2024

scientific article

Language	Label	Description	Also known as
English	Tuning Bandit Algorithms in Stochastic Environments	scientific article

Statements

scholarly article

0 references

Tuning Bandit Algorithms in Stochastic Environments (English)

0 references

Jean-Yves Audibert

0 references

0 references

Csaba Szepesvári

0 references

Lecture Notes in Computer Science

0 references

publication date

19 August 2008

0 references

full work available at URL

https://hal.inria.fr/inria-00203487/file/ucb_alt.pdf

0 references

MaRDI profile type

MaRDI publication profile

0 references

Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem

0 references

Finite-time analysis of the multiarmed bandit problem

0 references

Asymptotically efficient adaptive allocation rules

0 references

Machine learning and nonparametric bandit theory

0 references

Some aspects of the sequential design of experiments

0 references

Identifiers

zbMATH Open document ID

0 references

10.1007/978-3-540-75225-7_15

0 references

Mathematics Subject Classification ID

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:3520056

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q3520056&oldid=35067501"