Exploration-exploitation tradeoff using variance estimates in multi-armed bandits (Q1017665): Difference between revisions
From MaRDI portal
Property / cites work: Sample mean based index policies by O(log n) regret for the multi-armed bandit problem (normal rank)
Property / cites work: Finite-time analysis of the multiarmed bandit problem (normal rank)
Property / cites work: On tail probabilities for martingales (normal rank)
Property / cites work: Q4692329 (normal rank)
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables (normal rank)
Property / cites work: Asymptotically efficient adaptive allocation rules (normal rank)
Property / cites work: Machine learning and nonparametric bandit theory (normal rank)
Property / cites work: Some aspects of the sequential design of experiments (normal rank)
Revision as of 13:12, 1 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Exploration-exploitation tradeoff using variance estimates in multi-armed bandits | scientific article | |
Statements
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits (English)
12 May 2009
exploration-exploitation tradeoff
multi-armed bandits
Bernstein's inequality
high-probability bound
risk analysis