Tuning Bandit Algorithms in Stochastic Environments (Q3520056): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
ReferenceBot (talk | contribs) Changed an Item |
||
Property / cites work | |||
Property / cites work: Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Asymptotically efficient adaptive allocation rules / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Machine learning and nonparametric bandit theory / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Some aspects of the sequential design of experiments / rank | |||
Normal rank |
Latest revision as of 13:59, 28 June 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Tuning Bandit Algorithms in Stochastic Environments |
scientific article |
Statements
Tuning Bandit Algorithms in Stochastic Environments (English)
0 references
19 August 2008
0 references
0 references