Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion (Q5485352): Difference between revisions
From MaRDI portal
Set profile property. |
ReferenceBot (talk | contribs) Changed an Item |
||
(One intermediate revision by one other user not shown) | |||
Property / cites work | |||
Property / cites work: Nonparametric bandit methods / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Some problems of optimal sampling strategy / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 19:45, 24 June 2024
scientific article; zbMATH DE number 5050616
Language | Label | Description | Also known as |
---|---|---|---|
English | Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion |
scientific article; zbMATH DE number 5050616 |
Statements
Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion (English)
0 references
30 August 2006
0 references