Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion (Q5485352): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Nonparametric bandit methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some problems of optimal sampling strategy / rank
 
Normal rank

Latest revision as of 19:45, 24 June 2024

scientific article; zbMATH DE number 5050616
Language Label Description Also known as
English
Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion
scientific article; zbMATH DE number 5050616

    Statements

    Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion (English)
    0 references
    0 references
    0 references
    0 references
    30 August 2006
    0 references

    Identifiers