An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624): Difference between revisions

From MaRDI portal
Normalize DOI.
Import241208061232 (talk | contribs)
Normalize DOI.
 
Property / DOI
 
Property / DOI: 10.1007/S10994-011-5257-4 / rank
Normal rank
 
Property / DOI
 
Property / DOI: 10.1007/S10994-011-5257-4 / rank
 
Normal rank

Latest revision as of 16:55, 9 December 2024

scientific article
Language Label Description Also known as
English
An asymptotically optimal policy for finite support models in the multiarmed bandit problem
scientific article

    Statements

    An asymptotically optimal policy for finite support models in the multiarmed bandit problem (English)
    0 references
    0 references
    0 references
    8 May 2012
    0 references
    bandit problems
    0 references
    finite-time regret
    0 references
    MED policy
    0 references
    convex optimization
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references