An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: The Continuum-Armed Bandit Problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Nonstochastic Multiarmed Bandit Problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4821526 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal adaptive policies for sequential allocation problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Elements of Information Theory / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3046711 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Introduction to sensitivity and stability analysis in nonlinear programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4692329 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi-armed bandit problem revisited / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Multi-Armed Bandit Problem: Decomposition and Computation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: Exploration of multi-state environments: Local measures and back-propagation of uncertainty / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence of stochastic processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some aspects of the sequential design of experiments / rank
 
Normal rank
Property / cites work
 
Property / cites work: Non-overlapping domain decomposition for evolution operators / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonparametric bandit methods / rank
 
Normal rank

Latest revision as of 04:44, 5 July 2024

scientific article
Language Label Description Also known as
English
An asymptotically optimal policy for finite support models in the multiarmed bandit problem
scientific article

    Statements

    An asymptotically optimal policy for finite support models in the multiarmed bandit problem (English)
    0 references
    0 references
    0 references
    8 May 2012
    0 references
    0 references
    0 references
    0 references
    0 references
    bandit problems
    0 references
    finite-time regret
    0 references
    MED policy
    0 references
    convex optimization
    0 references
    0 references
    0 references
    0 references