Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995): Difference between revisions

From MaRDI portal
Changed an Item
Set OpenAlex properties.
 
(3 intermediate revisions by 3 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1210.1136 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2896165 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal adaptive policies for sequential allocation problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Adaptive Policies for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: ASYMPTOTIC BAYES ANALYSIS FOR THE FINITE-HORIZON ONE-ARMED-BANDIT PROBLEM / rank
 
Normal rank
Property / cites work
 
Property / cites work: Kullback-Leibler upper confidence bounds for optimal sequential allocation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal stopping and dynamic allocation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4040465 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4391441 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4197923 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi‐Armed Bandit Allocation Indices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: An asymptotically optimal policy for finite support models in the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4219536 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Concentration inequalities and model selection. Ecole d'Eté de Probabilités de Saint-Flour XXXIII -- 2003. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2756704 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some aspects of the sequential design of experiments / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Theory of Apportionment / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotic Statistics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Graphical Models, Exponential Families, and Variational Inference / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sequential Tests of Statistical Hypotheses / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Gittins index for multiarmed bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3882215 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W3100329718 / rank
 
Normal rank

Latest revision as of 09:33, 30 July 2024

scientific article
Language Label Description Also known as
English
Kullback-Leibler upper confidence bounds for optimal sequential allocation
scientific article

    Statements

    Kullback-Leibler upper confidence bounds for optimal sequential allocation (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    25 September 2013
    0 references
    multi-armed bandit problems
    0 references
    upper confidence bound
    0 references
    Kullback-Leibler divergence
    0 references
    sequential testing
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references