Q4558161 (Q4558161): Difference between revisions

From MaRDI portal
Importer
Created a new Item
 
ReferenceBot
Changed an Item
 
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample mean based index policies by O(log n) regret for the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: finite parameter space / rank
 
Normal rank
Property / cites work
 
Property / cites work: Near-Optimal Regret Bounds for Thompson Sampling / rank
 
Normal rank
Property / cites work
 
Property / cites work: Tuning Bandit Algorithms in Stochastic Environments / rank
 
Normal rank
Property / cites work
 
Property / cites work: UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4252717 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Concentration Inequalities / rank
 
Normal rank
Property / cites work
 
Property / cites work: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Pure Exploration in Multi-armed Bandits Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: The multi-armed bandit problem with covariates / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal adaptive policies for sequential allocation problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Kullback-Leibler upper confidence bounds for optimal sequential allocation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4558474 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi‐Armed Bandit Allocation Indices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3857528 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An asymptotically optimal policy for finite support models in the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2788426 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Bayesian index policies for sequential resource allocation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time lower bounds for the two-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive treatment allocation and the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4558161 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Boundary crossing of Brownian motion. Its relation to the law of the iterated logarithm and to sequential analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5405246 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Introduction to nonparametric estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Asymptotic Minimax Theorem for the Two Armed Bandit Problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4934558 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Lemma 1 / rank
 
Normal rank
 

Latest revision as of 10:26, 17 July 2024

scientific article; zbMATH DE number 6982311
Language: English
Label: No label defined
Description: scientific article; zbMATH DE number 6982311

    Statements

    21 November 2018
    stochastic bandits
    sequential decision making
    regret minimisation
