Learning the distribution with largest mean: two bandit frameworks (Q4606431): Difference between revisions

From MaRDI portal
RedirectionBot (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
(3 intermediate revisions by 3 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2584453124 / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1702.00001 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: finite parameter space / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3886056 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2739396 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5610811 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3240573 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3809068 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5405258 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Pure exploration in finitely-armed and continuous-armed bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal adaptive policies for sequential allocation problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Kullback-Leibler upper confidence bounds for optimal sequential allocation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Prediction, Learning, and Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sequential Design of Experiments / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093383 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2810758 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning the distribution with largest mean: two bandit frameworks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Context tree selection: a unifying view / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Upper-Confidence Bound Policies for Switching Bandit Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4197923 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically Efficient Adaptive Choice of Control Laws inControlled Markov Chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2896090 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Bayesian index policies for sequential resource allocation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093197 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A minimax and asymptotically optimal algorithm for stochastic bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: The multi-armed bandit problem with covariates / rank
 
Normal rank
Property / cites work
 
Property / cites work: Batched bandit problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some aspects of the sequential design of experiments / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simple Bayesian Algorithms for Best-Arm Identification / rank
 
Normal rank
Property / cites work
 
Property / cites work: Landmark learning: An illustration of associative search / rank
 
Normal rank

Latest revision as of 07:38, 15 July 2024

scientific article; zbMATH DE number 6847917
Language Label Description Also known as
English
Learning the distribution with largest mean: two bandit frameworks
scientific article; zbMATH DE number 6847917

    Statements

    Learning the distribution with largest mean: two bandit frameworks (English)
    0 references
    0 references
    0 references
    7 March 2018
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references