Learning the distribution with largest mean: two bandit frameworks (Q4606431): Difference between revisions

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2584453124
@@ Property / OpenAlex ID: W2584453124 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+1702.00001
@@ Property / arXiv ID: 1702.00001 / rank @@
+Normal rank
@@ Property / cites work @@
+Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: finite parameter space
+Normal rank
@@ Property / cites work @@
+Q3886056
@@ Property / cites work: Q3886056 / rank @@
+Normal rank
@@ Property / cites work @@
+Q2739396
@@ Property / cites work: Q2739396 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances
+Normal rank
@@ Property / cites work @@
+Q5610811
@@ Property / cites work: Q5610811 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3240573
@@ Property / cites work: Q3240573 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3809068
@@ Property / cites work: Q3809068 / rank @@
+Normal rank
@@ Property / cites work @@
+Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
+Normal rank
@@ Property / cites work @@
+Q5405258
@@ Property / cites work: Q5405258 / rank @@
+Normal rank
@@ Property / cites work @@
+Pure exploration in finitely-armed and continuous-armed bandits
+Normal rank
@@ Property / cites work @@
+Optimal adaptive policies for sequential allocation problems
+Normal rank
@@ Property / cites work @@
+Kullback-Leibler upper confidence bounds for optimal sequential allocation
+Normal rank
@@ Property / cites work @@
+Prediction, Learning, and Games
@@ Property / cites work: Prediction, Learning, and Games / rank @@
+Normal rank
@@ Property / cites work @@
+Sequential Design of Experiments
@@ Property / cites work: Sequential Design of Experiments / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093383
@@ Property / cites work: Q3093383 / rank @@
+Normal rank
@@ Property / cites work @@
+Q2810758
@@ Property / cites work: Q2810758 / rank @@
+Normal rank
@@ Property / cites work @@
+Learning the distribution with largest mean: two bandit frameworks
+Normal rank
@@ Property / cites work @@
+Context tree selection: a unifying view
@@ Property / cites work: Context tree selection: a unifying view / rank @@
+Normal rank
@@ Property / cites work @@
+On Upper-Confidence Bound Policies for Switching Bandit Problems
+Normal rank
@@ Property / cites work @@
+Q4197923
@@ Property / cites work: Q4197923 / rank @@
+Normal rank
@@ Property / cites work @@
+Asymptotically Efficient Adaptive Choice of Control Laws in Controlled Markov Chains
+Normal rank
@@ Property / cites work @@
+Q2896090
@@ Property / cites work: Q2896090 / rank @@
+Normal rank
@@ Property / cites work @@
+On Bayesian index policies for sequential resource allocation
+Normal rank
@@ Property / cites work @@
+Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Q3093197
@@ Property / cites work: Q3093197 / rank @@
+Normal rank
@@ Property / cites work @@
+A minimax and asymptotically optimal algorithm for stochastic bandits
+Normal rank
@@ Property / cites work @@
+The multi-armed bandit problem with covariates
@@ Property / cites work: The multi-armed bandit problem with covariates / rank @@
+Normal rank
@@ Property / cites work @@
+Batched bandit problems
@@ Property / cites work: Batched bandit problems / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Some aspects of the sequential design of experiments
+Normal rank
@@ Property / cites work @@
+Simple Bayesian Algorithms for Best-Arm Identification
+Normal rank
@@ Property / cites work @@
+Landmark learning: An illustration of associative search
+Normal rank