Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995): Difference between revisions

@@ Property / author @@
-Aurélien Garivier
@@ Property / author: Aurélien Garivier / rank @@
-Normal rank
@@ Property / describes a project that uses @@
+py/maBandits
@@ Property / describes a project that uses: py/maBandits / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / arXiv ID @@
+1210.1136
@@ Property / arXiv ID: 1210.1136 / rank @@
+Normal rank
@@ Property / cites work @@
+Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Q2896165
@@ Property / cites work: Q2896165 / rank @@
+Normal rank
@@ Property / cites work @@
+Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
+Normal rank
@@ Property / cites work @@
+Optimal adaptive policies for sequential allocation problems
+Normal rank
@@ Property / cites work @@
+Optimal Adaptive Policies for Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+ASYMPTOTIC BAYES ANALYSIS FOR THE FINITE-HORIZON ONE-ARMED-BANDIT PROBLEM
+Normal rank
@@ Property / cites work @@
+Kullback-Leibler upper confidence bounds for optimal sequential allocation
+Normal rank
@@ Property / cites work @@
+Optimal stopping and dynamic allocation
@@ Property / cites work: Optimal stopping and dynamic allocation / rank @@
+Normal rank
@@ Property / cites work @@
+Q4040465
@@ Property / cites work: Q4040465 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4391441
@@ Property / cites work: Q4391441 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4197923
@@ Property / cites work: Q4197923 / rank @@
+Normal rank
@@ Property / cites work @@
+Multi‐Armed Bandit Allocation Indices
@@ Property / cites work: Multi‐Armed Bandit Allocation Indices / rank @@
+Normal rank
@@ Property / cites work @@
+Probability Inequalities for Sums of Bounded Random Variables
+Normal rank
@@ Property / cites work @@
+An asymptotically optimal policy for finite support models in the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Q4219536
@@ Property / cites work: Q4219536 / rank @@
+Normal rank
@@ Property / cites work @@
+Concentration inequalities and model selection. Ecole d'Eté de Probabilités de Saint-Flour XXXIII -- 2003.
+Normal rank
@@ Property / cites work @@
+Q2756704
@@ Property / cites work: Q2756704 / rank @@
+Normal rank
@@ Property / cites work @@
+Some aspects of the sequential design of experiments
+Normal rank
@@ Property / cites work @@
+On the Theory of Apportionment
@@ Property / cites work: On the Theory of Apportionment / rank @@
+Normal rank
@@ Property / cites work @@
+Asymptotic Statistics
@@ Property / cites work: Asymptotic Statistics / rank @@
+Normal rank
@@ Property / cites work @@
+Graphical Models, Exponential Families, and Variational Inference
+Normal rank
@@ Property / cites work @@
+Sequential Tests of Statistical Hypotheses
@@ Property / cites work: Sequential Tests of Statistical Hypotheses / rank @@
+Normal rank
@@ Property / cites work @@
+On the Gittins index for multiarmed bandits
@@ Property / cites work: On the Gittins index for multiarmed bandits / rank @@
+Normal rank
@@ Property / cites work @@
+Q3882215
@@ Property / cites work: Q3882215 / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W3100329718
@@ Property / OpenAlex ID: W3100329718 / rank @@
+Normal rank