Q4558161 (Q4558161): Difference between revisions

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / cites work @@
+Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: finite parameter space
+Normal rank
@@ Property / cites work @@
+Near-Optimal Regret Bounds for Thompson Sampling
@@ Property / cites work: Near-Optimal Regret Bounds for Thompson Sampling / rank @@
+Normal rank
@@ Property / cites work @@
+Tuning Bandit Algorithms in Stochastic Environments
+Normal rank
@@ Property / cites work @@
+UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Q4252717
@@ Property / cites work: Q4252717 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Concentration Inequalities
@@ Property / cites work: Concentration Inequalities / rank @@
+Normal rank
@@ Property / cites work @@
+Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
+Normal rank
@@ Property / cites work @@
+Pure Exploration in Multi-armed Bandits Problems
@@ Property / cites work: Pure Exploration in Multi-armed Bandits Problems / rank @@
+Normal rank
@@ Property / cites work @@
+The multi-armed bandit problem with covariates
@@ Property / cites work: The multi-armed bandit problem with covariates / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal adaptive policies for sequential allocation problems
+Normal rank
@@ Property / cites work @@
+Kullback-Leibler upper confidence bounds for optimal sequential allocation
+Normal rank
@@ Property / cites work @@
+Q4558474
@@ Property / cites work: Q4558474 / rank @@
+Normal rank
@@ Property / cites work @@
+Multi‐Armed Bandit Allocation Indices
@@ Property / cites work: Multi‐Armed Bandit Allocation Indices / rank @@
+Normal rank
@@ Property / cites work @@
+Q3857528
@@ Property / cites work: Q3857528 / rank @@
+Normal rank
@@ Property / cites work @@
+An asymptotically optimal policy for finite support models in the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Q2788426
@@ Property / cites work: Q2788426 / rank @@
+Normal rank
@@ Property / cites work @@
+On Bayesian index policies for sequential resource allocation
+Normal rank
@@ Property / cites work @@
+Finite-time lower bounds for the two-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Adaptive treatment allocation and the multi-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Q4558161
@@ Property / cites work: Q4558161 / rank @@
+Normal rank
@@ Property / cites work @@
+Boundary crossing of Brownian motion. Its relation to the law of the iterated logarithm and to sequential analysis
+Normal rank
@@ Property / cites work @@
+Q5405246
@@ Property / cites work: Q5405246 / rank @@
+Normal rank
@@ Property / cites work @@
+Introduction to nonparametric estimation
@@ Property / cites work: Introduction to nonparametric estimation / rank @@
+Normal rank
@@ Property / cites work @@
+An Asymptotic Minimax Theorem for the Two Armed Bandit Problem
+Normal rank
@@ Property / cites work @@
+Q4934558
@@ Property / cites work: Q4934558 / rank @@
+Normal rank
@@ Property / cites work @@
+Lemma 1
@@ Property / cites work: Lemma 1 / rank @@
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:4558161