Robustness of stochastic bandit policies (Q391739): Difference between revisions

@@ Property / OpenAlex ID @@
+W1985558253
@@ Property / OpenAlex ID: W1985558253 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.4506
@@ Property / arXiv ID: 1107.4506 / rank @@
+Normal rank
@@ Property / cites work @@
+Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Stationary multi-choice bandit problems.
@@ Property / cites work: Stationary multi-choice bandit problems. / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal adaptive policies for sequential allocation problems
+Normal rank
@@ Property / cites work @@
+Q5302093
@@ Property / cites work: Q5302093 / rank @@
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+When can the two-armed bandit algorithm be trusted?
+Normal rank
@@ Property / cites work @@
+The tight constant in the Dvoretzky-Kiefer-Wolfowitz inequality
+Normal rank
@@ Property / cites work @@
+Some aspects of the sequential design of experiments
+Normal rank