An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624): Difference between revisions

@@ Property / cites work @@
+The Continuum-Armed Bandit Problem
@@ Property / cites work: The Continuum-Armed Bandit Problem / rank @@
+Normal rank
@@ Property / cites work @@
+Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+The Nonstochastic Multiarmed Bandit Problem
@@ Property / cites work: The Nonstochastic Multiarmed Bandit Problem / rank @@
+Normal rank
@@ Property / cites work @@
+Q4821526
@@ Property / cites work: Q4821526 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal adaptive policies for sequential allocation problems
+Normal rank
@@ Property / cites work @@
+Elements of Information Theory
@@ Property / cites work: Elements of Information Theory / rank @@
+Normal rank
@@ Property / cites work @@
+Q3046711
@@ Property / cites work: Q3046711 / rank @@
+Normal rank
@@ Property / cites work @@
+Introduction to sensitivity and stability analysis in nonlinear programming
+Normal rank
@@ Property / cites work @@
+Q4692329
@@ Property / cites work: Q4692329 / rank @@
+Normal rank
@@ Property / cites work @@
+Multi-armed bandit problem revisited
@@ Property / cites work: Multi-armed bandit problem revisited / rank @@
+Normal rank
@@ Property / cites work @@
+The Multi-Armed Bandit Problem: Decomposition and Computation
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Exploration of multi-state environments: Local measures and back-propagation of uncertainty
+Normal rank
@@ Property / cites work @@
+Convergence of stochastic processes
@@ Property / cites work: Convergence of stochastic processes / rank @@
+Normal rank
@@ Property / cites work @@
+Some aspects of the sequential design of experiments
+Normal rank
@@ Property / cites work @@
+Non-overlapping domain decomposition for evolution operators
+Normal rank
@@ Property / cites work @@
+Nonparametric bandit methods
@@ Property / cites work: Nonparametric bandit methods / rank @@
+Normal rank