Good arm identification via bandit feedback (Q2425222): Difference between revisions

@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Kullback-Leibler upper confidence bounds for optimal sequential allocation
+Normal rank
@@ Property / cites work @@
+Q3093383
@@ Property / cites work: Q3093383 / rank @@
+Normal rank
@@ Property / cites work @@
+Q2810758
@@ Property / cites work: Q2810758 / rank @@
+Normal rank
@@ Property / cites work @@
+A procedure for selecting a subset of size m containing the l best of k independent normal populations, with applications to simulation
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank