Good arm identification via bandit feedback (Q2425222): Difference between revisions

@@ Property / arXiv ID @@
+.06360
@@ Property / arXiv ID: 1710.06360 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Kullback-Leibler upper confidence bounds for optimal sequential allocation
+Normal rank
@@ Property / cites work @@
+Q3093383
@@ Property / cites work: Q3093383 / rank @@
+Normal rank
@@ Property / cites work @@
+Q2810758
@@ Property / cites work: Q2810758 / rank @@
+Normal rank
@@ Property / cites work @@
+A procedure for selecting a subset of size m containing the l best of k independent normal populations, with applications to simulation
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / Wikidata QID @@
+Q128264264
@@ Property / Wikidata QID: Q128264264 / rank @@
+Normal rank