Q4998863 (Q4998863): Difference between revisions

@@ Property / cites work @@
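+UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem
@@ Property / cites work: UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem / rank @@
+Normal rank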
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
@@ Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank @@
+Normal rank
@@ Property / cites work @@
+Bandits with Knapsacks
@@ Property / cites work: Bandits with Knapsacks / rank @@
+Normal rank
@@ Property / cites work @@
+Q3809068
@@ Property / cites work: Q3809068 / rank @@
+Normal rank
@@ Property / cites work @@
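+Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards
@@ Property / cites work: Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards / rank @@
+Normal rank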
@@ Property / cites work @@
+New approaches to statistical learning theory
@@ Property / cites work: New approaches to statistical learning theory / rank @@
+Normal rank
@@ Property / cites work @@
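+Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
@@ Property / cites work: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems / rank @@
+Normal rank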
@@ Property / cites work @@
+The multi-armed bandit problem with covariates
@@ Property / cites work: The multi-armed bandit problem with covariates / rank @@
+Normal rank
@@ Property / cites work @@
+Prediction, Learning, and Games
@@ Property / cites work: Prediction, Learning, and Games / rank @@
+Normal rank
@@ Property / cites work @@
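+On Upper-Confidence Bound Policies for Switching Bandit Problems
@@ Property / cites work: On Upper-Confidence Bound Policies for Switching Bandit Problems / rank @@
+Normal rank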
@@ Property / cites work @@
+Multi‐Armed Bandit Allocation Indices
@@ Property / cites work: Multi‐Armed Bandit Allocation Indices / rank @@
+Normal rank
@@ Property / cites work @@
+Q5396640
@@ Property / cites work: Q5396640 / rank @@
+Normal rank
@@ Property / cites work @@
+Q2810758
@@ Property / cites work: Q2810758 / rank @@
+Normal rank
@@ Property / cites work @@
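+Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis
@@ Property / cites work: Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis / rank @@
+Normal rank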
@@ Property / cites work @@
+Q5302093
@@ Property / cites work: Q5302093 / rank @@
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
@@ Property / cites work: Asymptotically efficient adaptive allocation rules / rank @@
+Normal rank
@@ Property / cites work @@
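+Learning in a Changing World: Restless Multiarmed Bandit With Unknown Dynamics
@@ Property / cites work: Learning in a Changing World: Restless Multiarmed Bandit With Unknown Dynamics / rank @@
+Normal rank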