Regret bounds for restless Markov bandits (Q465253): Difference between revisions

@@ Property / OpenAlex ID @@
+W2178643644
@@ Property / OpenAlex ID: W2178643644 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.2693
@@ Property / arXiv ID: 1209.2693 / rank @@
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
+Normal rank
@@ Property / cites work @@
+The Nonstochastic Multiarmed Bandit Problem
@@ Property / cites work: The Nonstochastic Multiarmed Bandit Problem / rank @@
+Normal rank
@@ Property / cites work @@
+Q2896090
@@ Property / cites work: Q2896090 / rank @@
+Normal rank
@@ Property / cites work @@
+Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
+Normal rank
@@ Property / cites work @@
+Q4197923
@@ Property / cites work: Q4197923 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Q3815845
@@ Property / cites work: Q3815845 / rank @@
+Normal rank
@@ Property / cites work @@
+Equivalence notions and model minimization in Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q4737593
@@ Property / cites work: Q4737593 / rank @@
+Normal rank
@@ Property / cites work @@
+Pseudometrics for State Aggregation in Average Reward Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+On Chebyshev-Type Inequalities for Primes
@@ Property / cites work: On Chebyshev-Type Inequalities for Primes / rank @@
+Normal rank
@@ Property / cites work @@
+Threshold limits for cover times
@@ Property / cites work: Threshold limits for cover times / rank @@
+Normal rank
@@ Property / cites work @@
+On the possibility of learning in reactive environments with arbitrary dependence
+Normal rank