Regret bounds for restless Markov bandits (Q465253): Difference between revisions

@@ Property / Mathematics Subject Classification ID @@
+G40
@@ Property / Mathematics Subject Classification ID: 60G40 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C40
@@ Property / Mathematics Subject Classification ID: 90C40 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+A60
@@ Property / Mathematics Subject Classification ID: 91A60 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+6362896
@@ Property / zbMATH DE Number: 6362896 / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+restless bandits
@@ Property / zbMATH Keywords: restless bandits / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+Markov decision processes
@@ Property / zbMATH Keywords: Markov decision processes / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+regret
@@ Property / zbMATH Keywords: regret / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2178643644
@@ Property / OpenAlex ID: W2178643644 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.2693
@@ Property / arXiv ID: 1209.2693 / rank @@
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
+Normal rank
@@ Property / cites work @@
+The Nonstochastic Multiarmed Bandit Problem
@@ Property / cites work: The Nonstochastic Multiarmed Bandit Problem / rank @@
+Normal rank
@@ Property / cites work @@
+Q2896090
@@ Property / cites work: Q2896090 / rank @@
+Normal rank
@@ Property / cites work @@
+Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
+Normal rank
@@ Property / cites work @@
+Q4197923
@@ Property / cites work: Q4197923 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Q3815845
@@ Property / cites work: Q3815845 / rank @@
+Normal rank
@@ Property / cites work @@
+Equivalence notions and model minimization in Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q4737593
@@ Property / cites work: Q4737593 / rank @@
+Normal rank
@@ Property / cites work @@
+Pseudometrics for State Aggregation in Average Reward Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+On Chebyshev-Type Inequalities for Primes
@@ Property / cites work: On Chebyshev-Type Inequalities for Primes / rank @@
+Normal rank
@@ Property / cites work @@
+Threshold limits for cover times
@@ Property / cites work: Threshold limits for cover times / rank @@
+Normal rank
@@ Property / cites work @@
+On the possibility of learning in reactive environments with arbitrary dependence
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:465253