Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model (Q399890): Difference between revisions

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2120678009
@@ Property / OpenAlex ID: W2120678009 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.6461
@@ Property / arXiv ID: 1206.6461 / rank @@
+Normal rank
@@ Property / cites work @@
+Neuro-Dynamic Programming: An Overview and Recent Results
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Prediction, Learning, and Games
@@ Property / cites work: Prediction, Learning, and Games / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093383
@@ Property / cites work: Q3093383 / rank @@
+Normal rank
@@ Property / cites work @@
+A guided tour of Chernoff bounds
@@ Property / cites work: A guided tour of Chernoff bounds / rank @@
+Normal rank
@@ Property / cites work @@
+Q2896090
@@ Property / cites work: Q2896090 / rank @@
+Normal rank
@@ Property / cites work @@
+PAC Bounds for Discounted MDPs
@@ Property / cites work: PAC Bounds for Discounted MDPs / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093197
@@ Property / cites work: Q3093197 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+An upper bound on the loss from approximate optimal-value functions
+Normal rank
@@ Property / cites work @@
+The variance of discounted Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q2880979
@@ Property / cites work: Q2880979 / rank @@
+Normal rank
@@ Property / cites work @@
+Algorithms for Reinforcement Learning
@@ Property / cites work: Algorithms for Reinforcement Learning / rank @@
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:399890