Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes (Q4607932): Difference between revisions

← Older edit

@@ label / en / label / en @@
+Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes
@@ Property / publication date @@
+October 2023Timestamp +2023-10-25T00:00:00Z
Timezone +00:00
Calendar Gregorian
Precision 1 day
Before 0
After 0
-Timestamp
++2023-10-25T00:00:00Z
-Timezone
++00:00
-Calendar
+Gregorian
-Precision
+day
 Before
 After
@@ Property / publication date: 25 October 2023 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C59
@@ Property / Mathematics Subject Classification ID: 90C59 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+7754567
@@ Property / zbMATH DE Number: 7754567 / rank @@
+Normal rank
@@ Property / arXiv classification @@
+cs.DS
@@ Property / arXiv classification: cs.DS / rank @@
+Normal rank
@@ Property / arXiv classification @@
+cs.LG
@@ Property / arXiv classification: cs.LG / rank @@
+Normal rank
@@ Property / arXiv classification @@
+math.OC
@@ Property / arXiv classification: math.OC / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.09988
@@ Property / arXiv ID: 1710.09988 / rank @@
+Normal rank
@@ Property / title @@
+Variance reduced value iteration and faster algorithms for solving Markov decision processes (English)
+Normal rank
@@ Property / DOI @@
+.1002/nav.21992
@@ Property / DOI: 10.1002/nav.21992 / rank @@
+Normal rank
@@ Property / author @@
+Aaron Sidford
@@ Property / author: Aaron Sidford / rank @@
+Normal rank
@@ Property / author @@
+Q6075461
@@ Property / author: Q6075461 / rank @@
+Normal rank
@@ Property / author @@
+Q6079110
@@ Property / author: Q6079110 / rank @@
+Normal rank
@@ Property / author @@
+Yinyu Ye
@@ Property / author: Yinyu Ye / rank @@
+Normal rank
@@ Property / published in @@
+Naval Research Logistics (NRL)
@@ Property / published in: Naval Research Logistics (NRL) / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+linear programming algorithm
@@ Property / zbMATH Keywords: linear programming algorithm / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+Markov decision processes
@@ Property / zbMATH Keywords: Markov decision processes / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+value iteration
@@ Property / zbMATH Keywords: value iteration / rank @@
+Normal rank
@@ Property / cites work @@
+Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
+Normal rank
@@ Property / cites work @@
+Q3241581
@@ Property / cites work: Q3241581 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4368722
@@ Property / cites work: Q4368722 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3189557
@@ Property / cites work: Q3189557 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3844775
@@ Property / cites work: Q3844775 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3270185
@@ Property / cites work: Q3270185 / rank @@
+Normal rank
@@ Property / cites work @@
+The value iteration algorithm is not strongly polynomial for discounted dynamic programming
+Normal rank
@@ Property / cites work @@
+Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor
+Normal rank
@@ Property / cites work @@
+Probability Inequalities for Sums of Bounded Random Variables
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+A sparse sampling algorithm for near-optimal planning in large Markov decision processes
+Normal rank
@@ Property / cites work @@
+PAC Bounds for Discounted MDPs
@@ Property / cites work: PAC Bounds for Discounted MDPs / rank @@
+Normal rank
@@ Property / cites work @@
+Q2880979
@@ Property / cites work: Q2880979 / rank @@
+Normal rank
@@ Property / cites work @@
+Solving H-horizon, stationary Markov decision problems in time proportional to log (H)
+Normal rank
@@ Property / cites work @@
+A New Complexity Result on Solving the Markov Decision Problem
+Normal rank
@@ Property / cites work @@
+The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate
+Normal rank