Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes (Q4607932): Difference between revisions
From MaRDI portal
Set profile property. |
ReferenceBot (talk | contribs) Changed an Item |
||||||||||||||
(3 intermediate revisions by one other user not shown) | |||||||||||||||
label / en | label / en | ||||||||||||||
Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes | |||||||||||||||
Property / publication date | |||||||||||||||
25 October 2023
| |||||||||||||||
Property / publication date: 25 October 2023 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / Mathematics Subject Classification ID | |||||||||||||||
Property / Mathematics Subject Classification ID: 90C59 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / zbMATH DE Number | |||||||||||||||
Property / zbMATH DE Number: 7754567 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / arXiv classification | |||||||||||||||
cs.DS | |||||||||||||||
Property / arXiv classification: cs.DS / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / arXiv classification | |||||||||||||||
cs.LG | |||||||||||||||
Property / arXiv classification: cs.LG / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / arXiv classification | |||||||||||||||
math.OC | |||||||||||||||
Property / arXiv classification: math.OC / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / arXiv ID | |||||||||||||||
Property / arXiv ID: 1710.09988 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / title | |||||||||||||||
Variance reduced value iteration and faster algorithms for solving Markov decision processes (English) | |||||||||||||||
Property / title: Variance reduced value iteration and faster algorithms for solving Markov decision processes (English) / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / DOI | |||||||||||||||
Property / DOI: 10.1002/nav.21992 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Aaron Sidford / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Q6075461 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Q6079110 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Yinyu Ye / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / published in | |||||||||||||||
Property / published in: Naval Research Logistics (NRL) / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
linear programming algorithm | |||||||||||||||
Property / zbMATH Keywords: linear programming algorithm / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Markov decision processes | |||||||||||||||
Property / zbMATH Keywords: Markov decision processes / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
value iteration | |||||||||||||||
Property / zbMATH Keywords: value iteration / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Q3241581 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Q4368722 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Q3189557 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Q3844775 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Q3270185 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: The value iteration algorithm is not strongly polynomial for discounted dynamic programming / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Q3266141 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: A sparse sampling algorithm for near-optimal planning in large Markov decision processes / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: PAC Bounds for Discounted MDPs / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Q2880979 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: Solving H-horizon, stationary Markov decision problems in time proportional to log (H) / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: A New Complexity Result on Solving the Markov Decision Problem / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / cites work | |||||||||||||||
Property / cites work: The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate / rank | |||||||||||||||
Normal rank |
Latest revision as of 07:08, 3 August 2024
scientific article; zbMATH DE number 6850359
Language | Label | Description | Also known as |
---|---|---|---|
English | Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes |
scientific article; zbMATH DE number 6850359 |
Statements
15 March 2018
0 references
25 October 2023
0 references
cs.DS
0 references
cs.LG
0 references
math.OC
0 references
Variance reduced value iteration and faster algorithms for solving Markov decision processes (English)
0 references
linear programming algorithm
0 references
Markov decision processes
0 references
value iteration
0 references
0 references
0 references
0 references
0 references