Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes (Q4607932): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
(3 intermediate revisions by one other user not shown)
label / enlabel / en
 
Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes
Property / publication date
 
25 October 2023
Timestamp+2023-10-25T00:00:00Z
Timezone+00:00
CalendarGregorian
Precision1 day
Before0
After0
Property / publication date: 25 October 2023 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 90C59 / rank
 
Normal rank
Property / zbMATH DE Number
 
Property / zbMATH DE Number: 7754567 / rank
 
Normal rank
Property / arXiv classification
 
cs.DS
Property / arXiv classification: cs.DS / rank
 
Normal rank
Property / arXiv classification
 
cs.LG
Property / arXiv classification: cs.LG / rank
 
Normal rank
Property / arXiv classification
 
math.OC
Property / arXiv classification: math.OC / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1710.09988 / rank
 
Normal rank
Property / title
 
Variance reduced value iteration and faster algorithms for solving Markov decision processes (English)
Property / title: Variance reduced value iteration and faster algorithms for solving Markov decision processes (English) / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1002/nav.21992 / rank
 
Normal rank
Property / author
 
Property / author: Aaron Sidford / rank
 
Normal rank
Property / author
 
Property / author: Q6075461 / rank
 
Normal rank
Property / author
 
Property / author: Q6079110 / rank
 
Normal rank
Property / author
 
Property / author: Yinyu Ye / rank
 
Normal rank
Property / published in
 
Property / published in: Naval Research Logistics (NRL) / rank
 
Normal rank
Property / zbMATH Keywords
 
linear programming algorithm
Property / zbMATH Keywords: linear programming algorithm / rank
 
Normal rank
Property / zbMATH Keywords
 
Markov decision processes
Property / zbMATH Keywords: Markov decision processes / rank
 
Normal rank
Property / zbMATH Keywords
 
value iteration
Property / zbMATH Keywords: value iteration / rank
 
Normal rank
Property / cites work
 
Property / cites work: Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3241581 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4368722 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3189557 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3844775 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3270185 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The value iteration algorithm is not strongly polynomial for discounted dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A sparse sampling algorithm for near-optimal planning in large Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: PAC Bounds for Discounted MDPs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2880979 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Solving H-horizon, stationary Markov decision problems in time proportional to log (H) / rank
 
Normal rank
Property / cites work
 
Property / cites work: A New Complexity Result on Solving the Markov Decision Problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate / rank
 
Normal rank

Latest revision as of 07:08, 3 August 2024

scientific article; zbMATH DE number 6850359
Language Label Description Also known as
English
Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes
scientific article; zbMATH DE number 6850359

    Statements

    15 March 2018
    0 references
    25 October 2023
    0 references
    cs.DS
    0 references
    cs.LG
    0 references
    math.OC
    0 references
    Variance reduced value iteration and faster algorithms for solving Markov decision processes (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    linear programming algorithm
    0 references
    Markov decision processes
    0 references
    value iteration
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references