Randomized Linear Programming Solves the Markov Decision Problem in Nearly Linear (Sometimes Sublinear) Time (Q5119845): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3241581 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4368722 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3189557 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Efficient Sampling Methods for Discrete Distributions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sublinear optimization for machine learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Linear Programming Approach to Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3270185 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The value iteration algorithm is not strongly polynomial for discounted dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Solving variational inequalities with Stochastic Mirror-Prox algorithm / rank
 
Normal rank
Property / cites work
 
Property / cites work: Exponentiated gradient versus gradient descent for linear predictors / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4421713 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Robust Stochastic Approximation Approach to Stochastic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Solving H-horizon, stationary Markov decision problems in time proportional to log (H) / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Efficient Method for Weighted Sampling without Replacement / rank
 
Normal rank
Property / cites work
 
Property / cites work: A New Complexity Result on Solving the Markov Decision Problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate / rank
 
Normal rank

Latest revision as of 09:54, 23 July 2024

scientific article; zbMATH DE number 7242693
Language Label Description Also known as
English
Randomized Linear Programming Solves the Markov Decision Problem in Nearly Linear (Sometimes Sublinear) Time
scientific article; zbMATH DE number 7242693

    Statements

    Randomized Linear Programming Solves the Markov Decision Problem in Nearly Linear (Sometimes Sublinear) Time (English)
    0 references
    0 references
    1 September 2020
    0 references
    Markov decision process
    0 references
    randomized algorithm
    0 references
    linear programming
    0 references
    duality
    0 references
    primal-dual method
    0 references
    runtime complexity
    0 references
    stochastic approximation
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references