Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems (Q4969058): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed label, description and/or aliases in en, and other parts
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / arXiv ID
 
Property / arXiv ID: 1812.08305 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Linear Thompson sampling revisited / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3376698 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the sample complexity of the linear quadratic regulator / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Rates for Zero-Order Convex Optimization: The Power of Two Function Evaluations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimality of Fast-Matching Algorithms for Random Networks With Applications to Structural Controllability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2921693 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic First- and Zeroth-Order Methods for Nonconvex Stochastic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Bound on Tail Probabilities for Quadratic Forms in Independent Random Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: A tail inequality for quadratic forms of subgaussian random vectors / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3849137 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2810828 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5643297 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Gradient methods for solving equations and inequalities / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback / rank
 
Normal rank
Property / cites work
 
Property / cites work: Introduction to Stochastic Search and Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimization of Smooth Functions With Noisy Observations: Local Minimax Rates / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4339077 / rank
 
Normal rank

Latest revision as of 18:06, 23 July 2024

scientific article; zbMATH DE number 7255052
Language Label Description Also known as
English
Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems
scientific article; zbMATH DE number 7255052

    Statements

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    5 October 2020
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    derivative-free optimization
    0 references
    linear quadratic control
    0 references
    non-convex optimization
    0 references
    cs.LG
    0 references
    math.OC
    0 references
    stat.ML
    0 references
    0 references