A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic (Q5883319): Difference between revisions

From MaRDI portal
Importer: Created a new Item
ReferenceBot: Changed an Item
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type: MaRDI publication profile / rank: Normal rank
Property / cites work: Q4999029 / rank: Normal rank
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank: Normal rank
Property / cites work: First-Order Methods in Optimization / rank: Normal rank
Property / cites work: Stochastic approximation with two time scales / rank: Normal rank
Property / cites work: Technical Note—The Equivalence of Two Mathematical Programs with Optimization Problems in the Constraints / rank: Normal rank
Property / cites work: Mathematical Programs with Optimization Problems in the Constraints / rank: Normal rank
Property / cites work: An overview of bilevel optimization / rank: Normal rank
Property / cites work: Q2934010 / rank: Normal rank
Property / cites work: Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance / rank: Normal rank
Property / cites work: On bilevel programming. I: General nonlinear cases / rank: Normal rank
Property / cites work: Matrix concentration for products / rank: Normal rank
Property / cites work: Double penalty method for bilevel optimization problems / rank: Normal rank
Property / cites work: Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning / rank: Normal rank
Property / cites work: Convergence rate of linear two-time-scale stochastic approximation. / rank: Normal rank
Property / cites work: Q4558791 / rank: Normal rank
Property / cites work: Mathematical Programs with Equilibrium Constraints / rank: Normal rank
Property / cites work: Convergence rate and averaging of nonlinear two-time-scale stochastic approximation algorithms / rank: Normal rank
Property / cites work: Q3096132 / rank: Normal rank
Property / cites work: A First Order Method for Solving Convex Bilevel Optimization Problems / rank: Normal rank
Property / cites work: Q4626283 / rank: Normal rank
Property / cites work: Algorithms for Reinforcement Learning / rank: Normal rank
Property / cites work: Descent approaches for quadratic bilevel programming / rank: Normal rank
Property / cites work: Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions / rank: Normal rank
Property / cites work: A penalty function approach for solving bi-level linear programs / rank: Normal rank

Latest revision as of 19:24, 31 July 2024

Language: English
Label: A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
Description: scientific article; zbMATH DE number 7669687

    Statements

    A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic (English)
    30 March 2023
    bilevel optimization
    two-timescale stochastic approximation
    actor-critic
