Linear Convergence of a Policy Gradient Method for Some Finite Horizon Continuous Time Control Problems (Q6140987): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
Normalize DOI.
 
Property / DOI: 10.1137/22m1492180 / rank: Normal rank
 
Property / cites work: Convexity and Optimization in Banach Spaces / rank: Normal rank
Property / cites work: On the convergence rate of approximation schemes for Hamilton-Jacobi-Bellman Equations / rank: Normal rank
Property / cites work: A Numerical Scheme for a Mean Field Game in Some Queueing Systems Based on Markov Chain Approximation Method / rank: Normal rank
Property / cites work: A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems / rank: Normal rank
Property / cites work: Time discretization and Markovian iteration for coupled FBSDEs / rank: Normal rank
Property / cites work: Mean Field Games and Mean Field Type Control Theory / rank: Normal rank
Property / cites work: BSDEs with polynomial growth generators / rank: Normal rank
Property / cites work: Q2807034 / rank: Normal rank
Property / cites work: A regression-based Monte Carlo method to solve backward stochastic differential equations / rank: Normal rank
Property / cites work: Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon / rank: Normal rank
Property / cites work: Solving high-dimensional partial differential equations using deep learning / rank: Normal rank
Property / cites work: Deep backward schemes for high-dimensional nonlinear PDEs / rank: Normal rank
Property / cites work: Nonlinear control systems: An introduction / rank: Normal rank
Property / cites work: A neural network-based policy iteration algorithm with global \(H^2\)-superlinear convergence for stochastic games on domains / rank: Normal rank
Property / cites work: Exponential Convergence and Stability of Howard's Policy Improvement Algorithm for Controlled Diffusions / rank: Normal rank
Property / cites work: A modified MSA for stochastic control problems / rank: Normal rank
Property / cites work: Q2703816 / rank: Normal rank
Property / cites work: Q4558489 / rank: Normal rank
Property / cites work: Time discretization of FBSDE with polynomial growth drivers and reaction-diffusion PDEs / rank: Normal rank
Property / cites work: Sufficient stochastic maximum principle for discounted control problem / rank: Normal rank
Property / cites work: Q3093369 / rank: Normal rank
Property / cites work: Introductory lectures on convex optimization. A basic course. / rank: Normal rank
Property / cites work: Q4379369 / rank: Normal rank
Property / cites work: Continuous-time stochastic control and optimization with financial applications / rank: Normal rank
Property / cites work: Variational Analysis / rank: Normal rank
Property / cites work: Q4626283 / rank: Normal rank
Property / cites work: Backward Stochastic Differential Equations / rank: Normal rank
Property / DOI: 10.1137/22M1492180 / rank: Normal rank

Latest revision as of 18:49, 30 December 2024

scientific article; zbMATH DE number 7782638
Language: English
Label: Linear Convergence of a Policy Gradient Method for Some Finite Horizon Continuous Time Control Problems
Description: scientific article; zbMATH DE number 7782638

    Statements

    Linear Convergence of a Policy Gradient Method for Some Finite Horizon Continuous Time Control Problems (English)
    2 January 2024
    reinforcement learning
    policy gradient method
    stochastic control
    linear convergence
    stationary point
    backward stochastic differential equation
