Proximal algorithms and temporal difference methods for solving fixed point problems (Q721950): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Approximate dynamic programming with a fuzzy parameterization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Near-Optimal Column-Based Matrix Reconstruction / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convex analysis and monotone operator theory in Hilbert spaces / rank
 
Normal rank
Property / cites work
 
Property / cites work: Error Bounds for Approximations from Projected Linear Equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Analysis of Stochastic Shortest Path Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3079664 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Projected equation methods for approximate solution of large linear systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the method of multipliers for convex programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3690580 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Temporal Difference Methods for General Projected Equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate policy iteration: a survey and some new methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2925454 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3452586 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3189557 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Technical update: Least-squares temporal difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5477859 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A note on the behavior of the randomized Kaczmarz algorithm of Strohmer and Vershynin / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3243484 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Fast Monte Carlo Algorithms for Matrices I: Approximating Matrix Multiplication / rank
 
Normal rank
Property / cites work
 
Property / cites work: Fast Monte Carlo Algorithms for Matrices II: Computing a Low-Rank Approximation to a Matrix / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sampling algorithms for <i>l</i><sub>2</sub> regression and applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: Relative-Error $CUR$ Matrix Decompositions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Faster least squares approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Douglas-Rachford splitting method and the proximal point algorithm for maximal monotone operators / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-Dimensional Variational Inequalities and Complementarity Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3316508 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Retrospective and Prospective Survey of the Monte Carlo Method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5638711 / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Randomized Methods for Linear Constraints: Convergence Rates and Conditioning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Splitting Algorithms for the Sum of Two Nonlinear Operators / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5618030 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Least squares policy evaluation algorithms with linear function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Monotone Operators and the Proximal Point Algorithm / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5744816 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5406031 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A randomized Kaczmarz algorithm with exponential convergence / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms for Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Applications of a Splitting Algorithm to Decomposition in Convex Programming and Variational Inequalities / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stabilization of Stochastic Iterative Methods for Singular and Nearly Singular Linear Systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Incremental constraint projection methods for variational inequalities / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence Results for Some Temporal Difference Methods Based on Least Squares / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-learning and policy iteration algorithms for stochastic shortest path problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Least Squares Temporal Difference Methods: An Analysis under General Conditions / rank
 
Normal rank

Revision as of 04:02, 16 July 2024

scientific article
Language Label Description Also known as
English
Proximal algorithms and temporal difference methods for solving fixed point problems
scientific article

    Statements

    Proximal algorithms and temporal difference methods for solving fixed point problems (English)
    0 references
    20 July 2018
    0 references
    proximal algorithm
    0 references
    temporal differences
    0 references
    dynamic programming
    0 references
    convex optimization
    0 references
    fixed point problems
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers