Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning (Q5081106): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Q3997575 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stable Optimal Control and Semicontractive Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic optimal control. The discrete time case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2934010 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The minimax learning rates of normal and Ising undirected graphical models / rank
 
Normal rank
Property / cites work
 
Property / cites work: Ergodic Mirror Descent / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Stochastic Approximation Algorithms for Strongly Convex Stochastic Composite Optimization I: A Generic Algorithmic Framework / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mixing time estimation in reversible Markov chains from a single sample path / rank
 
Normal rank
Property / cites work
 
Property / cites work: Statistical Inference via Convex Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Deterministic and stochastic primal-dual subgradient algorithms for uniformly convex minimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: OnActor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4421713 / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: First-order and stochastic optimization methods for machine learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4595047 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov Chains and Stochastic Stability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Information-based complexity of linear operator equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence Rates for Markov Chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Introduction to Stochastic Search and Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank

Revision as of 04:18, 29 July 2024

scientific article; zbMATH DE number 7535638
Language Label Description Also known as
English
Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning
scientific article; zbMATH DE number 7535638

    Statements

    Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning (English)
    0 references
    0 references
    0 references
    0 references
    1 June 2022
    0 references
    variational inequality
    0 references
    operator extrapolation
    0 references
    acceleration
    0 references
    reinforcement learning
    0 references
    temporal difference learning
    0 references
    stochastic policy evaluation
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references