Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning (Q5081106): Difference between revisions
From MaRDI portal
Revision as of 04:18, 29 July 2024
scientific article; zbMATH DE number 7535638
Language | Label | Description | Also known as |
---|---|---|---|
English | Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning |
scientific article; zbMATH DE number 7535638 |
Statements
Simple and Optimal Methods for Stochastic Variational Inequalities, II: Markovian Noise and Policy Evaluation in Reinforcement Learning (English)
0 references
1 June 2022
0 references
variational inequality
0 references
operator extrapolation
0 references
acceleration
0 references
reinforcement learning
0 references
temporal difference learning
0 references
stochastic policy evaluation
0 references
0 references