Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning (Q5219302): Difference between revisions
ReferenceBot changed an Item: Normalize DOI.
Property / DOI: 10.1287/moor.2017.0855 / rank
Property / DOI: 10.1287/MOOR.2017.0855 / rank: Normal rank
Latest revision as of 16:25, 30 December 2024
scientific article; zbMATH DE number 7179328
Language | Label | Description | Also known as |
---|---|---|---|
English | Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning | scientific article; zbMATH DE number 7179328 | |
Statements
Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning (English)
11 March 2020
Markov noise
two time-scale stochastic approximation
asymptotic convergence
temporal-difference learning