On Generalized Bellman Equations and Temporal-Difference Learning (Q3305109)
From MaRDI portal
scientific article; zbMATH DE number 6982339
Language | Label | Description | Also known as |
---|---|---|---|
English | On Generalized Bellman Equations and Temporal-Difference Learning | scientific article; zbMATH DE number 6982339 | |
Statements
On Generalized Bellman Equations and Temporal-Difference Learning (English)
5 August 2020
21 November 2018
Markov decision process
policy evaluation
generalized Bellman equation
temporal differences
Markov chain
randomized stopping time
approximate policy evaluation
reinforcement learning
temporal-difference method
cs.LG
math.OC