Policy Gradient for Continuing Tasks in Discounted Markov Decision Processes (Q6075992)
From MaRDI portal
scientific article; zbMATH DE number 7740953
Language | Label | Description | Also known as |
---|---|---|---|
English | Policy Gradient for Continuing Tasks in Discounted Markov Decision Processes |
scientific article; zbMATH DE number 7740953 |
Statements
Policy Gradient for Continuing Tasks in Discounted Markov Decision Processes (English)
0 references
21 September 2023
0 references
adaptive systems
0 references
gradient methods
0 references
reinforcement learning
0 references
stochastic systems
0 references