Approximate Newton Policy Gradient Algorithms (Q6074547)
From MaRDI portal
scientific article; zbMATH DE number 7749379
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Approximate Newton Policy Gradient Algorithms | scientific article; zbMATH DE number 7749379 | |
Statements
Approximate Newton Policy Gradient Algorithms (English)
12 October 2023
policy gradient algorithm
approximate Newton method
quadratic convergence
Markov decision process
entropy regularization
reinforcement learning