Algorithms for reinforcement learning. (Q3588852)
From MaRDI portal
scientific article; zbMATH DE number 5782596
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Algorithms for reinforcement learning. | scientific article; zbMATH DE number 5782596 | |
Statements
Algorithms for Reinforcement Learning (English)
10 September 2010
reinforcement learning
Markov decision processes
temporal difference learning
stochastic approximation
function approximation
stochastic gradient methods
least-squares methods
overfitting
bias-variance tradeoff
online learning
active learning
planning
simulation
PAC-learning
Q-learning
actor-critic methods
policy gradient
natural gradient
0.8399984836578369
0.8084538578987122
0.7944624423980713
0.7856948971748352