A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm (Q4635151)
From MaRDI portal
scientific article; zbMATH DE number 6859937
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm | scientific article; zbMATH DE number 6859937 | |
Statements
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm (English)
16 April 2018
Q-learning
stochastic approximation
max-norm projection