A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
From MaRDI portal
Publication:4635151
Recommendations
- Exploiting the structural properties of the underlying Markov decision problem in the Q-learning algorithm
- Asynchronous stochastic approximation and Q-learning
- Q-learning algorithms with random truncation bounds and applications to effective parallel computing
- Stochastic approximation: Theory and applications
- Q-Learning with Linear Function Approximation
Cited in
(2)
This page was built for publication: A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4635151)