A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
From MaRDI portal
Publication:4635151
DOI10.1145/1842713.1842715zbMATH Open1490.62209OpenAlexW2029593259MaRDI QIDQ4635151FDOQ4635151
Sumit Kunnumkal, Huseyin Topaloglu
Publication date: 16 April 2018
Published in: ACM Transactions on Modeling and Computer Simulation (Search for Journal in Brave)
Full work available at URL: http://eprints.exchange.isb.edu/57/1/sup_projection.pdf
This page was built for publication: A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4635151)