A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

From MaRDI portal

Publication:4635151

Jump to:navigation, search

DOI10.1145/1842713.1842715MaRDI QIDQ4635151zbMATH OpenOpenAlexFDO

Authors Sumit Kunnumkal, Huseyin Topaloglu

Publication date 16 April 2018

Published in ACM Transactions on Modeling and Computer Simulation (Search for Journal in Brave)

Full work available at URL http://eprints.exchange.isb.edu/57/1/sup_projection.pdf

zbMATH Keywords

stochastic approximation Q-learning max-norm projection

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Stochastic approximation (62L20)

Recommendations

Cited in

(2)

This page was built for publication: A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4635151)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4635151&oldid=18829359"