A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

From MaRDI portal
Publication:4635151

DOI: 10.1145/1842713.1842715
zbMATH Open: 1490.62209
OpenAlex: W2029593259
MaRDI QID: Q4635151
FDO: Q4635151

Sumit Kunnumkal, Huseyin Topaloglu

Publication date: 16 April 2018

Published in: ACM Transactions on Modeling and Computer Simulation

Full work available at URL: http://eprints.exchange.isb.edu/57/1/sup_projection.pdf



zbMATH Keywords

stochastic approximation, Q-learning, max-norm projection


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)
Stochastic approximation (62L20)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4635151&oldid=18829359"
This page was last edited on 7 February 2024, at 15:49. Warning: Page may not contain recent updates.