Multiscale Q-learning with linear function approximation

From MaRDI portal

Publication:312650

Jump to:navigation, search

DOI10.1007/s10626-015-0216-zzbMath1346.93265OpenAlexW2194349390MaRDI QIDQ312650

Mohammad Hasan, M. Dambrine

Publication date: 16 September 2016

Published in: Discrete Event Dynamic Systems (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/s10626-015-0216-z

zbMATH Keywords

differential inclusion stochastic approximation ordinary differential equation reinforcement learning multi-stage stochastic shortest path problem Q-learning with linear function approximation

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Time-scale analysis and singular perturbations in control/observation systems (93C70) Stochastic systems in control theory (general) (93E03)

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:312650&oldid=12191175"