A Discrete-Time Switching System Analysis of Q-Learning

Publication:6107867

DOI: 10.1137/22m1489976
arXiv: 2102.08583
MaRDI QID: Q6107867

Niao He, Dong Hwan Lee, Jianghai Hu

Publication date: 28 June 2023

Published in: SIAM Journal on Control and Optimization

Full work available at URL: https://arxiv.org/abs/2102.08583

zbMATH Keywords

stochastic approximation, Q-learning, switched linear system

Mathematics Subject Classification ID

Analysis of algorithms and problem complexity (68Q25)
Graph theory (including graph drawing) in computer science (68R10)
Computer graphics; computational geometry (digital and algorithmic aspects) (68U05)
