Multi-timescale ensemble Q-learning for Markov decision process policy optimization
From MaRDI portal
Publication:6605580
DOI: 10.1109/tsp.2024.3372699
zbMATH Open: 1548.94097
MaRDI QID: Q6605580
FDO: Q6605580
Authors: Talha Bozkus, Urbashi Mitra
Publication date: 16 September 2024
Published in: IEEE Transactions on Signal Processing