Towards understanding asynchronous advantage actor-critic: convergence and linear speedup
From MaRDI portal
Publication:6603629
DOI10.1109/TSP.2023.3268475zbMATH Open1548.68226MaRDI QIDQ6603629FDOQ6603629
Authors: Han Shen, Kaiqing Zhang, Mingyi Hong, Tianyi Chen
Publication date: 12 September 2024
Published in: IEEE Transactions on Signal Processing (Search for Journal in Brave)
Numerical optimization and variational techniques (65K10) Artificial neural networks and deep learning (68T07)
This page was built for publication: Towards understanding asynchronous advantage actor-critic: convergence and linear speedup
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6603629)