Towards understanding asynchronous advantage actor-critic: convergence and linear speedup

From MaRDI portal
Publication:6603629