Successive convex approximation based off-policy optimization for constrained reinforcement learning
From MaRDI portal
Publication:6602807
DOI: 10.1109/TSP.2022.3158737
zbMATH Open: 1548.90484
MaRDI QID: Q6602807
FDO: Q6602807
Wu Luo, Guan Huang, An Liu, Chang Tian
Publication date: 12 September 2024
Published in: IEEE Transactions on Signal Processing