A stochastic trust-region framework for policy optimization

Publication:5096136

DOI: 10.4208/JCM.2104-M2021-0007; OpenAlex: W2990109857; MaRDI QID: Q5096136; FDO: Q5096136


Authors: Mingming Zhao, Yongfeng Li, Zaiwen Wen


Publication date: 15 August 2022

Published in: Journal of Computational Mathematics

Full work available at URL: https://arxiv.org/abs/1911.11640






Cited In (2)






