Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent

From MaRDI portal
Publication:5026254