Batch mode reinforcement learning based on the synthesis of artificial trajectories

From MaRDI portal
Publication:378762