Convergence of entropy-regularized natural policy gradient with linear function approximation

Publication:6587339