The actor-critic algorithm as multi-time-scale stochastic approximation.

Publication:5955801