A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

Publication:4635151