Stronger convergence results for deep residual networks: network width scales linearly with training data size

From MaRDI portal
Publication:5095259