Magnitude and angle dynamics in training single ReLU neurons
From MaRDI portal
Publication:6587015
Recommendations
- The Computational Complexity of ReLU Network Training Parameterized by Data Dimensionality
- Non-convergence of stochastic gradient descent in the training of deep neural networks
- Improved weight initialization for deep and narrow feedforward neural network
- Training neural networks from an ergodic perspective
- On the Relation Between Loss Functions and T-Norms
Cites work
Cited in
(3)
This page was built for publication: Magnitude and angle dynamics in training single ReLU neurons
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6587015)