GPipe
From MaRDI portal
Software:55166
swMATH39466MaRDI QIDQ55166FDOQ55166
Author name not available (Why is that?)
Cited In (8)
- Binary quantized network training with sharpness-aware minimization
- A statistician teaches deep learning
- Associated Learning: Decomposing End-to-End Backpropagation Based on Autoencoders and Target Propagation
- The Stochastic Delta Rule: Faster and More Accurate Deep Learning Through Adaptive Weight Noise
- EGC: entropy-based gradient compression for distributed deep learning
- Title not available (Why is that?)
- On the convergence analysis of asynchronous SGD for solving consistent linear systems
- Deep double descent: where bigger models and more data hurt*
This page was built for software: GPipe