A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale

Publication:6450892

arXiv: 2309.06497 | MaRDI QID: Q6450892 | FDO: Q6450892

Jose Gallego-Posada, Kaushik Rangadurai, Shintaro Iwasaki, Tsung-Hsien Lee, Zhijing Li, Michael Rabbat, Dheevatsa Mudigere, Hao-Jun Michael Shi

Publication date: 12 September 2023




Has companion code repository: https://github.com/facebookresearch/optimizers/tree/main/distributed_shampoo









