A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale (Q6450892)

From MaRDI portal

preprint article from arXiv

      Statements

      Publication date: 12 September 2023
      arXiv classification: cs.LG, cs.DC, cs.MS, math.OC
      Authors: Hao-Jun Michael Shi, Tsung-Hsien Lee, Shintaro Iwasaki, Jose Gallego-Posada, Zhijing Li, Kaushik Rangadurai, Dheevatsa Mudigere, Michael Rabbat
