Adaptive Gradient Methods Converge Faster with Over-Parameterization (but you should do a line-search) (Q6342704)

From MaRDI portal
Revision as of 08:54, 10 July 2024 by Import240710060729 (talk | contribs) (Added link to MaRDI item.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
preprint article from arXiv
Language Label Description Also known as
English
Adaptive Gradient Methods Converge Faster with Over-Parameterization (but you should do a line-search)
preprint article from arXiv

    Statements

    11 June 2020
    0 references
    cs.LG
    0 references
    math.OC
    0 references
    stat.ML
    0 references
    Sharan Vaswani
    0 references
    Issam Laradji
    0 references
    Frederik Kunstner
    0 references
    Si Yi Meng
    0 references
    Mark Schmidt
    0 references
    Simon Lacoste-Julien
    0 references

    Identifiers

    0 references