Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness (Q6145578)

From MaRDI portal
DOI: 10.1007/S11075-023-01575-0

Latest revision as of 19:53, 30 December 2024

Language: English
Label: Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness
Description: scientific article; zbMATH DE number 7785652

    Statements

    Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness (English)
    9 January 2024
    Adaptive Moment Estimation (Adam) is widely used in nonconvex optimisation, and its convergence rate has been analysed extensively. This paper analyses the method without assuming Lipschitz smoothness and shows that it performs well for small learning rates and hyperparameters close to 1. This has implications for nonconvex optimisation in deep learning applications.
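
For orientation, the following is a minimal sketch of the standard Adam update in plain Python, not the paper's own formulation or experiments. It assumes that the "hyperparameters close to 1" are the moment decay rates, here named `beta1` and `beta2`, and that `lr` is the learning rate. The helper names `adam_step`, `minimize`, and `toy_grad`, and the one-dimensional toy objective f(x) = x^4 - 3x^2 + x, are hypothetical illustrations; the objective was chosen only because it is nonconvex and its gradient is not globally Lipschitz.

```python
import math


def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one standard Adam update to a list of floats.

    t is the 1-based step counter used for bias correction.
    Returns the updated (theta, m, v) as new lists.
    """
    new_theta, new_m, new_v = [], [], []
    for x, g, m_i, v_i in zip(theta, grad, m, v):
        # Exponential moving averages of the gradient and its square.
        m_i = beta1 * m_i + (1.0 - beta1) * g
        v_i = beta2 * v_i + (1.0 - beta2) * g * g
        # Bias correction for the zero initialisation of m and v.
        m_hat = m_i / (1.0 - beta1 ** t)
        v_hat = v_i / (1.0 - beta2 ** t)
        new_theta.append(x - lr * m_hat / (math.sqrt(v_hat) + eps))
        new_m.append(m_i)
        new_v.append(v_i)
    return new_theta, new_m, new_v


def minimize(grad_fn, theta0, steps=2000, **kwargs):
    """Run Adam for a fixed number of steps from theta0 (hypothetical helper)."""
    theta = list(theta0)
    m = [0.0] * len(theta)
    v = [0.0] * len(theta)
    for t in range(1, steps + 1):
        theta, m, v = adam_step(theta, grad_fn(theta), m, v, t, **kwargs)
    return theta


def toy_grad(theta):
    """Gradient of the toy objective f(x) = x^4 - 3x^2 + x (an assumption)."""
    return [4.0 * x ** 3 - 6.0 * x + 1.0 for x in theta]


if __name__ == "__main__":
    # Small learning rate, decay rates close to 1 (values chosen for illustration).
    print(minimize(toy_grad, [2.0], lr=0.01, beta1=0.9, beta2=0.999))
```

On the first step the bias-corrected ratio reduces to roughly the sign of the gradient, so the initial move has size close to `lr` regardless of the gradient's scale. This is one informal way to see why the update does not rely on a Lipschitz constant to choose the step size.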
    adaptive moment estimation
    batch size
    hyperparameters
    learning rate
    nonconvex optimization