Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness (Q6145578)

From MaRDI portal
scientific article; zbMATH DE number 7785652
Language Label Description Also known as
English
Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness
scientific article; zbMATH DE number 7785652

    Statements

    Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness (English)
    0 references
    0 references
    9 January 2024
    0 references
    Convergence rate analyses of Adaptive Moment Estimation are widely used in nonconvex optimsation. This paper analyses this method and shows that the method performs well for small learning rates and hyperparameters close to 1. This has implications for nonconvex optimisation for deep learning applications.
    0 references
    0 references
    adaptive moment estimation
    0 references
    batch size
    0 references
    hyperparameters
    0 references
    learning rate
    0 references
    nonconvex optimization
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references