On large batch training and sharp minima: a Fokker-Planck perspective (Q828491): Difference between revisions

From MaRDI portal
 
Property / cites work (each at Normal rank):
    Stochastic modified equations for the asynchronous stochastic gradient descent
    Kramers' law: Validity, derivations and generalisations
    Optimization Methods for Large-Scale Machine Learning
    Metastability in reversible diffusion processes. I: Sharp asymptotics for capacities and exit times
    Metastability in reversible diffusion processes. II: Precise asymptotics for small eigenvalues
    Deep relaxation: partial differential equations for optimizing deep neural networks
    Q5189317
    Q4279615
    Q4637063
    Stochastic Processes and Applications
    Hypocoercivity

Latest revision as of 06:58, 24 July 2024

Language: English
Label: On large batch training and sharp minima: a Fokker-Planck perspective
Description: scientific article

    Statements

    On large batch training and sharp minima: a Fokker-Planck perspective (English)
    8 January 2021
    large batch training
    sharp minima
    Fokker-Planck equation
    stochastic gradient algorithm
    deep neural network
