Optimization for deep learning: an overview (Q2218095): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(20 intermediate revisions by 4 users not shown)
Property / describes a project that uses
 
Property / describes a project that uses: ADADELTA / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: EfficientNet / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: Entropy-SGD / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: darch / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: AdaGrad / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: Tensor2Tensor / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: Saga / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: neural-tangents / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: word2vec / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: AlexNet / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: SGD-QN / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: ImageNet / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: RMSprop / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: Adam / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: GloVe / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: BERT / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: SGDR / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s40305-020-00309-6 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W3034315405 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimization Methods for Large-Scale Machine Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5270493 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Generalization Error in Deep Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reducing the Dimensionality of Data with Neural Networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2896045 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Restart procedures for the conjugate gradient method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive restart for accelerated gradient schemes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4762624 / rank
 
Normal rank
Property / cites work
 
Property / cites work: First-order methods of smooth convex optimization with inexact oracle / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4558559 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4558562 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Accelerated Methods for NonConvex Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5396673 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4001523 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3626659 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Two-Point Step Size Gradient Methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2880947 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Fast Curvature Matrix-Vector Products for Second-Order Gradient Descent / rank
 
Normal rank
Property / cites work
 
Property / cites work: A sensitive-eigenvector based global algorithm for quadratically constrained quadratic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview / rank
 
Normal rank
Property / cites work
 
Property / cites work: Flat Minima / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reconciling modern machine-learning practice and the classical bias–variance trade-off / rank
 
Normal rank
Property / cites work
 
Property / cites work: Effect of Depth and Width on Local Minima in Deep Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Spurious Valleys in Two-layer Neural Network Optimization Landscapes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Gradient descent optimizes over-parameterized deep ReLU networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean field analysis of neural networks: a central limit theorem / rank
 
Normal rank
Property / cites work
 
Property / cites work: A mean field view of the landscape of two-layer neural networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Local minima and convergence in low-rank semidefinite programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Theoretical Insights Into the Optimization Landscape of Over-Parameterized Shallow Neural Networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning ReLU Networks on Linearly Separable Data: Algorithm, Optimality, and Generalization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Randomized Methods for Linear Constraints: Convergence Rates and Conditioning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Efficiency of Coordinate Descent Methods on Huge-Scale Optimization Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Numerical Optimization / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 08:14, 24 July 2024

scientific article
Language Label Description Also known as
English
Optimization for deep learning: an overview
scientific article

    Statements

    Optimization for deep learning: an overview (English)
    0 references
    0 references
    12 January 2021
    0 references
    deep learning
    0 references
    non-convex optimization
    0 references
    neural networks
    0 references
    convergence
    0 references
    landscape
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers