scientific article; zbMATH DE number 6253934

John C. Duchi, Elad Hazan, Yoram Singer

Publication date: 3 February 2014

Full work available at URL: http://www.jmlr.org/papers/v12/duchi11a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

adaptivity online learning subgradient methods stochastic convex optimization

Convex programming (90C25) Learning and adaptive systems in artificial intelligence (68T05) Stochastic programming (90C15)

Related Items (only showing first 100 items - show all)

Mini-Batch Metropolis–Hastings With Reversible SGLD Proposal ⋮ Bayesian Projected Calibration of Computer Models ⋮ Artificial-neural-network-based nonlinear algebraic models for large-eddy simulation of compressible wall-bounded turbulence ⋮ Linearly Constrained Nonsmooth Optimization for Training Autoencoders ⋮ Subgradient ellipsoid method for nonsmooth convex problems ⋮ Combining gradient optimization and machine learning methods for inverse problems in layered heterogeneous media ⋮ A stochastic gradient method for a class of nonlinear PDE-constrained optimal control problems under uncertainty ⋮ SCORE: approximating curvature information under self-concordant regularization ⋮ Block-cyclic stochastic coordinate descent for deep neural networks ⋮ Automatic, dynamic, and nearly optimal learning rate specification via local quadratic approximation ⋮ How to handle noisy labels for robust learning from uncertainty ⋮ A distributed optimisation framework combining natural gradient with Hessian-free for discriminative sequence training ⋮ Convergence analysis of AdaBound with relaxed bound functions for non-convex optimization ⋮ Stratified Cox models with time‐varying effects for national kidney transplant patients: A new blockwise steepest ascent method ⋮ Stochastic momentum methods for non-convex learning without bounded assumptions ⋮ Multivariate online regression analysis with heterogeneous streaming data ⋮ Graph deep learning model for mapping mineral prospectivity ⋮ An indefinite proximal subgradient-based algorithm for nonsmooth composite optimization ⋮ A mini-batch stochastic conjugate gradient algorithm with variance reduction ⋮ Comprehensive study of variational Bayes classification for dense deep neural networks ⋮ Three ways to solve partial differential equations with neural networks — A review ⋮ Time series analysis and prediction of nonlinear systems with ensemble learning framework applied to deep learning neural networks ⋮ Efficient learning rate adaptation based on hierarchical optimization approach ⋮ A zeroing neural dynamics based acceleration optimization approach for optimizers in deep neural networks ⋮ Multilevel Objective-Function-Free Optimization with an Application to Neural Networks Training ⋮ Successfully and efficiently training deep multi-layer perceptrons with logistic activation function simply requires initializing the weights with an appropriate negative mean ⋮ Eigenvalue-Corrected Natural Gradient Based on a New Approximation ⋮ Convergence of the RMSProp deep learning method with penalty for nonconvex optimization ⋮ A stepwise physics‐informed neural network for solving large deformation problems of hypoelastic materials ⋮ Semi-implicit back propagation ⋮ Variational inference for Bayesian bridge regression ⋮ Adaptive stochastic gradient descent for optimal control of parabolic equations with random parameters ⋮ Facial Action Units Detection to Identify Interest Emotion: An Application of Deep Learning ⋮ Parallel and distributed asynchronous adaptive stochastic gradient methods ⋮ Speeding-up one-versus-all training for extreme classification via mean-separating initialization ⋮ SVRG meets AdaGrad: painless variance reduction ⋮ Variance reduction on general adaptive stochastic mirror descent ⋮ Optimistic optimisation of composite objective with exponentiated update ⋮ Black Box Variational Bayesian Model Averaging ⋮ A noise-based stabilizer for convolutional neural networks ⋮ Online Covariance Matrix Estimation in Stochastic Gradient Descent ⋮ Batching Adaptive Variance Reduction ⋮ Error convergence and engineering-guided hyperparameter search of PINNs: towards optimized I-FENN performance ⋮ Convergence Properties of an Objective-Function-Free Optimization Regularization Algorithm, Including an $\boldsymbol{\mathcal{O}(\epsilon^{-3/2})}$ Complexity Bound ⋮ Adaptive step size rules for stochastic optimization in large-scale learning ⋮ Addressing discontinuous root-finding for subsequent differentiability in machine learning, inverse problems, and control ⋮ Efficient approximations of the fisher matrix in neural networks using kronecker product singular value decomposition ⋮ Projective Integral Updates for High-Dimensional Variational Inference ⋮ Online decision making for trading wind energy ⋮ A new taxonomy of global optimization algorithms ⋮ Variable separated physics-informed neural networks based on adaptive weighted loss functions for blood flow model ⋮ Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness ⋮ Incremental Majorization-Minimization Optimization with Application to Large-Scale Machine Learning ⋮ Trust-region algorithms for training responses: machine learning methods using indefinite Hessian approximations ⋮ On the Adaptivity of Stochastic Gradient-Based Optimization ⋮ The Discriminative Kalman Filter for Bayesian Filtering with Nonlinear and Nongaussian Observation Models ⋮ A Continuous-Time Analysis of Distributed Stochastic Gradient ⋮ An Infinite Restricted Boltzmann Machine ⋮ Nonconvex Policy Search Using Variational Inequalities ⋮ A Unified Adaptive Tensor Approximation Scheme to Accelerate Composite Convex Optimization ⋮ Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review ⋮ Accelerating Sparse Recovery by Reducing Chatter ⋮ Convergence and Dynamical Behavior of the ADAM Algorithm for Nonconvex Stochastic Optimization ⋮ Convergence of Newton-MR under Inexact Hessian Information ⋮ Why Does Large Batch Training Result in Poor Generalization? A Comprehensive Explanation and a Better Strategy from the Viewpoint of Stochastic Optimization ⋮ $l_p$ Regularization for Ensemble Kalman Inversion ⋮ PNKH-B: A Projected Newton--Krylov Method for Large-Scale Bound-Constrained Optimization ⋮ A Distributed Optimal Control Problem with Averaged Stochastic Gradient Descent ⋮ Dying ReLU and Initialization: Theory and Numerical Examples ⋮ Stochastic Quasi-Newton Methods for Nonconvex Stochastic Optimization ⋮ Ensemble Kalman inversion: a derivative-free technique for machine learning tasks ⋮ Unnamed Item ⋮ Scalable estimation strategies based on stochastic approximations: classical results and new insights ⋮ Stochastic sub-sampled Newton method with variance reduction ⋮ Machine Learning in Adaptive Domain Decomposition Methods---Predicting the Geometric Location of Constraints ⋮ Adaptive sequential machine learning ⋮ A Stochastic Line Search Method with Expected Complexity Analysis ⋮ Unnamed Item ⋮ Abstract convergence theorem for quasi-convex optimization problems with applications ⋮ An Inertial Newton Algorithm for Deep Learning ⋮ A Stochastic Semismooth Newton Method for Nonsmooth Nonconvex Optimization ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Entropy-SGD: biasing gradient descent into wide valleys ⋮ Conformal symplectic and relativistic optimization ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Stochastic proximal linear method for structured non-convex problems ⋮ An accelerated communication-efficient primal-dual optimization framework for structured machine learning ⋮ Joint Online Parameter Estimation and Optimal Sensor Placement for the Partially Observed Stochastic Advection-Diffusion Equation ⋮ Distributed Stochastic Inertial-Accelerated Methods with Delayed Derivatives for Nonconvex Problems ⋮ Adaptive online distributed optimization in dynamic environments ⋮ An Adaptive Gradient Method with Energy and Momentum ⋮ Multi-Objective Optimization of Laminated Functionally Graded Carbon Nanotube-Reinforced Composite Plates Using Deep Feedforward Neural Networks-NSGAII Algorithm ⋮ An inexact first-order method for constrained nonlinear optimization ⋮ A fully stochastic second-order trust region method

This page was built for publication: