scientific article; zbMATH DE number 6253934

From MaRDI portal

Publication:5396673

zbMath: 1280.68164, MaRDI QID: Q5396673

John C. Duchi, Elad Hazan, Yoram Singer

Publication date: 3 February 2014

Full work available at URL: http://www.jmlr.org/papers/v12/duchi11a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (first 100 items shown)

Mini-Batch Metropolis–Hastings With Reversible SGLD Proposal
Bayesian Projected Calibration of Computer Models
Artificial-neural-network-based nonlinear algebraic models for large-eddy simulation of compressible wall-bounded turbulence
Linearly Constrained Nonsmooth Optimization for Training Autoencoders
Subgradient ellipsoid method for nonsmooth convex problems
Combining gradient optimization and machine learning methods for inverse problems in layered heterogeneous media
A stochastic gradient method for a class of nonlinear PDE-constrained optimal control problems under uncertainty
SCORE: approximating curvature information under self-concordant regularization
Block-cyclic stochastic coordinate descent for deep neural networks
Automatic, dynamic, and nearly optimal learning rate specification via local quadratic approximation
How to handle noisy labels for robust learning from uncertainty
A distributed optimisation framework combining natural gradient with Hessian-free for discriminative sequence training
Convergence analysis of AdaBound with relaxed bound functions for non-convex optimization
Stratified Cox models with time‐varying effects for national kidney transplant patients: A new blockwise steepest ascent method
Stochastic momentum methods for non-convex learning without bounded assumptions
Multivariate online regression analysis with heterogeneous streaming data
Graph deep learning model for mapping mineral prospectivity
An indefinite proximal subgradient-based algorithm for nonsmooth composite optimization
A mini-batch stochastic conjugate gradient algorithm with variance reduction
Comprehensive study of variational Bayes classification for dense deep neural networks
Three ways to solve partial differential equations with neural networks — A review
Time series analysis and prediction of nonlinear systems with ensemble learning framework applied to deep learning neural networks
Efficient learning rate adaptation based on hierarchical optimization approach
A zeroing neural dynamics based acceleration optimization approach for optimizers in deep neural networks
Multilevel Objective-Function-Free Optimization with an Application to Neural Networks Training
Successfully and efficiently training deep multi-layer perceptrons with logistic activation function simply requires initializing the weights with an appropriate negative mean
Eigenvalue-Corrected Natural Gradient Based on a New Approximation
Convergence of the RMSProp deep learning method with penalty for nonconvex optimization
A stepwise physics‐informed neural network for solving large deformation problems of hypoelastic materials
Semi-implicit back propagation
Variational inference for Bayesian bridge regression
Adaptive stochastic gradient descent for optimal control of parabolic equations with random parameters
Facial Action Units Detection to Identify Interest Emotion: An Application of Deep Learning
Parallel and distributed asynchronous adaptive stochastic gradient methods
Speeding-up one-versus-all training for extreme classification via mean-separating initialization
SVRG meets AdaGrad: painless variance reduction
Variance reduction on general adaptive stochastic mirror descent
Optimistic optimisation of composite objective with exponentiated update
Black Box Variational Bayesian Model Averaging
A noise-based stabilizer for convolutional neural networks
Online Covariance Matrix Estimation in Stochastic Gradient Descent
Batching Adaptive Variance Reduction
Error convergence and engineering-guided hyperparameter search of PINNs: towards optimized I-FENN performance
Convergence Properties of an Objective-Function-Free Optimization Regularization Algorithm, Including an \(\boldsymbol{\mathcal{O}(\epsilon^{-3/2})}\) Complexity Bound
Adaptive step size rules for stochastic optimization in large-scale learning
Addressing discontinuous root-finding for subsequent differentiability in machine learning, inverse problems, and control
Efficient approximations of the fisher matrix in neural networks using kronecker product singular value decomposition
Projective Integral Updates for High-Dimensional Variational Inference
Online decision making for trading wind energy
A new taxonomy of global optimization algorithms
Variable separated physics-informed neural networks based on adaptive weighted loss functions for blood flow model
Theoretical analysis of Adam using hyperparameters close to one without Lipschitz smoothness
Incremental Majorization-Minimization Optimization with Application to Large-Scale Machine Learning
Trust-region algorithms for training responses: machine learning methods using indefinite Hessian approximations
On the Adaptivity of Stochastic Gradient-Based Optimization
The Discriminative Kalman Filter for Bayesian Filtering with Nonlinear and Nongaussian Observation Models
A Continuous-Time Analysis of Distributed Stochastic Gradient
An Infinite Restricted Boltzmann Machine
Nonconvex Policy Search Using Variational Inequalities
A Unified Adaptive Tensor Approximation Scheme to Accelerate Composite Convex Optimization
Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review
Accelerating Sparse Recovery by Reducing Chatter
Convergence and Dynamical Behavior of the ADAM Algorithm for Nonconvex Stochastic Optimization
Convergence of Newton-MR under Inexact Hessian Information
Why Does Large Batch Training Result in Poor Generalization? A Comprehensive Explanation and a Better Strategy from the Viewpoint of Stochastic Optimization
$l_p$ Regularization for Ensemble Kalman Inversion
PNKH-B: A Projected Newton--Krylov Method for Large-Scale Bound-Constrained Optimization
A Distributed Optimal Control Problem with Averaged Stochastic Gradient Descent
Dying ReLU and Initialization: Theory and Numerical Examples
Stochastic Quasi-Newton Methods for Nonconvex Stochastic Optimization
Ensemble Kalman inversion: a derivative-free technique for machine learning tasks
Unnamed Item
Scalable estimation strategies based on stochastic approximations: classical results and new insights
Stochastic sub-sampled Newton method with variance reduction
Machine Learning in Adaptive Domain Decomposition Methods---Predicting the Geometric Location of Constraints
Adaptive sequential machine learning
A Stochastic Line Search Method with Expected Complexity Analysis
Unnamed Item
Abstract convergence theorem for quasi-convex optimization problems with applications
An Inertial Newton Algorithm for Deep Learning
A Stochastic Semismooth Newton Method for Nonsmooth Nonconvex Optimization
Unnamed Item
Unnamed Item
Unnamed Item
Entropy-SGD: biasing gradient descent into wide valleys
Conformal symplectic and relativistic optimization
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Stochastic proximal linear method for structured non-convex problems
An accelerated communication-efficient primal-dual optimization framework for structured machine learning
Joint Online Parameter Estimation and Optimal Sensor Placement for the Partially Observed Stochastic Advection-Diffusion Equation
Distributed Stochastic Inertial-Accelerated Methods with Delayed Derivatives for Nonconvex Problems
Adaptive online distributed optimization in dynamic environments
An Adaptive Gradient Method with Energy and Momentum
Multi-Objective Optimization of Laminated Functionally Graded Carbon Nanotube-Reinforced Composite Plates Using Deep Feedforward Neural Networks-NSGAII Algorithm
An inexact first-order method for constrained nonlinear optimization
A fully stochastic second-order trust region method





