Composite difference-MAX programs for modern statistical estimation problems

From MaRDI portal
Publication:4562249

DOI10.1137/18M117337XzbMATH Open1407.62250arXiv1803.00205OpenAlexW2963562721WikidataQ128749629 ScholiaQ128749629MaRDI QIDQ4562249FDOQ4562249


Authors: Ying Cui, Jong-Shi Pang, Bodhisattva Sen Edit this on Wikidata


Publication date: 19 December 2018

Published in: SIAM Journal on Optimization (Search for Journal in Brave)

Abstract: Many modern statistical estimation problems are defined by three major components: a statistical model that postulates the dependence of an output variable on the input features; a loss function measuring the error between the observed output and the model predicted output; and a regularizer that controls the overfitting and/or variable selection in the model. We study the sampling version of this generic statistical estimation problem where the model parameters are estimated by empirical risk minimization, which involves the minimization of the empirical average of the loss function at the data points weighted by the model regularizer. In our setup we allow all three component functions discussed above to be of the difference-of-convex (dc) type and illustrate them with a host of commonly used examples, including those in continuous piecewise affine regression and in deep learning (where the activation functions are piecewise affine). We describe a nonmonotone majorization-minimization (MM) algorithm for solving the unified nonconvex, nondifferentiable optimization problem which is formulated as a specially structured composite dc program of the pointwise max type, and present convergence results to a directional stationary solution. An efficient semismooth Newton method is proposed to solve the dual of the MM subproblems. Numerical results are presented to demonstrate the effectiveness of the proposed algorithm and the superiority of continuous piecewise affine regression over the standard linear model.


Full work available at URL: https://arxiv.org/abs/1803.00205




Recommendations




Cites Work


Cited In (33)

Uses Software





This page was built for publication: Composite difference-MAX programs for modern statistical estimation problems

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4562249)