A novel Frank-Wolfe algorithm. Analysis and applications to large-scale SVM training

DOI10.1016/J.INS.2014.03.059zbMATH Open1355.68234arXiv1304.1014OpenAlexW2964052549WikidataQ62047577 ScholiaQ62047577MaRDI QIDQ508681FDOQ508681

Authors: Ricardo Ñanculef, Emanuele Frandi, Claudio Sartori, Héctor Allende

Publication date: 7 February 2017

Published in: Information Sciences (Search for Journal in Brave)

Abstract: Recently, there has been a renewed interest in the machine learning community for variants of a sparse greedy approximation procedure for concave optimization known as {the Frank-Wolfe (FW) method}. In particular, this procedure has been successfully applied to train large-scale instances of non-linear Support Vector Machines (SVMs). Specializing FW to SVM training has allowed to obtain efficient algorithms but also important theoretical results, including convergence analysis of training algorithms and new characterizations of model sparsity. In this paper, we present and analyze a novel variant of the FW method based on a new way to perform away steps, a classic strategy used to accelerate the convergence of the basic FW procedure. Our formulation and analysis is focused on a general concave maximization problem on the simplex. However, the specialization of our algorithm to quadratic forms is strongly related to some classic methods in computational geometry, namely the Gilbert and MDM algorithms. On the theoretical side, we demonstrate that the method matches the guarantees in terms of convergence rate and number of iterations obtained by using classic away steps. In particular, the method enjoys a linear rate of convergence, a result that has been recently proved for MDM on quadratic forms. On the practical side, we provide experiments on several classification datasets, and evaluate the results using statistical tests. Experiments show that our method is faster than the FW method with classic away steps, and works well even in the cases in which classic away steps slow down the algorithm. Furthermore, these improvements are obtained without sacrificing the predictive accuracy of the obtained SVM model.

Full work available at URL: https://arxiv.org/abs/1304.1014

Recommendations

zbMATH Keywords

quadratic programming concave optimization Frank-Wolfe methods large-scale support vector machines learning from massive datasets

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Cites Work

Cited In (16)

Uses Software

This page was built for publication: A novel Frank-Wolfe algorithm. Analysis and applications to large-scale SVM training

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q508681)