A data driven trimming procedure for robust classification

From MaRDI portal
Publication:6282083

arXiv1701.05065MaRDI QIDQ6282083FDOQ6282083


Authors: Marina Antolín, Eustasio Del Barrio, Jean-Michel Loubes Edit this on Wikidata


Publication date: 18 January 2017

Abstract: Classification rules can be severely affected by the presence of disturbing observations in the training sample. Looking for an optimal classifier with such data may lead to unnecessarily complex rules. So, simpler effective classification rules could be achieved if we relax the goal of fitting a good rule for the whole training sample but only consider a fraction of the data. In this paper we introduce a new method based on trimming to produce classification rules with guaranteed performance on a significant fraction of the data. In particular, we provide an automatic way of determining the right trimming proportion and obtain in this setting oracle bounds for the classification error on the new data set.













This page was built for publication: A data driven trimming procedure for robust classification

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6282083)