Forward-backward selection with early dropping

From MaRDI portal
Publication:4633015

DOI10.48550/ARXIV.1705.10770zbMATH Open1483.68279arXiv1705.10770MaRDI QIDQ4633015FDOQ4633015


Authors: Giorgos Borboudakis, Ioannis Tsamardinos, Giorgos Borboudakis, Ioannis Tsamardinos Edit this on Wikidata


Publication date: 2 May 2019

Abstract: Forward-backward selection is one of the most basic and commonly-used feature selection algorithms available. It is also general and conceptually applicable to many different types of data. In this paper, we propose a heuristic that significantly improves its running time, while preserving predictive accuracy. The idea is to temporarily discard the variables that are conditionally independent with the outcome given the selected variable set. Depending on how those variables are reconsidered and reintroduced, this heuristic gives rise to a family of algorithms with increasingly stronger theoretical guarantees. In distributions that can be faithfully represented by Bayesian networks or maximal ancestral graphs, members of this algorithmic family are able to correctly identify the Markov blanket in the sample limit. In experiments we show that the proposed heuristic increases computational efficiency by about two orders of magnitude in high-dimensional problems, while selecting fewer variables and retaining predictive performance. Furthermore, we show that the proposed algorithm and feature selection with LASSO perform similarly when restricted to select the same number of variables, making the proposed algorithm an attractive alternative for problems where no (efficient) algorithm for LASSO exists.


Full work available at URL: https://arxiv.org/abs/1705.10770




Recommendations




Cites Work


Cited In (9)

Uses Software





This page was built for publication: Forward-backward selection with early dropping

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4633015)