Sparsity in optimal randomized classification trees

DOI: 10.1016/J.EJOR.2019.12.002; MaRDI QID: Q2301963

Authors R. Blanquero, Emilio Carrizosa, Cristina Molero-Río, Dolores Romero Morales

Publication date 25 February 2020

Published in European Journal of Operational Research

Full work available at URL https://arxiv.org/abs/2002.09191

Keywords: data mining; nonlinear programming; global and local sparsity; optimal classification trees

Classification and discrimination; cluster analysis (statistical aspects) (62H30); Learning and adaptive systems in artificial intelligence (68T05); Applications of mathematical programming (90C90)

Abstract: Decision trees are popular classification and regression tools and, when small-sized, easy to interpret. Traditionally, a greedy approach has been used to build the trees, yielding a very fast training process; however, controlling sparsity (a proxy for interpretability) is challenging. In recent studies, optimal decision trees, where all decisions are optimized simultaneously, have shown better learning performance, especially when oblique cuts are implemented. In this paper, we propose a continuous optimization approach to build sparse optimal classification trees, based on oblique cuts, with the aim of using fewer predictor variables in the cuts as well as along the whole tree. Both types of sparsity, namely local and global, are modeled by means of regularizations with polyhedral norms. The computational experience reported supports the usefulness of our methodology. In all our data sets, local and global sparsity can be improved without harming classification accuracy. We also show that, unlike greedy approaches, our method can easily trade some classification accuracy for a gain in global sparsity.
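The distinction between local and global sparsity can be illustrated with a small sketch. The abstract does not say which polyhedral norms are used, so the code below assumes two common choices: an ℓ1-type penalty (sum of absolute coefficients) for local sparsity and a per-predictor ℓ∞-type penalty (largest absolute coefficient of a predictor across all branch nodes) for global sparsity. The function names, the coefficient layout (rows are predictors, columns are branch nodes) and the example values are hypothetical and are not taken from the paper.

```python
from typing import List

Matrix = List[List[float]]  # rows: predictors, columns: branch nodes (assumed layout)


def local_penalty(coeffs: Matrix) -> float:
    """Assumed l1-type penalty: sum of |a| over every predictor and branch node.

    Shrinking it pushes individual cut coefficients to zero, so each
    oblique cut tends to involve fewer predictors.
    """
    return sum(abs(a) for row in coeffs for a in row)


def global_penalty(coeffs: Matrix) -> float:
    """Assumed l-infinity-type penalty: per predictor, the largest |a| over
    all branch nodes, summed over predictors.

    A predictor contributes zero only if it is unused in every cut, so
    shrinking this term tends to remove predictors from the whole tree.
    """
    return sum(max(abs(a) for a in row) for row in coeffs)


def predictors_used(coeffs: Matrix, tol: float = 1e-9) -> List[int]:
    """Indices of predictors with a nonzero coefficient in at least one cut."""
    return [j for j, row in enumerate(coeffs) if any(abs(a) > tol for a in row)]


# Hypothetical tree with 3 predictors and 3 branch nodes.
coeffs = [
    [0.5, 0.0, -1.0],  # predictor 0 appears in two cuts
    [0.0, 0.0, 0.0],   # predictor 1 is never used (globally sparse)
    [0.0, 2.0, 0.0],   # predictor 2 appears in one cut
]

print(local_penalty(coeffs))
print(global_penalty(coeffs))
print(predictors_used(coeffs))
```

In a regularized training objective, terms of this kind would be added to the classification loss with nonnegative weights, and larger weights would favor sparser trees at a possible cost in accuracy.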

Cited in (23)
