Multi-target regression via input space expansion: treating targets as inputs

DOI10.1007/S10994-016-5546-ZMaRDI QIDQ1689552zbMATH OpenOpenAlexDBLPWikidataFDO

Authors Eleftherios Spyromitros-Xioufis, Grigorios Tsoumakas, William Groves, Ioannis Vlahavas

Publication date 12 January 2018

Published in Machine Learning (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1211.6581

multi-label classification chaining stacking multi-target regression

Classification and discrimination; cluster analysis (statistical aspects) (62H30) Learning and adaptive systems in artificial intelligence (68T05)

Abstract: In many practical applications of supervised learning the task involves the prediction of multiple target variables from a common set of input variables. When the prediction targets are binary the task is called multi-label classification, while when the targets are continuous the task is called multi-target regression. In both tasks, target variables often exhibit statistical dependencies and exploiting them in order to improve predictive accuracy is a core challenge. A family of multi-label classification methods address this challenge by building a separate model for each target on an expanded input space where other targets are treated as additional input variables. Despite the success of these methods in the multi-label classification domain, their applicability and effectiveness in multi-target regression has not been studied until now. In this paper, we introduce two new methods for multi-target regression, called Stacked Single-Target and Ensemble of Regressor Chains, by adapting two popular multi-label classification methods of this family. Furthermore, we highlight an inherent problem of these methods - a discrepancy of the values of the additional input variables between training and prediction - and develop extensions that use out-of-sample estimates of the target variables during training in order to tackle this problem. The results of an extensive experimental evaluation carried out on a large and diverse collection of datasets show that, when the discrepancy is appropriately mitigated, the proposed methods attain consistent improvements over the independent regressions baseline. Moreover, two versions of Ensemble of Regression Chains perform significantly better than four state-of-the-art methods including regularization-based multi-task learning methods and a multi-objective random forest approach.

Recommendations

Cites work

Cited in

(26)

Describes a project that uses

Uses Software

This page was built for publication: Multi-target regression via input space expansion: treating targets as inputs

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1689552)