Variance reduction trends on `boosted' classifiers (Q2495106)

From MaRDI portal
Revision as of 07:20, 5 March 2024 by Import240304020342 (talk | contribs) (Set profile property.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
Variance reduction trends on `boosted' classifiers
scientific article

    Statements

    Variance reduction trends on `boosted' classifiers (English)
    0 references
    0 references
    3 July 2006
    0 references
    Summary: Ensemble classification techniques such as bagging, [\textit{L. Breiman}, Mach. Learn. 24, No. 2, 123--140 (1996; Zbl 0858.68080)], boosting [\textit{Y. Freund} and \textit{R. E. Schapire} [J. Comput. Syst. Sci. 55, No. 1, 119--139 (1997; Zbl 0880.68103)] and arcing algorithms \textit{L. Breiman} [Ann. Stat. 26, No. 3, 801--849 (1998; Zbl 0934.62064)] have received much attention in recent literature. Such techniques have been shown to lead to reduced classification error on unseen cases. Even when the ensemble is trained well beyond zero training set error, the ensemble continues to exhibit improved classification errors on unseen cases. Despite many studies and conjectures, the reasons behind this improved performance and understanding of the underlying probabilistic structures remain open and challenging problems. More recently, diagnostics such as edge and margin [\textit{R.E. Schapire} and \textit{Y. Singer}, Mach. Learn. 37, No. 3, 297--336 (1999; Zbl 0945.68194)] have been used to explain the improvements made when ensemble classifiers are built. This paper presents some interesting results from an empirical study performed on a set of representative data sets using the decision tree learner C4.5. An exponential-like decay in the variance of the edge is observed as the number of boosting trials is increased. i.e., boosting appears to `homogenise' the edge. Some initial theory is presented which indicates that a lack of correlation between the errors of individual classifiers is a key factor in this variance reduction.
    0 references
    classification
    0 references
    ensemble classifiers
    0 references
    boosting
    0 references
    variance reduction
    0 references

    Identifiers