Ensemble quantile classifier
From MaRDI portal
Publication:2291292
DOI: 10.1016/J.CSDA.2019.106849
zbMATH Open: 1504.62081
arXiv: 1910.12960
OpenAlex: W2978423104
Wikidata: Q127171533
Scholia: Q127171533
MaRDI QID: Q2291292
FDO: Q2291292
Authors: Yanyan Li
Publication date: 30 January 2020
Published in: Computational Statistics and Data Analysis
Abstract: Both the median-based classifier and the quantile-based classifier are useful for discriminating high-dimensional data with heavy-tailed or skewed inputs. But these methods are restricted as they assign equal weight to each variable in an unregularized way. The ensemble quantile classifier is a more flexible regularized classifier that provides better performance with high-dimensional data, asymmetric data or when there are many irrelevant extraneous inputs. The improved performance is demonstrated by a simulation study as well as an application to text categorization. It is proven that the estimated parameters of the ensemble quantile classifier consistently estimate the minimal population loss under suitable general model assumptions. It is also shown that the ensemble quantile classifier is Bayes optimal under suitable assumptions with asymmetric Laplace distribution inputs.
Full work available at URL: https://arxiv.org/abs/1910.12960
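The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each variable is turned into a quantile-based score (the difference in check loss between the two class quantiles), that the equal-weight quantile classifier sums these scores, and that the ensemble version instead learns a regularized weight per variable. The ridge-penalized logistic regression fitted by gradient descent, the fixed quantile level `theta`, and all function names (`check_loss`, `quantile_features`, `fit_eqc`, `predict_eqc`, `predict_equal_weight`) are assumptions made for illustration only.

```python
import numpy as np


def check_loss(u, theta):
    """Quantile (check) loss: rho_theta(u) = u * (theta - 1[u < 0])."""
    return u * (theta - (u < 0))


def quantile_features(X, q0, q1, theta):
    """Per-variable score; positive when x_j is closer (in check loss)
    to the class-1 quantile than to the class-0 quantile."""
    return check_loss(X - q0, theta) - check_loss(X - q1, theta)


def fit_eqc(X, y, theta=0.5, lam=0.1, lr=0.1, n_iter=2000):
    """Fit class quantiles, then learn one weight per variable.

    Assumption: weights come from ridge-penalized logistic regression
    trained by plain gradient descent; labels y are coded 0/1.
    """
    q0 = np.quantile(X[y == 0], theta, axis=0)
    q1 = np.quantile(X[y == 1], theta, axis=0)
    Z = quantile_features(X, q0, q1, theta)
    w = np.zeros(Z.shape[1])
    b = 0.0
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(Z @ w + b)))
        w -= lr * (Z.T @ (p - y) / len(y) + lam * w)  # penalized gradient
        b -= lr * np.mean(p - y)                      # intercept is unpenalized
    return {"theta": theta, "q0": q0, "q1": q1, "w": w, "b": b}


def predict_eqc(model, X):
    """Weighted vote over the per-variable quantile scores."""
    Z = quantile_features(X, model["q0"], model["q1"], model["theta"])
    return (Z @ model["w"] + model["b"] > 0).astype(int)


def predict_equal_weight(model, X):
    """Baseline: every variable gets the same weight (unregularized)."""
    Z = quantile_features(X, model["q0"], model["q1"], model["theta"])
    return (Z.sum(axis=1) > 0).astype(int)
```

In this sketch, irrelevant variables can receive weights near zero, whereas the equal-weight baseline lets every variable vote with the same strength. In practice `theta` and `lam` would be tuned, for example by cross-validation, rather than fixed.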
Recommendations
- Quantile-based classifiers
- Median-based classifiers for high-dimensional data
- Quantile-distribution functions and their use for classification, with application to naïve Bayes classifiers
- Random-projection ensemble classification. (With discussion).
- Quantile regression with group Lasso for classification
Keywords: sparsity; binary classification; text mining; pattern recognition and machine learning; extraneous noise variables; high-dimensional discriminant analysis
Cites Work
- Statistical Analysis of Financial Data in S-Plus
- A decision-theoretic generalization of on-line learning and an application to boosting
- An introduction to statistical learning. With applications in R
- Regression Quantiles
- Random forests
- Support-vector networks
- Quantile regression.
- Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data
- High-dimensional classification using features annealed independence rules
- Class prediction by nearest shrunken centroids, with applications to DNA microarrays.
- Some theory for Fisher's linear discriminant function, 'naive Bayes', and some alternatives when there are many more variables than observations
- Quantile-based classifiers
- Generating random correlation matrices based on partial correlations
- Penalized logistic regression for detecting gene interactions
- Boosting. Foundations and algorithms.
- Applied predictive modeling
- Stacked regressions
- Title not available
- Some characterizations of almost sure bounds for weighted multidimensional empirical distributions and a Glivenko-Cantelli theorem for sample quantiles
- Median-based classifiers for high-dimensional data
Cited In (6)
- Directional Quantile Classifiers
- Classification of multivariate objects using interval quantile classes
- Median-based classifiers for high-dimensional data
- Quantile-distribution functions and their use for classification, with application to naïve Bayes classifiers
- A nonparametric ensemble binary classifier and its statistical properties
- Quantile-based classifiers