Propagation of outliers in multivariate data
From MaRDI portal
Publication:1002160
DOI10.1214/07-AOS588zbMATH Open1155.62043arXiv0903.0447MaRDI QIDQ1002160FDOQ1002160
Authors: Fatemah Alqallaf, Ruben H. Zamar, Stefan Van Aelst, Victor J. Yohai
Publication date: 25 February 2009
Published in: The Annals of Statistics (Search for Journal in Brave)
Abstract: We investigate the performance of robust estimates of multivariate location under nonstandard data contamination models such as componentwise outliers (i.e., contamination in each variable is independent from the other variables). This model brings up a possible new source of statistical error that we call "propagation of outliers." This source of error is unusual in the sense that it is generated by the data processing itself and takes place after the data has been collected. We define and derive the influence function of robust multivariate location estimates under flexible contamination models and use it to investigate the effect of propagation of outliers. Furthermore, we show that standard high-breakdown affine equivariant estimators propagate outliers and therefore show poor breakdown behavior under componentwise contamination when the dimension is high.
Full work available at URL: https://arxiv.org/abs/0903.0447
Recommendations
Estimation in multivariate analysis (62H12) Robustness and adaptive procedures (parametric inference) (62F35)
Cites Work
- Robust Estimation of a Location Parameter
- Asymptotic behaviour of S-estimates of multivariate location parameters and dispersion matrices
- Robust m-estimators of multivariate location and scatter
- Constrained \(M\)-estimation for multivariate location and scatter
- On the uniqueness of \(S\)-functionals and \(M\)-functionals under nonelliptical distributions.
- Robust factor analysis.
- Least Median of Squares Regression
- Title not available (Why is that?)
- Breakdown points of affine equivariant estimators of multivariate location and covariance matrices
- On the relation between S-estimators and M-estimators of multivariate location and covariance
- The Future of Data Analysis
- Breakdown Robustness of Tests
- Title not available (Why is that?)
- Title not available (Why is that?)
- Min-max bias robust regression
- Robust singular value decomposition analysis of microarray data
- Lower bounds for contamination bias: Globally minimax versus locally linear estimation
- Multivariate τ-Estimators for Location and Scatter
- Title not available (Why is that?)
Cited In (62)
- Discussion of: ``The power of monitoring: how to make the most of a contaminated multivariate sample
- Robust estimation of precision matrices under cellwise contamination
- Stahel-Donoho estimators with cellwise weights
- Snipping for robust \(k\)-means clustering under component-wise contamination
- Multivariate outlier detection in applied data analysis: global, local, compositional and cellwise outliers
- Simultaneous feature selection and outlier detection with optimality guarantees
- Jump robust daily covariance estimation by disentangling variance and correlation components
- High-dimensional robust precision matrix estimation: cellwise corruption under \(\epsilon \)-contamination
- Detection and correction of outliers in the bivariate chain-ladder method
- Comments on: ``Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
- Multidimensional outlier-proneness of dependent data and the extremal index
- Robust variable selection under cellwise contamination
- Sparse regression for large data sets with outliers
- Robust regression with compositional covariates including cellwise outliers
- Quantitative robustness of instance ranking problems
- Stahel-Donoho estimation for high-dimensional data
- Resistant estimates for high dimensional and functional data based on random projections
- Comments on ``Data science, big data and statistics
- Robust estimation of the hierarchical model for responses and response times
- A discussion on the robust vector autoregressive models: novel evidence from safe haven assets
- Multivariate outlier detection based on a robust Mahalanobis distance with shrinkage estimators
- Robust statistics: a selective overview and new directions
- Robust and sparse estimation of graphical models based on multivariate winsorization
- Robust estimation of general linear mixed effects models
- Cluster analysis with cellwise trimming and applications for the robust clustering of curves
- Cellwise outlier detection with false discovery rate control
- Robust Multivariate Lasso Regression with Covariance Estimation
- Multivariate location and scatter matrix estimation under cellwise and casewise contamination
- RDELA -- a Delaunay-triangulation-based, location and covariance estimator with high breakdown point
- Gini's mean difference and variance as measures of finite populations scales
- Robust estimation of AR coefficients under simultaneously influencing outliers and missing values
- Robust regression estimation and inference in the presence of cellwise and casewise contamination
- Asymptotic linear expansion of regularized M-estimators
- Robust correlation scaled principal component regression
- Robust tools for the imperfect world
- Robust regression estimation and variable selection when cellwise and casewise outliers are present
- Robust and sparse estimation of the inverse covariance matrix using rank correlation measures
- Multiple scaled contaminated normal distribution and its application in clustering
- Robust nonlinear principal components
- The power of (extended) monitoring in robust clustering. Discussion of ``The power of monitoring: how to make the most of a contaminated multivariate sample
- Robust VIF regression with application to variable selection in large data sets
- Fast Robust Correlation for High-Dimensional Data
- Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
- Robust principal component analysis based on pairwise correlation estimators
- Robust multivariate estimation based on statistical depth filters
- The Gaussian rank correlation estimator: robustness properties
- Multivariate Outliers and the O3 Plot
- The shooting S-estimator for robust regression
- Outlier detection and robust covariance estimation using mathematical programming
- Comments on: ``Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
- Rejoinder to `Multivariate functional outlier detection'
- Detecting Deviating Data Cells
- MacroPCA: An All-in-One PCA Method Allowing for Missing Values as Well as Cellwise and Rowwise Outliers
- The Cellwise Minimum Covariance Determinant Estimator
- Sparse Principal Component Analysis Based on Least Trimmed Squares
- Outlier detection via a minimum ridge covariance determinant estimator
- CR-Lasso: robust cellwise regularized sparse regression
- A novel robust estimation for high-dimensional precision matrices
- Fast Robust Location and Scatter Estimation: A Depth-based Method
- Robust Multivariate Functional Control Chart
- Robust clustering based on trimming
- Robust and sparse logistic regression
This page was built for publication: Propagation of outliers in multivariate data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1002160)