Propagation of outliers in multivariate data
From MaRDI portal
Abstract: We investigate the performance of robust estimates of multivariate location under nonstandard data contamination models such as componentwise outliers (i.e., contamination in each variable is independent from the other variables). This model brings up a possible new source of statistical error that we call "propagation of outliers." This source of error is unusual in the sense that it is generated by the data processing itself and takes place after the data has been collected. We define and derive the influence function of robust multivariate location estimates under flexible contamination models and use it to investigate the effect of propagation of outliers. Furthermore, we show that standard high-breakdown affine equivariant estimators propagate outliers and therefore show poor breakdown behavior under componentwise contamination when the dimension is high.
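A rough way to see why componentwise contamination becomes a problem in high dimension: if each cell of an observation is contaminated independently with probability ε, then the chance that a p-dimensional row contains at least one contaminated cell is 1 − (1 − ε)^p. Estimators that treat a row as either clean or outlying then face an effective fraction of affected rows that grows quickly with p. The sketch below only illustrates that arithmetic; the function names and the 5% contamination rate are assumptions made for the example and are not taken from the paper.

```python
def contaminated_row_fraction(eps: float, p: int) -> float:
    """Probability that a row of p cells has at least one contaminated cell,
    assuming each cell is contaminated independently with probability eps."""
    return 1.0 - (1.0 - eps) ** p


def smallest_dimension_exceeding(eps: float, threshold: float = 0.5) -> int:
    """Smallest dimension p at which the expected fraction of rows with at
    least one contaminated cell exceeds `threshold` (hypothetical helper)."""
    if not 0.0 < eps < 1.0:
        raise ValueError("eps must lie strictly between 0 and 1")
    if not 0.0 <= threshold < 1.0:
        raise ValueError("threshold must lie in [0, 1)")
    p = 1
    while contaminated_row_fraction(eps, p) <= threshold:
        p += 1
    return p


if __name__ == "__main__":
    eps = 0.05  # illustrative per-cell contamination rate (an assumption)
    for p in (1, 5, 10, 20, 50):
        print(p, round(contaminated_row_fraction(eps, p), 3))
    print("majority of rows affected from p =", smallest_dimension_exceeding(eps))
```

Under these assumptions, a per-cell rate far below 50% can still leave more than half of the rows touched by contamination once the dimension is moderately large, which is the regime the abstract describes.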
Cites work
- scientific article; zbMATH DE number 3842984
- scientific article; zbMATH DE number 3954047
- scientific article; zbMATH DE number 621792
- scientific article; zbMATH DE number 2058051
- Asymptotic behaviour of S-estimates of multivariate location parameters and dispersion matrices
- Breakdown Robustness of Tests
- Breakdown points of affine equivariant estimators of multivariate location and covariance matrices
- Constrained \(M\)-estimation for multivariate location and scatter
- Least Median of Squares Regression
- Lower bounds for contamination bias: Globally minimax versus locally linear estimation
- Min-max bias robust regression
- Multivariate τ-Estimators for Location and Scatter
- On the relation between S-estimators and M-estimators of multivariate location and covariance
- On the uniqueness of \(S\)-functionals and \(M\)-functionals under nonelliptical distributions.
- Robust Estimation of a Location Parameter
- Robust factor analysis.
- Robust M-estimators of multivariate location and scatter
- Robust singular value decomposition analysis of microarray data
- The Future of Data Analysis
Cited in (62)
- Detecting Deviating Data Cells
- Robust estimation of precision matrices under cellwise contamination
- Discussion of: "The power of monitoring: how to make the most of a contaminated multivariate sample"
- Snipping for robust \(k\)-means clustering under component-wise contamination
- Multivariate outlier detection in applied data analysis: global, local, compositional and cellwise outliers
- Stahel-Donoho estimators with cellwise weights
- Jump robust daily covariance estimation by disentangling variance and correlation components
- Simultaneous feature selection and outlier detection with optimality guarantees
- High-dimensional robust precision matrix estimation: cellwise corruption under \(\epsilon\)-contamination
- Detection and correction of outliers in the bivariate chain-ladder method
- Multidimensional outlier-proneness of dependent data and the extremal index
- Comments on: "Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination"
- Sparse regression for large data sets with outliers
- Robust variable selection under cellwise contamination
- Robust regression with compositional covariates including cellwise outliers
- MacroPCA: An All-in-One PCA Method Allowing for Missing Values as Well as Cellwise and Rowwise Outliers
- Stahel-Donoho estimation for high-dimensional data
- Quantitative robustness of instance ranking problems
- Resistant estimates for high dimensional and functional data based on random projections
- Multivariate outlier detection based on a robust Mahalanobis distance with shrinkage estimators
- Robust estimation of the hierarchical model for responses and response times
- Comments on "Data science, big data and statistics"
- A discussion on the robust vector autoregressive models: novel evidence from safe haven assets
- Robust statistics: a selective overview and new directions
- Robust and sparse estimation of graphical models based on multivariate winsorization
- Robust estimation of general linear mixed effects models
- The Cellwise Minimum Covariance Determinant Estimator
- Sparse Principal Component Analysis Based on Least Trimmed Squares
- Cluster analysis with cellwise trimming and applications for the robust clustering of curves
- Cellwise outlier detection with false discovery rate control
- Multivariate location and scatter matrix estimation under cellwise and casewise contamination
- Robust Multivariate Lasso Regression with Covariance Estimation
- Robust estimation of AR coefficients under simultaneously influencing outliers and missing values
- RDELA -- a Delaunay-triangulation-based, location and covariance estimator with high breakdown point
- Gini's mean difference and variance as measures of finite populations scales
- Robust regression estimation and inference in the presence of cellwise and casewise contamination
- Asymptotic linear expansion of regularized M-estimators
- Outlier detection via a minimum ridge covariance determinant estimator
- CR-Lasso: robust cellwise regularized sparse regression
- Robust tools for the imperfect world
- Robust correlation scaled principal component regression
- A novel robust estimation for high-dimensional precision matrices
- Fast Robust Location and Scatter Estimation: A Depth-based Method
- Robust regression estimation and variable selection when cellwise and casewise outliers are present
- Robust and sparse estimation of the inverse covariance matrix using rank correlation measures
- Robust Multivariate Functional Control Chart
- Robust clustering based on trimming
- Multiple scaled contaminated normal distribution and its application in clustering
- Robust nonlinear principal components
- The power of (extended) monitoring in robust clustering. Discussion of "The power of monitoring: how to make the most of a contaminated multivariate sample"
- Robust and sparse logistic regression
- Robust VIF regression with application to variable selection in large data sets
- Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
- Robust multivariate estimation based on statistical depth filters
- Robust principal component analysis based on pairwise correlation estimators
- Fast Robust Correlation for High-Dimensional Data
- The Gaussian rank correlation estimator: robustness properties
- Multivariate Outliers and the O3 Plot
- The shooting S-estimator for robust regression
- Outlier detection and robust covariance estimation using mathematical programming
- Rejoinder to "Multivariate functional outlier detection"
- Comments on: "Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination"