Abstract: Independence screening is a variable selection method that uses a ranking criterion to select significant variables, particularly for statistical models with nonpolynomial dimensionality or "large p, small n" paradigms in which p can grow exponentially with the sample size n. In this paper we propose a robust rank correlation screening (RRCS) method to deal with ultra-high dimensional data. The new procedure is based on the Kendall \(\tau\) correlation coefficient between response and predictor variables rather than the Pearson correlation used by existing methods. The new method has four desirable features compared with existing independence screening methods. First, the sure independence screening property holds under only the existence of a second-order moment of the predictor variables, rather than exponential tails or similar conditions, even when the number of predictor variables grows exponentially with the sample size. Second, it can be used to deal with semiparametric models such as transformation regression models and single-index models under a monotonicity constraint on the link function, without involving nonparametric estimation even when there are nonparametric functions in the models. Third, the procedure is robust against outliers and influence points in the observations. Last, the use of indicator functions in rank correlation screening greatly simplifies the theoretical derivation, owing to the boundedness of the resulting statistics, compared with previous studies on variable screening. Simulations are carried out for comparisons with existing methods, and a real data example is analyzed.
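The screening idea in the abstract can be illustrated with a minimal sketch: compute a Kendall-type rank correlation between the response and each predictor, then keep the predictors with the largest absolute values. This is not the authors' implementation. The function names `kendall_tau` and `rank_screen`, the use of the plain tau-a statistic, the fixed cutoff `d`, and the toy data are all assumptions made for illustration; the paper's exact statistic and threshold rule are not given in this record.

```python
import math
import random


def kendall_tau(x, y):
    """Kendall tau-a: (concordant pairs - discordant pairs) / total pairs.

    Assumption: plain tau-a with ties counted as zero; the paper's exact
    statistic may be normalized differently.
    """
    n = len(x)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            if prod > 0:
                s += 1
            elif prod < 0:
                s -= 1
    return s / (n * (n - 1) / 2)


def rank_screen(X, y, d):
    """Rank the columns of X by |tau| with y and keep the top d.

    X is a list of n rows, each with p entries. Returns the selected
    column indices (best first) and the list of all p scores.
    The cutoff d is a hypothetical tuning parameter.
    """
    p = len(X[0])
    scores = [abs(kendall_tau([row[k] for row in X], y)) for k in range(p)]
    order = sorted(range(p), key=lambda k: scores[k], reverse=True)
    return order[:d], scores


# Toy data (an assumption, not from the paper): p > n, two active
# predictors, and a monotone (exponential) transformation of the response.
rng = random.Random(0)
n, p = 60, 200
X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
y = [math.exp(3 * row[0] - 3 * row[1] + rng.gauss(0, 1)) for row in X]

selected, scores = rank_screen(X, y, d=10)
print("selected columns:", selected)
```

Because the statistic depends on the data only through pairwise orderings, applying a strictly increasing transformation to the response leaves every score unchanged, which is one way to see why a monotone link function needs no separate estimation at the screening stage.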
Recommendations
- Sure screening by ranking the canonical correlations
- Some notes on robust sure independence screening
- Censored rank independence screening for high-dimensional survival data
- Robust sure independence screening for ultrahigh dimensional non-normal data
- Robust rank screening for ultrahigh dimensional discriminant analysis
Cites work
- scientific article; zbMATH DE number 43550
- scientific article; zbMATH DE number 42417
- scientific article; zbMATH DE number 845714
- scientific article; zbMATH DE number 3251902
- scientific article; zbMATH DE number 3049708
- A NEW MEASURE OF RANK CORRELATION
- A Selective Overview of Variable Selection in High Dimensional Feature Space (Invited Review Article)
- A Statistical View of Some Chemometrics Regression Tools
- A unified approach to model selection and sparse recovery using regularized least squares
- An Analysis of Transformations Revisited
- An introduction to copulas.
- Asymptotic properties of bridge estimators in sparse high-dimensional regression models
- Behavior of the NORTA method for correlated random vector generation as the dimension increases
- Efficient Bayesian inference for Gaussian copula regression models
- Efficient estimation in the bivariate normal copula model: Normal margins are least favourable
- Estimates of the Regression Coefficient Based on Kendall's Tau
- Factor profiled sure independence screening
- High-dimensional generalized linear models and the lasso
- Least angle regression. (With discussion)
- Model-free feature screening for ultrahigh-dimensional data
- Non-parametric analysis of a generalized regression model. The maximum rank correlation estimator
- Nonconcave Penalized Likelihood With NP-Dimensionality
- Nonconcave penalized M-estimation with a diverging number of parameters
- Nonconcave penalized likelihood with a diverging number of parameters.
- Nonparametric independence screening in sparse ultra-high-dimensional additive models
- RANK AND PRODUCT-MOMENT CORRELATION
- Regularization and Variable Selection Via the Elastic Net
- Rejoinder: One-step sparse estimates in nonconcave penalized likelihood models
- Robust Statistics
- Robust rank correlation based screening
- Sliced Inverse Regression for Dimension Reduction
- Smoothed rank correlation of the linear transformation regression model
- Statistical challenges with high dimensionality: feature selection in knowledge discovery
- Sure independence screening in generalized linear models with NP-dimensionality
- The Adaptive Lasso and Its Oracle Properties
- The Dantzig selector: statistical estimation when \(p\) is much larger than \(n\). (With discussions and rejoinder).
- Ultrahigh dimensional feature selection: beyond the linear model
- Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
Cited in (first 100 items shown)
- SCAD‐penalized quantile regression for high‐dimensional data analysis and variable selection
- BOLT-SSI: A Statistical Approach to Screening Interaction Effects for Ultra-High Dimensional Data
- Stable feature screening for ultrahigh dimensional data
- Robust Feature Screening via Distance Correlation for Ultrahigh Dimensional Data With Responses Missing at Random
- Adaptive model-free sure independence screening
- Quantile Correlation-based Variable Selection
- Some notes on robust sure independence screening
- Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
- Robust group variable screening based on maximum Lq-likelihood estimation
- Statistical inference for nonignorable missing-data problems: a selective review
- Nonparametric independence feature screening for ultrahigh-dimensional survival data
- Independence index sufficient variable screening for categorical responses
- Robust model-free feature screening for ultrahigh dimensional surrogate data
- Grouped feature screening for ultra-high dimensional data for the classification model
- Measuring and testing for interval quantile dependence
- Gini correlation for feature screening
- Feature screening for ultrahigh-dimensional survival data when failure indicators are missing at random
- Conditional sure independence screening by conditional marginal empirical likelihood
- Conditional-quantile screening for ultrahigh-dimensional survival data via martingale difference correlation
- Model-free feature screening for high-dimensional survival data
- Sure screening by ranking the canonical correlations
- Lq-based robust analytics on ultrahigh and high dimensional data
- Interaction screening via Kendall's rank correlation for imbalanced multi-class classification
- Robust model-free feature screening via quantile correlation
- Fast stepwise regression based on multidimensional indexes
- Feature screening of quadratic inference functions for ultrahigh dimensional longitudinal data
- Projection correlation between scalar and vector variables and its use in feature screening with multi-response data
- Model-free feature screening via distance correlation for ultrahigh dimensional survival data
- Conditional screening for ultrahigh-dimensional survival data in case-cohort studies
- Censored mean variance sure independence screening for ultrahigh dimensional survival data
- A dimension reduction based approach for estimation and variable selection in partially linear single-index models with high-dimensional covariates
- Least-Square Approximation for a Distributed System
- Independent feature screening for ultrahigh-dimensional models with interactions
- Adaptive sufficient sparse clustering by controlling false discovery
- Model-free feature screening for ultrahigh dimensional data via a Pearson chi-square based index
- Prior Knowledge Guided Ultra-High Dimensional Variable Screening With Application to Neuroimaging Data
- Nonparametric independence feature screening for ultrahigh-dimensional missing data
- Greedy forward regression for variable screening
- Model-free conditional screening via conditional distance correlation
- Stable correlation and robust feature screening
- Rank-based score tests for high-dimensional regression coefficients
- Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models
- A fast adaptive Lasso for the cox regression via safe screening rules
- Joint feature screening for ultra-high-dimensional sparse additive hazards model by the sparsity-restricted pseudo-score estimator
- Distributed parameter estimation framework based on moment method
- Stab-GKnock: controlled variable selection for partially linear models using generalized knockoffs
- Robust feature screening for multi-response trans-elliptical regression model with ultrahigh-dimensional covariates
- Feature screening under missing indicator imputation with non-ignorable missing response
- A sure independence screening procedure for ultra-high dimensional partially linear additive models
- A robust variable screening method for high-dimensional data
- Unified model-free interaction screening via CV-entropy filter
- Distribution-free and model-free multivariate feature screening via multivariate rank distance correlation
- Robust feature screening for varying coefficient models via quantile partial correlation
- Profile forward regression screening for ultra-high dimensional semiparametric varying coefficient partially linear models
- Threshold Selection in Feature Screening for Error Rate Control
- Score test variable screening
- Uniform joint screening for ultra-high dimensional graphical models
- Robust sure independence screening for ultrahigh dimensional non-normal data
- Nonparametric feature screening
- Variable selection for covariate adjusted regression model
- Consistent Screening Procedures in High-dimensional Binary Classification
- A data-driven approach to conditional screening of high-dimensional variables
- High-dimensional variable screening under multicollinearity
- Robust rank screening for ultrahigh dimensional discriminant analysis
- Robust feature screening procedures for single and mixed types of data
- Composite coefficient of determination and its application in ultrahigh dimensional variable screening
- Distributed feature screening via componentwise debiasing
- Ultra-high dimensional variable screening via Gram-Schmidt orthogonalization
- Large-Scale Correlation Screening
- Nonparametric screening under conditional strictly convex loss for ultrahigh dimensional sparse data
- A simplified algorithm for identifying abnormal changes in dynamic networks
- Model averaging estimation for generalized partially linear varying-coefficient models
- Entropy-based model-free feature screening for ultrahigh-dimensional multiclass classification
- Sure feature screening for high-dimensional dichotomous classification
- Robust screening under ambiguity
- Feature Screening for Massive Data Analysis by Subsampling
- Tests for high-dimensional single-index models
- The main contributions of robust statistics to statistical science and a new challenge
- Robust sure independence screening for nonpolynomial dimensional generalized linear models
- A selective overview of feature screening for ultrahigh-dimensional data
- Variable selection for high dimensional Gaussian copula regression model: an adaptive hypothesis testing procedure
- Variable screening for ultrahigh dimensional censored quantile regression
- Conditional characteristic feature screening for massive imbalanced data
- Ranking-based variable selection for high-dimensional data
- A note of feature screening via a rank-based coefficient of correlation
- Covariance-insured screening
- Feature screening in ultrahigh-dimensional partially linear models with missing responses at random
- Regression adjustment for treatment effect with multicollinearity in high dimensions
- The Kendall interaction filter for variable interaction screening in high dimensional classification problems
- Model-free variable selection for conditional mean in regression
- A scalable surrogate L₀ sparse regression method for generalized linear models with applications to large scale data
- A modified mean-variance feature-screening procedure for ultrahigh-dimensional discriminant analysis
- Group feature screening via the F statistic
- Are Latent Factor Regression and Sparse Regression Adequate?
- Modified SCAD penalty for constrained variable selection problems
- Feature selection for varying coefficient models with ultrahigh-dimensional covariates
- Model-free conditional independence feature screening for ultrahigh dimensional data
- Model-free feature screening for ultrahigh dimensional censored regression
- Revisiting feature selection for linear models with FDR and power guarantees
- Dynamic tilted current correlation for high dimensional variable screening
This page was built for publication: Robust rank correlation based screening
MaRDI item Q693749