Farmtest: factor-adjusted robust multiple testing with approximate false discovery control
From MaRDI portal
Publication:5208092
Abstract: Large-scale multiple testing with correlated and heavy-tailed data arises in a wide range of research areas from genomics, medical imaging to finance. Conventional methods for estimating the false discovery proportion (FDP) often ignore the effect of heavy-tailedness and the dependence structure among test statistics, and thus may lead to inefficient or even inconsistent estimation. Also, the commonly imposed joint normality assumption is arguably too stringent for many applications. To address these challenges, in this paper we propose a Factor-Adjusted Robust Multiple Testing (FarmTest) procedure for large-scale simultaneous inference with control of the false discovery proportion. We demonstrate that robust factor adjustments are extremely important in both controlling the FDP and improving the power. We identify general conditions under which the proposed method produces consistent estimate of the FDP. As a byproduct that is of independent interest, we establish an exponential-type deviation inequality for a robust -type covariance estimator under the spectral norm. Extensive numerical experiments demonstrate the advantage of the proposed method over several state-of-the-art methods especially when the data are generated from heavy-tailed distributions. The proposed procedures are implemented in the R-package FarmTest.
Recommendations
- Estimating false discovery proportion under arbitrary covariance dependence
- A factor model approach to multiple testing under dependence
- Estimation of the false discovery proportion with unknown dependence
- Robustness of multiple testing procedures against dependence
- Control of the FWER in Multiple Testing Under Dependence
Cites work
- scientific article; zbMATH DE number 720689 (Why is no real title available?)
- scientific article; zbMATH DE number 3396952 (Why is no real title available?)
- A Direct Approach to False Discovery Rates
- A factor model approach to multiple testing under dependence
- A general framework for multiple testing dependence
- A new perspective on robust \(M\)-estimation: finite sample theory and applications to dependence-adjusted multiple testing
- A stochastic process approach to false discovery control.
- A useful variant of the Davis-Kahan theorem for statisticians
- Adaptive Huber Regression
- Adaptive false discovery rate control under independence and dependence
- Asymptotics of empirical eigenstructure for high dimensional spiked covariance
- Asymptotics of the principal components estimator of large factor models with weakly influential factors
- Challenging the empirical mean and empirical variance: a deviation study
- Confounder adjustment in multiple hypothesis testing
- Correlated \(z\)-values and the accuracy of large-scale statistical estimates
- Correlation and Large-Scale Simultaneous Significance Testing
- Cross-dimensional inference of dependent high-dimensional data
- Determining the Number of Factors in Approximate Factor Models
- Eigenvalue ratio test for the number of factors
- Empirical properties of asset returns: stylized facts and statistical issues
- Error Distribution for Gene Expression Data
- Estimating false discovery proportion under arbitrary covariance dependence
- Estimating the Null and the Proportion of Nonnull Effects in Large-Scale Multiple Comparisons
- Estimating the Proportion of True Null Hypotheses, with application to DNA Microarray Data
- Estimating the proportion of false null hypotheses among a large number of independently tested hypotheses
- Estimation of High Dimensional Mean Regression in the Absence of Symmetry and Light Tail Assumptions
- Estimation of the false discovery proportion with unknown dependence
- Factor modeling for high-dimensional time series: inference for the number of factors
- Forecasting Using Principal Components From a Large Number of Predictors
- Generalizations of the familywise error rate
- High-dimensional probability. An introduction with applications in data science
- Inferential Theory for Factor Models of Large Dimensions
- Large covariance estimation by thresholding principal orthogonal complements. With discussion and authors' reply
- Large-scale multiple testing under dependence
- On false discovery control under dependence
- On the Benjamini-Hochberg method
- On the performance of FDR control: constraints and a partial solution
- Phase transition and regularized bootstrap in large-scale \(t\)-tests with false discovery rate control
- Proportion of Non-Zero Normal Means: Universal Oracle Equivalences and Uniformly Consistent Estimators
- Robust Estimation of a Location Parameter
- Robustness and accuracy of methods for high dimensional data analysis based on Student's \(t\)-statistic
- Robustness of multiple testing procedures against dependence
- Statistical analysis of factor models of high dimension
- Strong Control, Conservative Point Estimation and Simultaneous Conservative Consistency of False Discovery Rates: A Unified Approach
- Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries
- The analysis of gene expression data. Methods and software
- The control of the false discovery rate in multiple testing under dependency.
- The effect of correlation in false discovery rate estimation
- The statistics and mathematics of high dimension low sample size asymptotics
- Variance of the Number of False Discoveries
Cited in
(21)- Gaussian differentially private robust mean estimation and inference
- Skilled Mutual Fund Selection: False Discovery Control Under Dependence
- Learning Latent Factors From Diversified Projections and Its Applications to Over-Estimated and Weak Factors
- Robust high-dimensional tuning free multiple testing
- scientific article; zbMATH DE number 7370530 (Why is no real title available?)
- High-dimensional two-sample mean vectors test and support recovery with factor adjustment
- Asymptotic false discovery control of the Benjamini-Hochberg procedure for pairwise comparisons
- Robust high-dimensional factor models with applications to statistical machine learning
- Multiple two-sample testing under arbitrary covariance dependency with an application in imaging mass spectrometry
- Posterior consistency of factor dimensionality in high-dimensional sparse factor models
- Overview of research advance for knockoff methods
- Non-asymptotic properties of spectral decomposition of large Gram-type matrices and applications
- A central limit theorem for the Benjamini-Hochberg false discovery proportion under a factor model
- Large-Scale Inference of Multivariate Regression for Heavy-Tailed and Asymmetric Data
- Confounder adjustment in multiple hypothesis testing
- Model-Free Feature Screening and FDR Control With Knockoff Features
- FarmTest
- Noisy matrix completion: understanding statistical guarantees for convex relaxation via nonconvex optimization
- Robust projected principal component analysis for large-dimensional semiparametric factor modeling
- Robust factor number specification for large-dimensional elliptical factor model
- Test for Market Timing Using Daily Fund Returns
This page was built for publication: Farmtest: factor-adjusted robust multiple testing with approximate false discovery control
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5208092)