Distributions associated with simultaneous multiple hypothesis testing
From MaRDI portal
Publication:2040901
Abstract: We develop the distribution of the number of hypotheses found to be statistically significant using the rule from Benjamini and Hochberg (1995) for controlling the false discovery rate (FDR). This distribution has both a small sample form and an asymptotic expression for testing many independent hypotheses simultaneously. We propose a parametric distribution to approximate the marginal distribution of p-values under a non-uniform alternative hypothesis. This distribution is useful when there are many different alternative hypotheses and these are not individually well understood. We fit to data from three cancer studies and use it to illustrate the distribution of the number of notable hypotheses observed in these examples. We model dependence of sampled p-values using a copula model and a latent variable approach. These methods can be combined to illustrate a power analysis in planning a large study on the basis of a smaller pilot study. We show the number of statistically significant p-values behaves approximately as a mixture of a normal and the Borel-Tanner distribution.
Recommendations
- Deriving and comparing the distribution for the number of false positives in single step methods to control \(k\)-FWER
- Control of generalized error rates in multiple testing
- Multiple hypotheses testing and expected number of type I errors
- An extended Simes' test
- The Simes Method for Multiple Hypothesis Testing With Positively Dependent Test Statistics
Cites work
- scientific article; zbMATH DE number 3165668 (Why is no real title available?)
- scientific article; zbMATH DE number 720689 (Why is no real title available?)
- A Bayesian method for combining statistical tests
- A derivation of the Borel distribution
- A factor model approach to multiple testing under dependence
- A parametric model to estimate the proportion from true null using a distribution for \(p\)-values
- A stochastic process approach to false discovery control.
- An improved Bonferroni procedure for multiple tests of significance
- Discovering the False Discovery Rate
- Empirical Bayes Analysis of a Microarray Experiment
- Estimating the Null and the Proportion of Nonnull Effects in Large-Scale Multiple Comparisons
- Estimating the Proportion of True Null Hypotheses, with application to DNA Microarray Data
- Higher criticism for detecting sparse heterogeneous mixtures.
- Large-Scale Simultaneous Hypothesis Testing
- Large-scale multiple testing under dependence
- Nonparametric Bayesian Estimation of Positive False Discovery Rates
- On false discovery control under dependence
- Statistical significance for genomewide studies
- Variance of the Number of False Discoveries
Cited in
(6)- Multiple comparisons and sums of dissociated random variables
- Illustrations on using the distribution of a \(P\)-value in high dimensional data analyses
- Simultaneous Testing of Grouped Hypotheses: Finding Needles in Multiple Haystacks
- Simultaneous testing of multiple hypotheses using generalized \(p\)-values
- Distribution-free multiple testing
- The joint null criterion for multiple hypothesis tests
This page was built for publication: Distributions associated with simultaneous multiple hypothesis testing
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2040901)