Distributions associated with simultaneous multiple hypothesis testing

Publication:2040901

DOI: 10.1186/S40488-020-00109-6 | zbMATH Open: 1472.62156 | arXiv: 1802.09018 | OpenAlex: W3094101422 | MaRDI QID: Q2040901 | FDO: Q2040901


Authors: Chang Yu, Daniel Zelterman


Publication date: 14 July 2021

Published in: Journal of Statistical Distributions and Applications

Abstract: We develop the distribution of the number of hypotheses found to be statistically significant using the rule from Benjamini and Hochberg (1995) for controlling the false discovery rate (FDR). This distribution has both a small-sample form and an asymptotic expression for testing many independent hypotheses simultaneously. We propose a parametric distribution $\Psi_I(\cdot)$ to approximate the marginal distribution of p-values under a non-uniform alternative hypothesis. This distribution is useful when there are many different alternative hypotheses and these are not individually well understood. We fit $\Psi_I$ to data from three cancer studies and use it to illustrate the distribution of the number of notable hypotheses observed in these examples. We model dependence of sampled p-values using a copula model and a latent variable approach. These methods can be combined to illustrate a power analysis in planning a large study on the basis of a smaller pilot study. We show the number of statistically significant p-values behaves approximately as a mixture of a normal and the Borel-Tanner distribution.
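
The quantity studied in the abstract, the number of hypotheses declared significant by the Benjamini and Hochberg (1995) rule, can be illustrated with a minimal sketch. The function names `bh_rejections` and `simulate_counts`, the choice of m = 100 tests, and the use of independent uniform p-values (the global null) are assumptions made for illustration only. This is not the authors' code, and it does not implement the $\Psi_I$ model, the copula dependence, or the Borel-Tanner approximation.

```python
import random


def bh_rejections(pvalues, alpha=0.05):
    """Count hypotheses rejected by the Benjamini-Hochberg step-up rule.

    With ordered p-values p_(1) <= ... <= p_(m), the rule rejects the k
    smallest, where k is the largest i such that p_(i) <= alpha * i / m.
    """
    m = len(pvalues)
    ordered = sorted(pvalues)
    k = 0
    for i, p in enumerate(ordered, start=1):
        if p <= alpha * i / m:
            k = i  # keep the largest index that meets its threshold
    return k


def simulate_counts(m=100, alpha=0.05, reps=2000, seed=0):
    """Empirical distribution of the rejection count.

    Assumption: all m p-values are independent Uniform(0, 1), i.e. every
    null hypothesis is true. Returns a dict {count: frequency}.
    """
    rng = random.Random(seed)
    counts = {}
    for _ in range(reps):
        pvals = [rng.random() for _ in range(m)]
        k = bh_rejections(pvals, alpha)
        counts[k] = counts.get(k, 0) + 1
    return counts


if __name__ == "__main__":
    for k, freq in sorted(simulate_counts().items()):
        print(k, freq)
```

Replacing the uniform draws with p-values sampled from a non-uniform alternative would give an empirical version of the rejection-count distribution under that alternative; the specific form of such an alternative is left to the paper.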


Full work available at URL: https://arxiv.org/abs/1802.09018










Cited In (6)






