Multiple testing with heterogeneous multinomial distributions

From MaRDI portal
Publication:5283311

DOI10.1111/BIOM.12586zbMATH Open1372.62067arXiv1511.01400OpenAlexW2278602278WikidataQ39418931 ScholiaQ39418931MaRDI QIDQ5283311FDOQ5283311


Authors:


Publication date: 21 July 2017

Published in: Biometrics (Search for Journal in Brave)

Abstract: False discovery rate (FDR) procedures provide misleading inference when testing multiple null hypotheses with heterogeneous multinomial data. For example, in the motivating study the goal is to identify species of bacteria near the roots of wheat plants (rhizobacteria) that are associated with productivity, but standard procedures discover the most abundant species even when the association is weak or negligible, and fail to discover strong associations when species are not abundant. Consequently, a list of abundant species is produced by the multiple testing procedure even though the goal was to provide a list of producitivity-associated species. This paper provides an FDR method based on a mixture of multinomial distributions and shows that it tends to discover more non-negligible effects and fewer negligible effects when the data are heterogeneous across tests. The proposed method and competing methods are applied to the motivating data. The new method identifies more species that are strongly associated with productivity and identifies fewer species that are weakly associated with productivity.


Full work available at URL: https://arxiv.org/abs/1511.01400




Recommendations




Cites Work


Cited In (12)





This page was built for publication: Multiple testing with heterogeneous multinomial distributions

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5283311)