Querying multiple sets of p-values through composed hypothesis testing


DOI: 10.48550/ARXIV.2104.14601, arXiv: 2104.14601, MaRDI QID: Q126752, FDO: Q126752

Stéphane Robin, Indranil Mukhopadhyay, Sarmistha Das, Tristan Mary-Huard

Publication date: 29 April 2021

Abstract: Motivation: Combining the results of different experiments to exhibit complex patterns or to improve statistical power is a typical aim of data integration. The starting point of the statistical analysis often comes as sets of p-values resulting from previous analyses, which need to be combined in a flexible way to explore complex hypotheses while guaranteeing a low proportion of false discoveries. Results: We introduce the generic concept of a composed hypothesis, which corresponds to an arbitrarily complex combination of simple hypotheses. We rephrase the problem of testing a composed hypothesis as a classification task, and show that finding items for which the composed null hypothesis is rejected boils down to fitting a mixture model and classifying the items according to their posterior probabilities. We show that inference can be performed efficiently and provide a thorough classification rule to control for type I error. The performance and usefulness of the approach are illustrated on simulations and on two different applications. The method is scalable, does not require any parameter tuning, and provided valuable biological insight in the application cases considered. Availability: The QCH methodology is implemented in the qch R package hosted on CRAN.
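
The classification step described in the abstract can be illustrated with a minimal Python sketch. It is not the qch package API, and it skips the mixture-model fit: it assumes each item already comes with posterior probabilities over the configurations of simple hypotheses (null or alternative for each test). The function names, the "at least `min_alternatives` alternatives" form of the composed hypothesis, the toy posteriors, and the selection rule (a common rule that keeps the average null posterior of the selected items below a level `alpha`) are all assumptions made for illustration; the rule in the paper may differ.

```python
from itertools import product


def composed_null_posterior(config_posteriors, num_tests, min_alternatives):
    """Posterior probability of the composed null for one item.

    Assumed composed null: fewer than `min_alternatives` of the `num_tests`
    simple hypotheses are under the alternative.
    `config_posteriors` maps a configuration (tuple of 0/1 flags, one per
    test, 1 meaning "alternative") to its posterior probability.
    """
    return sum(
        config_posteriors.get(config, 0.0)
        for config in product((0, 1), repeat=num_tests)
        if sum(config) < min_alternatives
    )


def select_items(null_posteriors, alpha):
    """Return indices of items for which the composed null is rejected.

    Items are taken by increasing null posterior for as long as the running
    mean of those posteriors stays at or below `alpha`. The running mean is
    non-decreasing, so stopping at the first violation gives the largest set.
    """
    order = sorted(range(len(null_posteriors)), key=lambda i: null_posteriors[i])
    selected, running_sum = [], 0.0
    for rank, i in enumerate(order, start=1):
        running_sum += null_posteriors[i]
        if running_sum / rank > alpha:
            break
        selected.append(i)
    return selected


# Toy posteriors for three items and two tests (made-up numbers).
items = [
    {(1, 1): 0.97, (1, 0): 0.02, (0, 1): 0.01, (0, 0): 0.00},
    {(1, 1): 0.10, (1, 0): 0.60, (0, 1): 0.10, (0, 0): 0.20},
    {(1, 1): 0.92, (1, 0): 0.03, (0, 1): 0.03, (0, 0): 0.02},
]
# Composed alternative here: both simple hypotheses are under the alternative.
null_post = [
    composed_null_posterior(p, num_tests=2, min_alternatives=2) for p in items
]
selected = select_items(null_post, alpha=0.1)
print(selected)
```

With these toy numbers the composed null posteriors are roughly 0.03, 0.90 and 0.08, so the first and third items are selected at `alpha=0.1`.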













