Truncated rank-based tests for two-part models with excessive zeros and applications to microbiome data
From MaRDI portal
Publication:6104147
DOI10.1214/22-AOAS1688arXiv2110.05368MaRDI QIDQ6104147FDOQ6104147
Authors:
Publication date: 5 June 2023
Published in: The Annals of Applied Statistics (Search for Journal in Brave)
Abstract: High-throughput sequencing technology allows us to test the compositional difference of bacteria in different populations. One important feature of human microbiome data is that it often includes a large number of zeros. Such data can be treated as being generated from a two-part model that includes a zero point-mass. Motivated by analysis of such non-negative data with excessive zeros, we introduce several truncated rank-based two-group and multi-group tests for such data, including a truncated rank-based Wilcoxon rank-sum test for two-group comparison and two truncated Kruskal-Wallis tests for multi-group comparison. We show both analytically through asymptotic relative efficiency analysis and by simulations that the proposed tests have higher power than the standard rank-based tests, especially when the proportion of zeros in the data is high. The tests can also be applied to repeated measurements of compositional data via simple within-subject permutations. In a simple before-and-after treatment experiment, the within-subject permutation is similar to the paired rank test. However, the proposed tests handle the excessive zeros, which leads to a better power. We apply the tests to the analysis of a gut microbiome data set to compare the microbiome compositions of healthy and pediatric Crohn's disease patients and to assess the treatment effects on microbiome compositions. We identify several bacterial genera that are missed by the standard rank-based tests.
Full work available at URL: https://arxiv.org/abs/2110.05368
Recommendations
- A class of rank-based tests for doubly-truncated data
- scientific article; zbMATH DE number 1247687
- Two sample rank tests under a random truncation model
- Two-sample rank tests with truncated populations
- A class of semiparametric rank-based tests for right-truncated data
- On linear rank tests for truncated binomial randomization
- A class of rank-based test for left-truncated and right-censored data
Cites Work
This page was built for publication: Truncated rank-based tests for two-part models with excessive zeros and applications to microbiome data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6104147)