Estimating diversity via frequency ratios

From MaRDI portal
Publication:88012

DOI10.1111/BIOM.12332zbMATH Open1419.62476arXiv1408.3333OpenAlexW1903024071WikidataQ35650618 ScholiaQ35650618MaRDI QIDQ88012FDOQ88012


Authors: Amy Willis, John Bunge Edit this on Wikidata


Publication date: 2 June 2015

Published in: Biometrics (Search for Journal in Brave)

Abstract: We wish to estimate the total number of classes in a population based on sample counts, especially in the presence of high latent diversity. Drawing on probability theory that characterizes distributions on the integers by ratios of consecutive probabilities, we construct a nonlinear regression model for the ratios of consecutive frequency counts. This allows us to predict the unobserved count and hence estimate the total diversity. We believe that this is the first approach to depart from the classical mixed Poisson model in this problem. Our method is geometrically intuitive and yields good fits to data with reasonable standard errors. It is especially well-suited to analyzing high diversity datasets derived from next-generation sequencing in microbial ecology. We demonstrate the method's performance in this context and via simulation, and we present a dataset for which our method outperforms all competitors.


Full work available at URL: https://arxiv.org/abs/1408.3333




Recommendations




Cites Work


Cited In (10)

Uses Software





This page was built for publication: Estimating diversity via frequency ratios

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q88012)