Rediscovery of Good-Turing estimators via Bayesian nonparametrics

From MaRDI portal
Publication:2805189

DOI10.1111/BIOM.12366zbMATH Open1393.62062arXiv1401.0303OpenAlexW1836729566WikidataQ35725181 ScholiaQ35725181MaRDI QIDQ2805189FDOQ2805189


Authors: Bernardo Nipoti, Stefano Favaro, Yee Whye Teh Edit this on Wikidata


Publication date: 10 May 2016

Published in: Biometrics (Search for Journal in Brave)

Abstract: The problem of estimating discovery probabilities originated in the context of statistical ecology, and in recent years it has become popular due to its frequent appearance in challenging applications arising in genetics, bioinformatics, linguistics, designs of experiments, machine learning, etc. A full range of statistical approaches, parametric and nonparametric as well as frequentist and Bayesian, has been proposed for estimating discovery probabilities. In this paper we investigate the relationships between the celebrated Good-Turing approach, which is a frequentist nonparametric approach developed in the 1940s, and a Bayesian nonparametric approach recently introduced in the literature. Specifically, under the assumption of a two parameter Poisson-Dirichlet prior, we show that Bayesian nonparametric estimators of discovery probabilities are asymptotically equivalent, for a large sample size, to suitably smoothed Good-Turing estimators. As a by-product of this result, we introduce and investigate a methodology for deriving exact and asymptotic credible intervals to be associated with the Bayesian nonparametric estimators of discovery probabilities. The proposed methodology is illustrated through a comprehensive simulation study and the analysis of Expressed Sequence Tags data generated by sequencing a benchmark complementary DNA library.


Full work available at URL: https://arxiv.org/abs/1401.0303




Recommendations




Cites Work


Cited In (8)





This page was built for publication: Rediscovery of Good-Turing estimators via Bayesian nonparametrics

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2805189)