Correct ordering in the Zipf-Poisson ensemble

From MaRDI portal
Publication:4904728

DOI10.1080/01621459.2012.734177zbMATH Open1260.62016arXiv1101.2481OpenAlexW2160176524MaRDI QIDQ4904728FDOQ4904728


Authors: Justin S. Dyer, Art B. Owen Edit this on Wikidata


Publication date: 31 January 2013

Published in: Journal of the American Statistical Association (Search for Journal in Brave)

Abstract: We consider a Zipf--Poisson ensemble in which Xisimpoi(Nialpha) for alpha>1 and N>0 and integers ige1. As Noinfty the first n(N) random variables have their proper order X1>X2>...>Xn relative to each other, with probability tending to 1 for n up to (AN/log(N))1/(alpha+2) for an explicit constant A(alpha)ge3/4. The rate N1/(alpha+2) cannot be achieved. The ordering of the first n(N) entities does not preclude Xm>Xn for some interloping m>n. The first random variables are correctly ordered exclusive of any interlopers, with probability tending to 1 if for B<A. For a Zipf--Poisson model of the British National Corpus, which has a total word count of 100,000,000, our result estimates that the 72 words with the highest counts are properly ordered.


Full work available at URL: https://arxiv.org/abs/1101.2481




Recommendations




Cites Work


Cited In (3)





This page was built for publication: Correct ordering in the Zipf-Poisson ensemble

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4904728)