Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications (Q502868)
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications | scientific article | |
Statements
Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications (English)
11 January 2017
Suppose \(U_1, U_2, \ldots, U_n\) are i.i.d. observations from a fixed but unknown distribution \((p_j)_{j=1}^\infty\) over the positive integers, and let \(X_{n,j} = \sum_{i=1}^n \mathbb{I}_{\{U_i=j\}}\) be the number of times that the symbol \(j\) occurs in a sample of size \(n\). The occupancy count \(K_{n,r}=\sum_{j=1}^\infty \mathbb{I}_{\{X_{n,j}=r\}}\) is the number of symbols that appear exactly \(r\) times in a sample of size \(n\). This paper shows that occupancy counts and several related random quantities satisfy Bernstein-type concentration inequalities.
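The definitions above can be illustrated with a minimal Python sketch. It is not taken from the paper. The function names `occupancy_counts`, `missing_mass`, `truncated_geometric` and `draw_sample` are hypothetical. The missing mass is computed under its usual definition, \(\sum_j p_j \mathbb{I}_{\{X_{n,j}=0\}}\), which the review text itself does not spell out. A finite truncation of a geometric distribution stands in for the infinite urn scheme.

```python
import random
from collections import Counter


def occupancy_counts(sample):
    """Return a mapping r -> K_{n,r} for r >= 1.

    K_{n,r} is the number of distinct symbols seen exactly r times.
    """
    symbol_counts = Counter(sample)          # X_{n,j} for every observed j
    return Counter(symbol_counts.values())   # how many symbols share each count r


def missing_mass(sample, p):
    """Total probability of the symbols that never appear in the sample.

    Assumes the usual definition sum_j p_j * 1{X_{n,j} = 0}; `p` maps
    each symbol j to its probability p_j.
    """
    seen = set(sample)
    return sum(pj for j, pj in p.items() if j not in seen)


def truncated_geometric(num_symbols):
    """Illustrative distribution: p_j = 2^{-j}, with the tail mass
    lumped into the last symbol so that the probabilities sum to 1."""
    p = {j: 2.0 ** -j for j in range(1, num_symbols)}
    p[num_symbols] = 2.0 ** -(num_symbols - 1)
    return p


def draw_sample(n, p, seed=0):
    """Draw n i.i.d. symbols U_1, ..., U_n from the distribution p."""
    rng = random.Random(seed)
    symbols = list(p)
    weights = [p[j] for j in symbols]
    return rng.choices(symbols, weights=weights, k=n)


if __name__ == "__main__":
    p = truncated_geometric(30)
    sample = draw_sample(1000, p)
    k = occupancy_counts(sample)
    print("K_{n,1} (symbols seen once):", k[1])
    print("missing mass:", missing_mass(sample, p))
```

Since every observation belongs to exactly one symbol, the counts satisfy \(\sum_{r \ge 1} r K_{n,r} = n\), which gives a quick sanity check on any implementation.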
concentration
missing mass
occupancy
rare species
regular variation