Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications (Q502868)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications
scientific article

    Statements

    Concentration inequalities in the infinite urn scheme for occupancy counts and the missing mass, with applications (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    11 January 2017
    0 references
    Suppose \(U_1, U_2, \ldots, U_n\) are i.i.d. observations from a fixed but unknown distribution \((p_j)^\infty_{j=1}\) over the positive integers, let \(X_{n,j} = \sum^n_{i=1} \mathbb{I}_{\{U_i=j\}}\) be the number of times that the symbol \(j\) occurs in a sample of size \(n\). The occupancy counts \(K_{n,r}=\sum^\infty_{j=1} \mathbb{I}_{\{X_{n,j}=r\}}\) are the number of symbols that appear exactly \(r\) times in a sample of size \(n\). This paper shows that, occupancy counts and several related random quantities satisfy Bernstein-type concentration inequalities.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    concentration
    0 references
    missing mass
    0 references
    occupancy
    0 references
    rare species
    0 references
    regular variation
    0 references
    0 references