Calculation of precise constants in a probability model of Zipf's law generation and asymptotics of sums of multinomial coefficients (Q1751551)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Calculation of precise constants in a probability model of Zipf's law generation and asymptotics of sums of multinomial coefficients
scientific article

    Statements

    Calculation of precise constants in a probability model of Zipf's law generation and asymptotics of sums of multinomial coefficients (English)
    0 references
    0 references
    0 references
    25 May 2018
    0 references
    Summary: Let \(\omega_0, \omega_1, \ldots, \omega_n\) be a full set of outcomes (symbols) and let positive \(p_i\), \(i = 0, \ldots, n\), be their probabilities \((\sum_{i = 0}^n p_i = 1)\). Let us treat \(\omega_0\) as a stop symbol; it can occur in sequences of symbols (we call them words) only once, at the very end. The probability of a word is defined as the product of probabilities of its symbols. We consider the list of all possible words sorted in the nonincreasing order of their probabilities. Let \(p \left(r\right)\) be the probability of the \(r\)th word in this list. We prove that if at least one of the ratios \(\log p_i / \log p_j\), \(i, j \in \left\{1, \ldots, n\right\}\), is irrational, then the limit \(\lim_{r \rightarrow \infty} p \left(r\right) / r^{- 1 / \gamma}\) exists and differs from zero; here \(\gamma\) is the root of the equation \(\sum_{i = 1}^n p_i^\gamma = 1\). The limit constant can be expressed (rather easily) in terms of the entropy of the distribution \((p_1^\gamma, \ldots, p_n^\gamma)\).
    0 references
    Zipf's law
    0 references
    Markov chain
    0 references

    Identifiers