Word frequency distributions

From MaRDI portal
Publication:5943510


zbMath0989.68146MaRDI QIDQ5943510

Harald R. Baayen

Publication date: 27 September 2001

Published in: Text, Speech and Language Technology (Search for Journal in Brave)


62P99: Applications of statistics

68T50: Natural language processing

91F20: Linguistics


Related Items

Empirical Bayes estimators of structural distribution of words in Lithuanian texts, Extended truncated Inverse Gaussian–Poisson model, Two halves of a meaningful text are statistically different, Unnamed Item, The Sichel model and the mixing and truncation order, Zipf's law unzipped, A scaling law beyond Zipf's law and its relation to Heaps' law, Zipf's law for randomly generated frequencies: explicit tests for the goodness-of-fit, Simple and efficient classification scheme based on specific vocabulary, Martingale limit theorems of divisible statistics in a multinomial scheme with mixed frequencies, The exact rank-frequency function and size-frequency function of \(N\)-grams and \(N\)-word phrases with applications, A firm foundation for statistical disclosure control, Mineral species frequency distribution conforms to a large number of rare events model: prediction of Earth's missing minerals, Diversity analysis in multiple-choice questionnaires, Modelling of count data using nonparametric mixtures, A language as a self-organized critical system, Calculation of precise constants in a probability model of Zipf's law generation and asymptotics of sums of multinomial coefficients, Extended truncated Tweedie-Poisson model, A computationally efficient approach to estimating species richness and rarefaction curve, Completely monotone distributions: mixing, approximation and estimation of number of species, On recovering a mixed Poisson distribution from its left-truncated version, Relative abundances of mineral species: a statistical measure to characterize Earth-like planets based on Earth's mineralogy, Identifying trends in word frequency dynamics, Bayesian estimation of Earth's undiscovered mineralogical diversity using noninformative priors, On the measure and the estimation of evenness and diversity, Statistical simulation and the distribution of distances between identical elements in a random sequence, On zero-truncating and mixing Poisson distributions, Convergence Properties in Certain Occupancy Problems Including the Karlin-Rouault Law, Scaling laws and fluctuations in the statistics of word frequencies