Nonparametric Bayesian topic modelling with the hierarchical Pitman-Yor processes
Abstract: The Dirichlet process and its extension, the Pitman-Yor process, are stochastic processes that take probability distributions as a parameter. These processes can be stacked up to form a hierarchical nonparametric Bayesian model. In this article, we present efficient methods for the use of these processes in this hierarchical context, and apply them to latent variable models for text analytics. In particular, we propose a general framework for designing these Bayesian models, which are called topic models in the computer science community. We then propose a specific nonparametric Bayesian topic model for modelling text from social media. We focus on tweets (posts on Twitter) in this article due to their ease of access. We find that our nonparametric model performs better than existing parametric models in both goodness of fit and real-world applications.
Recommendations
- The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies
- Hierarchical Dirichlet Processes
- The Pitman-Yor multinomial process for mixture modelling
- Hierarchical topic modeling with nested hierarchical Dirichlet process
- Dynamic hierarchical Dirichlet processes topic model using the power prior approach
Cites work
- scientific article; zbMATH DE number 6377992
- scientific article; zbMATH DE number 5292192
- scientific article; zbMATH DE number 1408945
- scientific article; zbMATH DE number 7646009
- 10.1162/jmlr.2003.3.4-5.993
- A Bayesian analysis of some nonparametric problems
- Bayesian Non-Parametric Inference for Species Variety with a Two-Parameter Poisson–Dirichlet Process Prior
- Bayesian Nonparametrics
- Combinatorial stochastic processes. Ecole d'Eté de Probabilités de Saint-Flour XXXII -- 2002
- Delayed rejection in reversible jump Metropolis-Hastings
- Equation of state calculations by fast computing machines
- Gibbs Sampling Methods for Stick-Breaking Priors
- Hierarchical Dirichlet Processes
- Introduction to Information Retrieval
- Monte Carlo sampling methods using Markov chains and their applications
- On Metropolis-Hastings algorithms with delayed rejection
- Producing power-law distributions and damping word frequencies with two-stage language models
- Reversible jump Markov chain Monte Carlo computation and Bayesian model determination
- Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images
- The Collapsed Gibbs Sampler in Bayesian Computations with Applications to a Gene Regulation Problem
- The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies
- The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator
- Thermodynamic limits of macroeconomic or financial models: one- and two-parameter Poisson-Dirichlet models
Cited in (7)
- Hierarchical topic modeling with nested hierarchical Dirichlet process
- Accurate parameter estimation for Bayesian network classifiers using hierarchical Dirichlet processes
- Fast approximation of variational Bayes Dirichlet process mixture using the maximization-maximization algorithm
- A segmented topic model based on the two-parameter Poisson-Dirichlet process
- Dynamic hierarchical Dirichlet processes topic model using the power prior approach
- A three-way approach for learning rules in automatic knowledge-based topic models
- Hierarchical species sampling models
MaRDI item: Q324682