Feature allocations, probability functions, and paintboxes

From MaRDI portal
Publication:908036

DOI10.1214/13-BA823zbMATH Open1329.62278arXiv1301.6647OpenAlexW2112410777MaRDI QIDQ908036FDOQ908036


Authors: Tamara Broderick, Michael Jordan, Jim Pitman Edit this on Wikidata


Publication date: 2 February 2016

Published in: Bayesian Analysis (Search for Journal in Brave)

Abstract: The problem of inferring a clustering of a data set has been the subject of much research in Bayesian analysis, and there currently exists a solid mathematical foundation for Bayesian approaches to clustering. In particular, the class of probability distributions over partitions of a data set has been characterized in a number of ways, including via exchangeable partition probability functions (EPPFs) and the Kingman paintbox. Here, we develop a generalization of the clustering problem, called feature allocation, where we allow each data point to belong to an arbitrary, non-negative integer number of groups, now called features or topics. We define and study an "exchangeable feature probability function" (EFPF)---analogous to the EPPF in the clustering setting---for certain types of feature models. Moreover, we introduce a "feature paintbox" characterization---analogous to the Kingman paintbox for clustering---of the class of exchangeable feature models. We provide a further characterization of the subclass of feature allocations that have EFPF representations.


Full work available at URL: https://arxiv.org/abs/1301.6647




Recommendations





Cited In (17)





This page was built for publication: Feature allocations, probability functions, and paintboxes

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q908036)