Comparison and Bayesian Estimation of Feature Allocations
From MaRDI portal
Publication:79670
DOI10.1080/10618600.2023.2204136arXiv2207.13824MaRDI QIDQ79670FDOQ79670
Author name not available (Why is that?), David B. Dahl, Author name not available (Why is that?), David B. Dahl, R. Jacob Andros, Devin J. Johnson
Publication date: 27 July 2022
Published in: Journal of Computational and Graphical Statistics (Search for Journal in Brave)
Abstract: Feature allocation models postulate a sampling distribution whose parameters are derived from shared features. Bayesian models place a prior distribution on the feature allocation, and Markov chain Monte Carlo is typically used for model fitting, which results in thousands of feature allocations sampled from the posterior distribution. Based on these samples, we propose a method to provide a point estimate of a latent feature allocation. First, we introduce FARO loss, a function between feature allocations which satisfies quasi-metric properties and allows for comparing feature allocations with differing numbers of features. The loss involves finding the optimal feature ordering among all possible, but computational feasibility is achieved by framing this task as a linear assignment problem. We also introduce the FANGS algorithm to obtain a Bayes estimate by minimizing the Monte Carlo estimate of the posterior expected FARO loss using the available samples. FANGS can produce an estimate other than those visited in the Markov chain. We provide an investigation of existing methods and our proposed methods. Our loss function and search algorithm are implemented in the fangs package in R.
Full work available at URL: https://arxiv.org/abs/2207.13824
Cites Work
- Bayesian Double Feature Allocation for Phenotyping With Electronic Health Records
- A shortest augmenting path algorithm for dense and sparse linear assignment problems
- Bayesian cluster analysis: point estimation and credible balls (with discussion)
- An introduction to abstract algebra
- Phylogeny-based tumor subclone identification using a Bayesian feature allocation model
- Bayesian cluster analysis
- Search Algorithms and Loss Functions for Bayesian Clustering
- Consensus Monte Carlo for Random Subsets Using Shared Anchors
- Multiple Hypothesis Testing by Clustering Treatment Effects
- MAD Bayes for Tumor Heterogeneity—Feature Allocation With Exponential Family Sampling
- Title not available (Why is that?)
- Combinatorial matrix classes
- Bayesian Inference for Gene Expression and Proteomics
- The attraction Indian buffet distribution
Cited In (1)
This page was built for publication: Comparison and Bayesian Estimation of Feature Allocations
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q79670)