Bayesian Double Feature Allocation for Phenotyping With Electronic Health Records
From MaRDI portal
Publication:5146013
Abstract: We propose a categorical matrix factorization method to infer latent diseases from electronic health records (EHR) data in an unsupervised manner. A latent disease is defined as an unknown biological aberration that causes a set of common symptoms for a group of patients. The proposed approach is based on a novel double feature allocation model which simultaneously allocates features to the rows and the columns of a categorical matrix. Using a Bayesian approach, available prior information on known diseases greatly improves identifiability and interpretability of latent diseases. This includes known diagnoses for patients and known association of diseases with symptoms. We validate the proposed approach by simulation studies including mis-specified models and comparison with sparse latent factor models. In the application to Chinese EHR data, we find interesting results, some of which agree with related clinical and medical knowledge.
Recommendations
- A two‐phase Bayesian methodology for the analysis of binary phenotypes in genome‐wide association studies
- Bayesian latent multi‐state modeling for nonequidistant longitudinal electronic health records
- Bayesian analysis for imbalanced positive-unlabelled diagnosis codes in electronic health records
- Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification
- Electronic health record analysis via deep Poisson factor models
- Bayesian and Frequentist Methods for Provider Profiling Using Risk-Adjusted Assessments of Medical Outcomes
- Bayesian Variable Selection in Multinomial Probit Models to Identify Molecular Signatures of Disease Stage
- Bayesian feature allocation models for tumor heterogeneity
Cites work
- scientific article; zbMATH DE number 1085980 (Why is no real title available?)
- scientific article; zbMATH DE number 1134987 (Why is no real title available?)
- A Bayesian feature allocation model for tumor heterogeneity
- Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem
- Bayesian Inference for Gene Expression and Proteomics
- Bayesian estimation of the DINA \(Q\) matrix
- Bayesian inference for general Gaussian graphical models with application to multivariate lattice data
- Cluster and feature modeling from combinatorial stochastic processes
- Exchangeable trait allocations
- MCMC for normalized random measure mixture models
- Mixture models with a prior on the number of components
- Modeling with normalized random measure mixture models
- Nonparametric Bayesian bi-clustering for next generation sequencing count data
- Sampling decomposable graphs using a Markov chain on junction trees
- Sparse Bayesian infinite factor models
- Statistical analysis of \(Q\)-matrix based diagnostic classification models
- The Indian buffet process: an introduction and review
Cited in
(10)- A Joint MLE Approach to Large-Scale Structured Latent Attribute Analysis
- Hierarchical infinite factor models for improving the prediction of surgical complications for geriatric patients
- Gaussian process regression and classification using international classification of disease codes as covariates
- Comparison and Bayesian Estimation of Feature Allocations
- The attraction Indian buffet distribution
- dfa
- Biclustering via semiparametric Bayesian inference
- Automated feature selection of predictors in electronic medical records data
- A Bayesian approach to restricted latent class models for scientifically structured clustering of multivariate binary outcomes
- Electronic health record analysis via deep Poisson factor models
This page was built for publication: Bayesian Double Feature Allocation for Phenotyping With Electronic Health Records
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5146013)