Prediction with missing data via Bayesian additive regression trees
From MaRDI portal
Publication:5256379
DOI10.1002/CJS.11248zbMATH Open1328.62243arXiv1306.0618OpenAlexW2962723792MaRDI QIDQ5256379FDOQ5256379
Authors: Adam Kapelner, Justin Bleich
Publication date: 22 June 2015
Published in: The Canadian Journal of Statistics (Search for Journal in Brave)
Abstract: We present a method for incorporating missing data in non-parametric statistical learning without the need for imputation. We focus on a tree-based method, Bayesian Additive Regression Trees (BART), enhanced with "Missingness Incorporated in Attributes," an approach recently proposed incorporating missingness into decision trees (Twala, 2008). This procedure takes advantage of the partitioning mechanisms found in tree-based models. Simulations on generated models and real data indicate that our proposed method can forecast well on complicated missing-at-random and not-missing-at-random models as well as models where missingness itself influences the response. Our procedure has higher predictive performance and is more stable than competitors in many cases. We also illustrate BART's abilities to incorporate missingness into uncertainty intervals and to detect the influence of missingness on the model fit.
Full work available at URL: https://arxiv.org/abs/1306.0618
Recommendations
Cites Work
- Variable selection for BART: an application to gene regulation
- BART: Bayesian additive regression trees
- Random forests
- Bayesian Analysis of Binary and Polychotomous Response Data
- Statistical modeling: The two cultures. (With comments and a rejoinder).
- Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images
- Title not available (Why is that?)
- Monte Carlo sampling methods using Markov chains and their applications
- Pattern-Mixture Models for Multivariate Incomplete Data
- Stochastic gradient boosting.
- An investigation of missing data methods for classification trees applied to binary response data
Cited In (24)
- Learning algorithms to evaluate forensic glass evidence
- Racing trees to query partial data
- Nowcasting in a pandemic using non-parametric mixed frequency VARs
- Smoothing and adaptation of shifted Pólya tree ensembles
- Discovering interactions using covariate informed random partition models
- Rhapsody in fractional
- BART-based inference for Poisson processes
- Boosting with missing predictors
- Understanding the effect of contextual factors and decision making on team performance in Twenty20 cricket: an interpretable machine learning approach
- Clustering and Prediction With Variable Dimension Covariates
- An integrated Bayesian framework for multi-omics prediction and classification
- bartMachine
- Performance of variable and function selection methods for estimating the nonlinear health effects of correlated chemical mixtures: a simulation study
- Variable selection for BART: an application to gene regulation
- Heterogeneous causal effects with imperfect compliance: a Bayesian machine learning approach
- Bayesian additive regression trees with model trees
- Regression with variable dimension covariates
- Inference in Bayesian additive vector autoregressive tree models
- BiMM tree: a decision tree method for modeling clustered and longitudinal binary outcomes
- Tree-based algorithms for missing data imputation
- Bayesian additive regression trees using Bayesian model averaging
- Propensity score estimation using classification and regression trees in the presence of missing covariate data
- An ensemble learning method for variable selection: application to high-dimensional data and missing values
- Ordered probit Bayesian additive regression trees for ordinal data
Uses Software
This page was built for publication: Prediction with missing data via Bayesian additive regression trees
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5256379)