Multiple imputation for multilevel data with continuous and binary variables

From MaRDI portal
Publication:1799343

DOI10.1214/18-STS646zbMATH Open1397.62265arXiv1702.00971MaRDI QIDQ1799343FDOQ1799343

Stef van Buuren, Vincent Audigier, Shahab Jolani, Ian R. White, Matthieu Resche-Rigon, Matteo Quartagno, Thomas P. A. Debray, James Carpenter

Publication date: 18 October 2018

Published in: Statistical Science (Search for Journal in Brave)

Abstract: We present and compare multiple imputation methods for multilevel continuous and binary data where variables are systematically and sporadically missing. The methods are compared from a theoretical point of view and through an extensive simulation study motivated by a real dataset comprising multiple studies. Simulations are reproducible. The comparisons show why these multiple imputation methods are the most appropriate to handle missing values in a multilevel setting and why their relative performances can vary according to the missing data pattern, the multilevel structure and the type of missing variables. This study shows that valid inferences can only be obtained if the dataset gathers a large number of clusters. In addition, it highlights that heteroscedastic MI methods provide more accurate inferences than homoscedastic methods, which should be reserved for data with few individuals per cluster. Finally, the method of Quartagno and Carpenter (2016a) appears generally accurate for binary variables, the method of Resche-Rigon and White (2016) with large clusters, and the approach of Jolani et al. (2015) with small clusters.


Full work available at URL: https://arxiv.org/abs/1702.00971





Cites Work


Cited In (19)

Uses Software






This page was built for publication: Multiple imputation for multilevel data with continuous and binary variables

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1799343)