Multiple imputation for multilevel data with continuous and binary variables

From MaRDI portal
Publication:1799343

DOI10.1214/18-STS646zbMATH Open1397.62265arXiv1702.00971MaRDI QIDQ1799343FDOQ1799343


Authors: Vincent Audigier, Ian R. White, Shahab Jolani, Thomas P. A. Debray, Matteo Quartagno, James Carpenter, Stef van Buuren, Matthieu Resche-Rigon Edit this on Wikidata


Publication date: 18 October 2018

Published in: Statistical Science (Search for Journal in Brave)

Abstract: We present and compare multiple imputation methods for multilevel continuous and binary data where variables are systematically and sporadically missing. The methods are compared from a theoretical point of view and through an extensive simulation study motivated by a real dataset comprising multiple studies. Simulations are reproducible. The comparisons show why these multiple imputation methods are the most appropriate to handle missing values in a multilevel setting and why their relative performances can vary according to the missing data pattern, the multilevel structure and the type of missing variables. This study shows that valid inferences can only be obtained if the dataset gathers a large number of clusters. In addition, it highlights that heteroscedastic MI methods provide more accurate inferences than homoscedastic methods, which should be reserved for data with few individuals per cluster. Finally, the method of Quartagno and Carpenter (2016a) appears generally accurate for binary variables, the method of Resche-Rigon and White (2016) with large clusters, and the approach of Jolani et al. (2015) with small clusters.


Full work available at URL: https://arxiv.org/abs/1702.00971




Recommendations




Cites Work


Cited In (20)

Uses Software





This page was built for publication: Multiple imputation for multilevel data with continuous and binary variables

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1799343)