Addressing selection bias and measurement error in COVID-19 case count data using auxiliary information
Publication:6138611
DOI: 10.1214/23-AOAS1744, arXiv: 2005.10425, MaRDI QID: Q6138611, FDO: Q6138611
Authors: Walter Dempsey
Publication date: 16 January 2024
Published in: The Annals of Applied Statistics
Abstract: Coronavirus case-count data have influenced government policies and drive most epidemiological forecasts. Limited testing is cited as the key driver behind minimal information on the COVID-19 pandemic. While expanded testing is laudable, measurement error and selection bias are the two greatest problems limiting our understanding of the COVID-19 pandemic; neither can be fully addressed by increased testing capacity. In this paper, we demonstrate their impact on estimation of point prevalence and the effective reproduction number. We show that estimates based on the millions of molecular tests in the US have the same mean square error as a small simple random sample. To address this, a procedure is presented that combines case-count data and random samples over time to estimate selection propensities based on key covariate information. We then combine these selection propensities with epidemiological forecast models to construct a doubly robust estimation method that accounts for both measurement error and selection bias. This method is then applied to estimate Indiana's active infection prevalence using case-count, hospitalization, and death data with demographic information, a statewide random molecular sample collected from April 25-29, and Delphi's COVID-19 Trends and Impact Survey. We end with a series of recommendations based on the proposed methodology.
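Illustrative sketch (not taken from the paper): the abstract names a doubly robust method that pairs selection propensities with a forecast model. The Python below shows only the generic augmented inverse-propensity-weighted form of such an estimator for a population mean, such as a prevalence. It is not the author's procedure, and it ignores measurement error. The function name `doubly_robust_mean`, its arguments, and the toy numbers are all assumptions made for illustration.

```python
def doubly_robust_mean(y_sample, m_sample, pi_sample, m_population):
    """Generic doubly robust estimate of a population mean.

    y_sample     : observed outcomes for the selected (tested) units
    m_sample     : outcome-model predictions for those same units
    pi_sample    : estimated selection propensities for those units, each in (0, 1]
    m_population : outcome-model predictions for every unit in the population

    All names are hypothetical; the propensities and predictions are assumed
    to have been estimated elsewhere.
    """
    if not (len(y_sample) == len(m_sample) == len(pi_sample)):
        raise ValueError("sample inputs must have equal length")
    if any(p <= 0 or p > 1 for p in pi_sample):
        raise ValueError("propensities must lie in (0, 1]")

    n_population = len(m_population)
    # Model-based part: sum of predictions over the whole population.
    model_total = sum(m_population)
    # Correction part: inverse-propensity-weighted residuals on the sample.
    correction = sum(
        (y - m) / p for y, m, p in zip(y_sample, m_sample, pi_sample)
    )
    return (model_total + correction) / n_population


# Toy inputs (made up): three tested units from a population of eight,
# with an outcome model that predicts zero for everyone.
estimate = doubly_robust_mean(
    y_sample=[1, 0, 1],
    m_sample=[0.0, 0.0, 0.0],
    pi_sample=[0.5, 0.25, 0.5],
    m_population=[0.0] * 8,
)
print(estimate)
```

With an all-zero outcome model the estimate reduces to the plain inverse-propensity-weighted mean. With a model that reproduces the sampled outcomes exactly, the correction term vanishes and the estimate is the population average of the predictions. This is the sense in which the form tolerates misspecification of one of its two components.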
Full work available at URL: https://arxiv.org/abs/2005.10425
Cites Work
- A Generalization of Sampling Without Replacement From a Finite Universe
- Model-assisted survey estimation with modern prediction techniques
- BETS: the dangers of selection bias in early analyses of the coronavirus disease (COVID-19) pandemic
- Inference for nonprobability samples
- Statistical paradises and paradoxes in big data. I: Law of large populations, big data paradox, and the 2016 US presidential election
- Forecasting seasonal influenza with a state-space SIR model
- Doubly Robust Inference With Nonprobability Survey Samples