Addressing selection bias and measurement error in COVID-19 case count data using auxiliary information
From MaRDI portal
Publication:6138611
Abstract: Coronavirus case-count data has influenced government policies and drives most epidemiological forecasts. Limited testing is cited as the key driver behind minimal information on the COVID-19 pandemic. While expanded testing is laudable, measurement error and selection bias are the two greatest problems limiting our understanding of the COVID-19 pandemic; neither can be fully addressed by increased testing capacity. In this paper, we demonstrate their impact on estimation of point prevalence and the effective reproduction number. We show that estimates based on the millions of molecular tests in the US has the same mean square error as a small simple random sample. To address this, a procedure is presented that combines case-count data and random samples over time to estimate selection propensities based on key covariate information. We then combine these selection propensities with epidemiological forecast models to construct a emph{doubly robust} estimation method that accounts for both measurement-error and selection bias. This method is then applied to estimate Indiana's active infection prevalence using case-count, hospitalization, and death data with demographic information, a statewide random molecular sample collected from April 25--29th, and Delphi's COVID-19 Trends and Impact Survey. We end with a series of recommendations based on the proposed methodology.
Cites work
- scientific article; zbMATH DE number 3522963 (Why is no real title available?)
- scientific article; zbMATH DE number 3549966 (Why is no real title available?)
- A Generalization of Sampling Without Replacement From a Finite Universe
- BETS: the dangers of selection bias in early analyses of the coronavirus disease (COVID-19) pandemic
- Doubly Robust Inference With Nonprobability Survey Samples
- Forecasting seasonal influenza with a state-space SIR model
- Inference for nonprobability samples
- Model-assisted survey estimation with modern prediction techniques
- Statistical paradises and paradoxes in big data. I: Law of large populations, big data paradox, and the 2016 US presidential election
This page was built for publication: Addressing selection bias and measurement error in COVID-19 case count data using auxiliary information
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6138611)