Improving estimation efficiency for two-phase, outcome-dependent sampling studies

DOI10.1214/23-EJS2124MaRDI QIDQ6158213zbMATH OpenOpenAlexFDO

Authors Menglu Che, Peisong Han, Jerald F. Lawless

Publication date 31 May 2023

Published in Electronic Journal of Statistics (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/2212.09817, https://projecteuclid.org/journals/electronic-journal-of-statistics/volume-17/issue-1/Improving-estimation-efficiency-for-two-phase-outcome-dependent-sampling-studies/10.1214/23-EJS2124.full

zbMATH Keywords

conditional likelihood missing at random empirical likelihood two-phase study surrogate covariate expensive covariate

Mathematics Subject Classification ID

Statistics (62-XX)

Abstract: Two-phase outcome dependent sampling (ODS) is widely used in many fields, especially when certain covariates are expensive and/or difficult to measure. For two-phase ODS, the conditional maximum likelihood (CML) method is very attractive because it can handle zero Phase 2 selection probabilities and avoids modeling the covariate distribution. However, most existing CML-based methods use only the Phase 2 sample and thus may be less efficient than other methods. We propose a general empirical likelihood method that uses CML augmented with additional information in the whole Phase 1 sample to improve estimation efficiency. The proposed method maintains the ability to handle zero selection probabilities and avoids modeling the covariate distribution, but can lead to substantial efficiency gains over CML in the inexpensive covariates, or in the influential covariate when a surrogate is available, because of an effective use of the Phase 1 data. Simulations and a real data illustration using NHANES data are presented.

Cites work

Cited in

(5)

This page was built for publication: Improving estimation efficiency for two-phase, outcome-dependent sampling studies

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6158213)