Statistical data integration in survey sampling: a review
Publication:830265
DOI: 10.1007/S42081-020-00093-W
zbMATH Open: 1466.62247
arXiv: 2001.03259
OpenAlex: W3092806957
MaRDI QID: Q830265
FDO: Q830265
Publication date: 7 May 2021
Published in: Japanese Journal of Statistics and Data Science
Abstract: Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration provides a timely solution by leveraging multiple data sources to provide more robust and efficient inference than using any single data source alone. The technique for data integration varies depending on types of samples and available information to be combined. This article provides a systematic review of data integration techniques for combining probability samples, probability and non-probability samples, and probability and big data samples. We discuss a wide range of integration methods such as generalized least squares, calibration weighting, inverse probability weighting, mass imputation and doubly robust methods. Finally, we highlight important questions for future research.
Full work available at URL: https://arxiv.org/abs/2001.03259
Sampling theory, sample surveys (62D05); Missing data (62D10); Research exposition (monographs, survey articles) pertaining to statistics (62-02)
Cites Work
- Stable Weights that Balance Covariates for Estimation With Incomplete Outcome Data
- Random forests
- Model assisted survey sampling.
- Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data
- Demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data
- Double/debiased machine learning for treatment and structural parameters
- On a Least Squares Adjustment of a Sampled Frequency Table When the Expected Marginal Totals are Known
- Small area estimation when auxiliary information is measured with error
- Doubly Robust Estimation in Missing Data and Causal Inference Models
- Multivariate k-nearest neighbor density estimates
- The central role of the propensity score in observational studies for causal effects
- Calibration Estimators in Survey Sampling
- Statistical Matching
- Small Area Estimation
- Robust inference on average treatment effects with possibly more covariates than observations
- Sampling Statistics
- Parametric fractional imputation for missing data analysis
- Some new asymptotic theory for least squares series: pointwise and uniform results
- Globally Efficient Non-Parametric Inference of Average Treatment Effects by Empirical Balancing Calibration Weighting
- Covariate Balancing Propensity Score
- An imputation approach for handling mixed-mode surveys
- Aligning Estimates for Common Variables in Two or More Sample Surveys
- Contribution to the Theory of Sampling Human Populations
- Combining Independent Regression Estimators From Multiple Surveys
- Combining data from two independent surveys: a model-assisted approach
- Combining Information from Multiple Surveys by using Regression for Efficient Small Domain Estimation
- Big Data, Official Statistics and Some Initiatives by the Australian Bureau of Statistics
- A note on the equivalence of two semiparametric estimation methods for nonignorable nonresponse
- A measurement error model approach to survey data integration: combining information from two surveys
- Inference for nonprobability samples
- Combining survey data with other data sources
- Covariate balancing propensity score by tailored loss functions
- On making valid inferences by integrating data from surveys and other sources
- Combining household surveys using mass imputation to estimate population totals
- Robust Estimation of Encouragement Design Intervention Effects Transported Across Sites
- Causal Inference in Outcome-Dependent Two-Phase Sampling Designs
- Combining information from multiple surveys through the empirical likelihood method
- Doubly Robust Inference when Combining Probability and Non-Probability Samples with High Dimensional Data
- Combining Multiple Observational Data Sources to Estimate Causal Effects
- Analysis of Integrated Data
- Data integration with high dimensionality
- Developments in Survey Research over the Past 60 Years: A Personal Perspective
Cited In (15)
- Integration of traditional and telematics data for efficient insurance claims prediction
- Kernel weighting for blending probability and non-probability survey samples
- Model-Assisted Estimation Through Random Forests in Finite Population Sampling
- Kernel regression utilizing external information as constraints
- A bridging model to reconcile statistics based on data from multiple surveys
- Pretest estimation in combining probability and non-probability samples
- Data integration in causal inference
- MLE with datasets from populations having shared parameters
- Statistical matching of sample survey data: application to integrate Iranian time use and labour force surveys
- Bayesian ideas in survey sampling: the legacy of Basu
- An Efficient Approach for Statistical Matching of Survey Data Trough Calibration, Optimal Transport and Balanced Sampling
- A GMM approach in coupling internal data and external summary information with heterogeneous data populations
- Causal inference methods for combining randomized trials and observational studies: a review
- Assessment of the effect of constraints in a new multivariate mixed method for statistical matching
- Enhancing estimation methods for integrating probability and nonprobability survey samples with machine-learning techniques. An application to a survey on the impact of the COVID-19 pandemic in Spain