Statistical data integration in survey sampling: a review
From MaRDI portal
(Redirected from Publication:830265)
Abstract: Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration provides a timely solution by leveraging multiple data sources to provide more robust and efficient inference than using any single data source alone. The technique for data integration varies depending on types of samples and available information to be combined. This article provides a systematic review of data integration techniques for combining probability samples, probability and non-probability samples, and probability and big data samples. We discuss a wide range of integration methods such as generalized least squares, calibration weighting, inverse probability weighting, mass imputation and doubly robust methods. Finally, we highlight important questions for future research.
Recommendations
- On making valid inferences by integrating data from surveys and other sources
- Combining survey data with other data sources
- Combining household surveys using mass imputation to estimate population totals
- Combining statistical matching and propensity score adjustment for inference from non-probability surveys
- scientific article; zbMATH DE number 218978
Cites work
- scientific article; zbMATH DE number 3549966 (Why is no real title available?)
- scientific article; zbMATH DE number 3297798 (Why is no real title available?)
- A measurement error model approach to survey data integration: combining information from two surveys
- A note on the equivalence of two semiparametric estimation methods for nonignorable nonresponse
- Aligning Estimates for Common Variables in Two or More Sample Surveys
- An imputation approach for handling mixed-mode surveys
- Analysis of integrated data
- Big Data, Official Statistics and Some Initiatives by the Australian Bureau of Statistics
- Calibration Estimators in Survey Sampling
- Causal Inference in Outcome-Dependent Two-Phase Sampling Designs
- Combining Independent Regression Estimators From Multiple Surveys
- Combining Information from Multiple Surveys by using Regression for Efficient Small Domain Estimation
- Combining data from two independent surveys: a model-assisted approach
- Combining household surveys using mass imputation to estimate population totals
- Combining information from multiple surveys through the empirical likelihood method
- Combining multiple observational data sources to estimate causal effects
- Combining survey data with other data sources
- Contribution to the Theory of Sampling Human Populations
- Covariate Balancing Propensity Score
- Covariate balancing propensity score by tailored loss functions
- Data integration with high dimensionality
- Demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data
- Developments in Survey Research over the Past 60 Years: A Personal Perspective
- Double/debiased machine learning for treatment and structural parameters
- Doubly Robust Estimation in Missing Data and Causal Inference Models
- Doubly Robust Inference when Combining Probability and Non-Probability Samples with High Dimensional Data
- Generalized additive models. An introduction with R.
- Globally efficient non-parametric inference of average treatment effects by empirical balancing calibration weighting
- Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data
- Inference for nonprobability samples
- Model assisted survey sampling.
- Multivariate k-nearest neighbor density estimates
- On a Least Squares Adjustment of a Sampled Frequency Table When the Expected Marginal Totals are Known
- On making valid inferences by integrating data from surveys and other sources
- Parametric fractional imputation for missing data analysis
- Random forests
- Robust Estimation of Encouragement Design Intervention Effects Transported Across Sites
- Robust inference on average treatment effects with possibly more covariates than observations
- Sampling Statistics
- Small area estimation
- Small area estimation when auxiliary information is measured with error
- Some new asymptotic theory for least squares series: pointwise and uniform results
- Stable weights that balance covariates for estimation with incomplete outcome data
- Statistical Matching
- The central role of the propensity score in observational studies for causal effects
Cited in
(18)- MLE with datasets from populations having shared parameters
- Combining household surveys using mass imputation to estimate population totals
- Integration of traditional and telematics data for efficient insurance claims prediction
- Statistical matching of sample survey data: application to integrate Iranian time use and labour force surveys
- Assessment of the effect of constraints in a new multivariate mixed method for statistical matching
- On making valid inferences by integrating data from surveys and other sources
- A GMM approach in coupling internal data and external summary information with heterogeneous data populations
- Enhancing estimation methods for integrating probability and nonprobability survey samples with machine-learning techniques. An application to a survey on the impact of the COVID-19 pandemic in Spain
- Data integration in causal inference
- Kernel weighting for blending probability and non-probability survey samples
- Bayesian ideas in survey sampling: the legacy of Basu
- A bridging model to reconcile statistics based on data from multiple surveys
- Model-Assisted Estimation Through Random Forests in Finite Population Sampling
- An Efficient Approach for Statistical Matching of Survey Data Trough Calibration, Optimal Transport and Balanced Sampling
- Causal inference methods for combining randomized trials and observational studies: a review
- Kernel regression utilizing external information as constraints
- Combining survey data with other data sources
- Pretest estimation in combining probability and non-probability samples
This page was built for publication: Statistical data integration in survey sampling: a review
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q830265)