Automated Selection of Post-Strata using a Model-Assisted Regression Tree Estimator
From MaRDI portal
Publication:124129
DOI: 10.48550/ARXIV.1712.05708; zbMATH Open: 1418.62039; arXiv: 1712.05708; OpenAlex: W2963726264; MaRDI QID: Q124129; FDO: Q124129
Kelly S. McConville, Daniell Toth
Publication date: 15 December 2017
Published in: Scandinavian Journal of Statistics
Abstract: Auxiliary information can increase the efficiency of survey estimators through an assisting model when the model captures some of the relationship between the auxiliary data and the study variables. Despite their superior properties, model-assisted estimators are rarely used in anything but their simplest form by statistical agencies to produce official statistics. This is due to the fact that the more complicated models that have been used in model-assisted estimation are often ill suited to the available auxiliary data. Under a model-assisted framework, we propose a regression tree estimator for a finite population total. Regression tree models are adept at handling the type of auxiliary data usually available in the sampling frame and provide a model that is easy to explain and justify. The estimator can be viewed as a post-stratification estimator where the post-strata are automatically selected by the recursive partitioning algorithm of the regression tree. We establish consistency of the regression tree estimator and compare its performance to other survey estimators using the US Bureau of Labor Statistics Occupational Employment Statistics Survey.
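The post-stratification view described in the abstract can be illustrated with a deliberately simplified sketch. It is not the authors' algorithm: it assumes a single numeric auxiliary variable known for every population unit, performs only one split (a full regression tree would partition recursively), and chooses the split by minimizing a design-weighted sum of squared errors. The function names `best_split` and `post_stratified_total`, the `min_size` parameter, and the toy data are all hypothetical.

```python
from typing import List, Optional, Sequence


def weighted_mean(y: Sequence[float], w: Sequence[float]) -> float:
    """Design-weighted mean of y with weights w (e.g. inverse inclusion probabilities)."""
    return sum(wi * yi for wi, yi in zip(w, y)) / sum(w)


def weighted_sse(y: Sequence[float], w: Sequence[float]) -> float:
    """Design-weighted sum of squared deviations from the weighted mean."""
    m = weighted_mean(y, w)
    return sum(wi * (yi - m) ** 2 for wi, yi in zip(w, y))


def best_split(x: Sequence[float], y: Sequence[float], w: Sequence[float],
               min_size: int = 2) -> Optional[float]:
    """Return the threshold c that minimizes the weighted SSE of the groups
    x <= c and x > c, or None if no split leaves min_size units on each side.
    (Assumed splitting rule for illustration only.)"""
    best_c, best_loss = None, float("inf")
    for c in sorted(set(x))[:-1]:  # the largest value cannot separate anything
        left = [i for i, xi in enumerate(x) if xi <= c]
        right = [i for i, xi in enumerate(x) if xi > c]
        if len(left) < min_size or len(right) < min_size:
            continue
        loss = (weighted_sse([y[i] for i in left], [w[i] for i in left])
                + weighted_sse([y[i] for i in right], [w[i] for i in right]))
        if loss < best_loss:
            best_c, best_loss = c, loss
    return best_c


def post_stratified_total(x_s: Sequence[float], y_s: Sequence[float],
                          w_s: Sequence[float], x_pop: Sequence[float],
                          c: float) -> float:
    """Post-stratified estimate of the population total of y: for each
    post-stratum, the known population count times the weighted sample mean."""
    total = 0.0
    for in_stratum in (lambda v: v <= c, lambda v: v > c):
        idx: List[int] = [i for i, xi in enumerate(x_s) if in_stratum(xi)]
        n_pop = sum(1 for xi in x_pop if in_stratum(xi))
        total += n_pop * weighted_mean([y_s[i] for i in idx],
                                       [w_s[i] for i in idx])
    return total


# Toy data (made up): auxiliary x is known for the whole population,
# while y is observed only for the sampled units.
x_pop = [1, 2, 3, 4, 10, 11, 12, 13]
x_s, y_s, w_s = [1, 3, 10, 13], [5.0, 7.0, 20.0, 22.0], [2.0, 2.0, 2.0, 2.0]

c = best_split(x_s, y_s, w_s)
estimate = post_stratified_total(x_s, y_s, w_s, x_pop, c)
print(c, estimate)
```

The split is learned from the sample alone, and the population enters only through the post-stratum counts, which is what makes the fitted partition usable as a set of post-strata.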
Full work available at URL: https://arxiv.org/abs/1712.05708
Recommendations
- Tree-based models for fitting stratified linear regression models
- scientific article; zbMATH DE number 2062524
- Semi-automated simultaneous predictor selection for regression-SARIMA models
- Efficient and adaptive post-model-selection estimators
- Automatic model selection for partially linear models
- Model selection for (auto-)regression with dependent data
- Variable Selection and Interaction Detection with Bayesian Additive Regression Trees
Cited In (6)
- Model-Assisted Estimation Through Random Forests in Finite Population Sampling
- On making valid inferences by integrating data from surveys and other sources
- Comments on: "Deville and Särndal's calibration: revisiting a 25 years old successful optimization problem"
- Design-unbiased statistical learning in survey sampling
- mase
- Model-assisted estimation in high-dimensional settings for survey data