Endogenous post-stratification in surveys: classifying with a sample-fitted model
From MaRDI portal
Publication:2477066
Abstract: Post-stratification is frequently used to improve the precision of survey estimators when categorical auxiliary information is available from sources outside the survey. In natural resource surveys, such information is often obtained from remote sensing data, classified into categories and displayed as pixel-based maps. These maps may be constructed based on classification models fitted to the sample data. Post-stratification of the sample data based on categories derived from the sample data (``endogenous post-stratification) violates the standard post-stratification assumptions that observations are classified without error into post-strata, and post-stratum population counts are known. Properties of the endogenous post-stratification estimator are derived for the case of a sample-fitted generalized linear model, from which the post-strata are constructed by dividing the range of the model predictions into predetermined intervals. Design consistency of the endogenous post-stratification estimator is established under mild conditions. Under a superpopulation model, consistency and asymptotic normality of the endogenous post-stratification estimator are established, showing that it has the same asymptotic variance as the traditional post-stratified estimator with fixed strata. Simulation experiments demonstrate that the practical effect of first fitting a model to the survey data before post-stratifying is small, even for relatively small sample sizes.
Recommendations
- Nonparametric endogenous post-stratification estimation
- Post-Stratification: A Modeler's Perspective
- Poststratification Without Population Level Information on the Poststratifying Variable With Application to Political Polling
- scientific article; zbMATH DE number 5769446
- scientific article; zbMATH DE number 1159489
Cites work
- scientific article; zbMATH DE number 3842953 (Why is no real title available?)
- scientific article; zbMATH DE number 3945130 (Why is no real title available?)
- scientific article; zbMATH DE number 3549966 (Why is no real title available?)
- scientific article; zbMATH DE number 3607327 (Why is no real title available?)
- A Generalization of the Glivenko-Cantelli Theorem
- A Model-Calibration Approach to Using Complete Auxiliary Information From Survey Data
- Local polynomial regresssion estimators in survey sampling.
- Model assisted survey sampling
- Model-assisted estimation for complex surveys using penalised splines
- On the asymptotic normality of statistics with estimated parameters
Cited in
(6)- Nonparametric endogenous post-stratification estimation
- Extended Glivenko—Cantelli theorem for simple random sampling without replacement from a finite population
- Survey design asymptotics for the model-assisted penalised spline regression estimator
- Uniform convergence of the empirical cumulative distribution function under informative selection from a finite population
- Consistency of the Horvitz-Thompson estimator under general sampling and experimental designs
- Likelihood-based estimators for endogenous or truncated samples in standard stratified sampling
This page was built for publication: Endogenous post-stratification in surveys: classifying with a sample-fitted model
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2477066)