A smoothing approach for masking spatial data
From MaRDI portal
(Redirected from Publication:614160)
Abstract: Individual-level health data are often not publicly available due to confidentiality; masked data are released instead. Therefore, it is important to evaluate the utility of using the masked data in statistical analyses such as regression. In this paper we propose a data masking method which is based on spatial smoothing techniques. The proposed method allows for selecting both the form and the degree of masking, thus resulting in a large degree of flexibility. We investigate the utility of the masked data sets in terms of the mean square error (MSE) of regression parameter estimates when fitting a Generalized Linear Model (GLM) to the masked data. We also show that incorporating prior knowledge on the spatial pattern of the exposure into the data masking may reduce the bias and MSE of the parameter estimates. By evaluating both utility and disclosure risk as functions of the form and the degree of masking, our method produces a risk-utility profile which can facilitate the selection of masking parameters. We apply the method to a study of racial disparities in mortality rates using data on more than 4 million Medicare enrollees residing in 2095 zip codes in the Northeast region of the United States.
Recommendations
- scientific article; zbMATH DE number 2087737
- Spatial Smoothing of Geographically Aggregated Data, With Application to the Construction of Incidence Maps
- A family of methods for statistical disclosure control
- Make assurance double sure: combination of two disclosure limitation methods and estimation of general regression models
- Masking methods that preserve positivity constraints in microdata
Cites work
- scientific article; zbMATH DE number 46578 (Why is no real title available?)
- scientific article; zbMATH DE number 708500 (Why is no real title available?)
- scientific article; zbMATH DE number 1082208 (Why is no real title available?)
- scientific article; zbMATH DE number 2156343 (Why is no real title available?)
- A smoothing approach for masking spatial data
- Bootstrap methods: another look at the jackknife
- Data-swapping: A technique for disclosure control
- Elements of statistical disclosure control
- Estimating Risks of Identification Disclosure in Microdata
- MODELLING USER UNCERTAINTY FOR DISCLOSURE RISK AND DATA UTILITY
- Network Models for Complementary Cell Suppression
- On the Barcilon formula for the string equation with a piecewise continuous density function
- Releasing Multiply Imputed, Synthetic Public use Microdata: An Illustration and Empirical Study
- SOFTWARE SYSTEMS FOR TABULAR DATA RELEASES
- Semiparametric Regression
- Smoothing methods in statistics
- Statistical disclosure control in practice
- Synthetic two-way contingency tables that preserve conditional frequencies
- The elements of statistical learning. Data mining, inference, and prediction
Cited in
(3)
This page was built for publication: A smoothing approach for masking spatial data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q614160)