Providing accurate models across private partitioned data: secure maximum likelihood estimation

From MaRDI portal
Publication:1624813

DOI10.1214/18-AOAS1171zbMATH Open1405.62245arXiv1710.06933OpenAlexW2964294215WikidataQ129462835 ScholiaQ129462835MaRDI QIDQ1624813FDOQ1624813

Timothy R. Brick, Aleksandra Slavković, Joshua Snoke, Michael D. Hunter

Publication date: 16 November 2018

Published in: The Annals of Applied Statistics (Search for Journal in Brave)

Abstract: This paper focuses on the privacy paradigm of providing access to researchers to remotely carry out analyses on sensitive data stored behind firewalls. We address the situation where the analysis demands data from multiple physically separate databases which cannot be combined. Motivating this problem are analyses using multiple data sources that currently are only possible through extension work creating a trusted user network. We develop and demonstrate a method for accurate calculation of the multivariate normal likelihood equation, for a set of parameters given the partitioned data, which can then be maximized to obtain estimates. These estimates are achieved without sharing any data or any true intermediate statistics of the data across firewalls. We show that under a certain set of assumptions our method for estimation across these partitions achieves identical results as estimation with the full data. Privacy is maintained by adding noise at each partition. This ensures each party receives noisy statistics, such that the noise cannot be removed until the last step to obtain a single value, the true total log-likelihood. Potential applications include all methods utilizing parameter estimation through maximizing the multivariate normal likelihood equation. We give detailed algorithms, along with available software, and both a real data example and simulations estimating structural equation models (SEMs) with partitioned data.


Full work available at URL: https://arxiv.org/abs/1710.06933





Cites Work


Uses Software






This page was built for publication: Providing accurate models across private partitioned data: secure maximum likelihood estimation

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1624813)