Resolving the Complexity of Some Data Privacy Problems

From MaRDI portal
Publication:3587455

DOI10.1007/978-3-642-14162-1_33zbMATH Open1288.68058arXiv1004.3811OpenAlexW1880832693MaRDI QIDQ3587455FDOQ3587455


Authors: Jeremiah Blocki, Ryan Williams Edit this on Wikidata


Publication date: 7 September 2010

Published in: Automata, Languages and Programming (Search for Journal in Brave)

Abstract: We formally study two methods for data sanitation that have been used extensively in the database community: k-anonymity and l-diversity. We settle several open problems concerning the difficulty of applying these methods optimally, proving both positive and negative results: 1. 2-anonymity is in P. 2. The problem of partitioning the edges of a triangle-free graph into 4-stars (degree-three vertices) is NP-hard. This yields an alternative proof that 3-anonymity is NP-hard even when the database attributes are all binary. 3. 3-anonymity with only 27 attributes per record is MAX SNP-hard. 4. For databases with n rows, k-anonymity is in O(4^n poly(n)) time for all k > 1. 5. For databases with n rows and l <= log_{2c+2} log n attributes over an alphabet of cardinality c = O(1), k-anonymity is in P. Assuming c, l = O(1), k-anonymity is in O(n). 6. 3-diversity with binary attributes is NP-hard, with one sensitive attribute. 7. 2-diversity with binary attributes is NP-hard, with three sensitive attributes.


Full work available at URL: https://arxiv.org/abs/1004.3811




Recommendations




Cited In (16)





This page was built for publication: Resolving the Complexity of Some Data Privacy Problems

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3587455)