Pattern-guided \(k\)-anonymity (Q1736590)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Pattern-guided \(k\)-anonymity
scientific article

    Statements

    Pattern-guided \(k\)-anonymity (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    26 March 2019
    0 references
    Summary: We suggest a user-oriented approach to combinatorial data anonymization. A data matrix is called \(k\)-anonymous if every row appears at least \(k\) times -- the goal of the NP-hard \(k\)-Anonymity problem then is to make a given matrix \(k\)-anonymous by suppressing (blanking out) as few entries as possible. Building on previous work and coping with corresponding deficiencies, we describe an enhanced \(k\)-anonymization problem called \textsc{Pattern-Guided} \(k\)-\textsc{Anonymity}, where the users specify in which combinations suppressions may occur. In this way, the user of the anonymized data can express the differing importance of various data features. We show that \textsc{Pattern-Guided} \(k\)-\textsc{Anonymity} is NP-hard. We complement this by a fixed-parameter tractability result based on a ``data-driven parameterization'' and, based on this, develop an exact integer linear program (ILP)-based solution method, as well as a simple, but very effective, greedy heuristic. Experiments on several real-world datasets show that our heuristic easily matches up to the established ``Mondrian'' algorithm for \(k\)-\textsc{Anonymity} in terms of the quality of the anonymization and outperforms it in terms of running time.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    NP-hardness
    0 references
    parameterized complexity
    0 references
    integer linear programming
    0 references
    exact algorithms
    0 references
    heuristics
    0 references
    experiments
    0 references
    0 references