\(K\)-nearest neighbor intervals based AP clustering algorithm for large incomplete data (Q1665949)

From MaRDI portal
scientific article

    Statements

    27 August 2018
    Summary: The Affinity Propagation (AP) algorithm is effective for clustering analysis, but it cannot be applied directly to incomplete data. Given the prevalence of missing data and the uncertainty of missing attributes, we propose a modified AP clustering algorithm for incomplete data based on \(K\)-nearest neighbor intervals (KNNI). Based on an Improved Partial Data Strategy, the proposed algorithm estimates the KNNI representation of missing attributes using the attribute distribution information of the available data. The similarity function is then modified to handle interval data, so that the improved AP algorithm can be applied to incomplete data. Experiments on several UCI datasets show that the proposed algorithm achieves impressive clustering results.
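
The pipeline outlined in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not give the formulas, so the rescaled partial distance, the choice of the neighbors' minimum and maximum as interval bounds, and the interval similarity (negative sum of squared differences of the lower and upper bounds) are all assumptions, and the function names `partial_distance`, `knn_intervals`, and `interval_similarity` are hypothetical.

```python
import numpy as np


def partial_distance(a, b):
    """Distance over attributes observed in both vectors (assumed form).

    The squared distance is rescaled by (total attributes / shared attributes)
    so that pairs with few shared attributes are not favored.
    """
    both = ~np.isnan(a) & ~np.isnan(b)
    if not both.any():
        return np.inf  # no shared attributes: treat as infinitely far
    d2 = np.sum((a[both] - b[both]) ** 2)
    return np.sqrt(d2 * a.size / both.sum())


def knn_intervals(X, k):
    """Replace each missing entry (NaN) by an interval [lower, upper].

    The interval spans the values of that attribute among the k nearest
    samples (by partial distance) in which the attribute is observed.
    Observed entries become degenerate intervals with lower == upper.
    """
    n, _ = X.shape
    lower, upper = X.copy(), X.copy()
    for i in range(n):
        missing = np.flatnonzero(np.isnan(X[i]))
        if missing.size == 0:
            continue
        dists = np.array(
            [np.inf if j == i else partial_distance(X[i], X[j]) for j in range(n)]
        )
        for col in missing:
            # Candidates: comparable samples that have this attribute observed.
            cand = np.flatnonzero(~np.isnan(X[:, col]) & np.isfinite(dists))
            if cand.size == 0:
                raise ValueError(f"no neighbor observes attribute {col} for sample {i}")
            nearest = cand[np.argsort(dists[cand])[:k]]
            lower[i, col] = X[nearest, col].min()
            upper[i, col] = X[nearest, col].max()
    return lower, upper


def interval_similarity(lower, upper):
    """Pairwise similarity for interval data (assumed form).

    Mirrors AP's usual negative squared Euclidean similarity, applied to
    both interval endpoints.
    """
    dl = lower[:, None, :] - lower[None, :, :]
    du = upper[:, None, :] - upper[None, :, :]
    return -(np.sum(dl ** 2, axis=2) + np.sum(du ** 2, axis=2))
```

Under these assumptions, the matrix returned by `interval_similarity` could be passed to any AP implementation that accepts a precomputed similarity matrix, for example scikit-learn's `AffinityPropagation(affinity="precomputed")`.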