Influential features PCA for high dimensional clustering (Q510669)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Influential features PCA for high dimensional clustering
scientific article

    Statements

    Influential features PCA for high dimensional clustering (English)
    0 references
    0 references
    0 references
    0 references
    13 February 2017
    0 references
    Starting from the problem of clustering using gene microarray data, the paper approaches the situation when the feature vectors come from different classes the labels of which are unknown. The authors propose as solution to this problem the influential features PCA (IF-PCA) technique as a new spectral clustering method, along with the Kolmogorov-Smirnov (K-S) score. Since the performance of IF-PCA depends on the choice of the corresponding threshold, the Higher Criticism (H-C) technique is used accordingly. The model is applied to ten different microarray medical datasets (brain, breast cancer, leukemia, etc.) and compared with other clustering methods, and the method is proved to be efficient.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    empirical null
    0 references
    feature selection
    0 references
    gene microarray
    0 references
    Hamming distance
    0 references
    phase transition
    0 references
    post-selection spectral clustering
    0 references
    sparsity
    0 references
    0 references
    0 references