A hierarchical multi-label classification algorithm for gene function prediction (Q2633227)

From MaRDI portal
scientific article
Language Label Description Also known as
English
A hierarchical multi-label classification algorithm for gene function prediction
scientific article

    Statements

    A hierarchical multi-label classification algorithm for gene function prediction (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    8 May 2019
    0 references
    Summary: Gene function prediction is a complicated and challenging hierarchical multi-label classification (HMC) task, in which genes may have many functions at the same time and these functions are organized in a hierarchy. This paper proposed a novel HMC algorithm for solving this problem based on the Gene Ontology (GO), the hierarchy of which is a directed acyclic graph (DAG) and is more difficult to tackle. In the proposed algorithm, the HMC task is firstly changed into a set of binary classification tasks. Then, two measures are implemented in the algorithm to enhance the HMC performance by considering the hierarchy structure during the learning procedures. Firstly, negative instances selecting policy associated with the SMOTE approach are proposed to alleviate the imbalanced data set problem. Secondly, a nodes interaction method is introduced to combine the results of binary classifiers. It can guarantee that the predictions are consistent with the hierarchy constraint. The experiments on eight benchmark yeast data sets annotated by the Gene Ontology show the promising performance of the proposed algorithm compared with other state-of-the-art algorithms.
    0 references
    0 references
    0 references
    0 references
    0 references
    hierarchical multi-label classification
    0 references
    the gene ontology
    0 references
    gene function prediction
    0 references
    DAG
    0 references
    0 references