Hierarchical clustering of large databases and classification of antibiotics at high noise levels (Q1662434)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Hierarchical clustering of large databases and classification of antibiotics at high noise levels
scientific article

    Statements

    Hierarchical clustering of large databases and classification of antibiotics at high noise levels (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    20 August 2018
    0 references
    Summary: A new algorithm for divisive hierarchical clustering of chemical compounds based on 2D structural fragments is suggested. The algorithm is deterministic, and given a random ordering of the input, will always give the same clustering and can process a database up to 2 million records on a standard PC. The algorithm was used for classification of 1,183 antibiotics mixed with 999,994 random chemical structures. Similarity threshold, at which best separation of active and non active compounds took place, was estimated as 0.6. 85.7\% of the antibiotics were successfully classified at this threshold with 0.4\% of inaccurate compounds. A .sdf file was created with the probe molecules for clustering of external databases.
    0 references
    0 references
    0 references
    0 references
    0 references
    molecular structure
    0 references
    hierarchical clustering
    0 references
    algorithm
    0 references
    classification of antibiotics
    0 references
    0 references