Hierarchical clustering of large databases and classification of antibiotics at high noise levels (Q1662434)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Hierarchical clustering of large databases and classification of antibiotics at high noise levels |
scientific article |
Statements
Hierarchical clustering of large databases and classification of antibiotics at high noise levels (English)
0 references
20 August 2018
0 references
Summary: A new algorithm for divisive hierarchical clustering of chemical compounds based on 2D structural fragments is suggested. The algorithm is deterministic, and given a random ordering of the input, will always give the same clustering and can process a database up to 2 million records on a standard PC. The algorithm was used for classification of 1,183 antibiotics mixed with 999,994 random chemical structures. Similarity threshold, at which best separation of active and non active compounds took place, was estimated as 0.6. 85.7\% of the antibiotics were successfully classified at this threshold with 0.4\% of inaccurate compounds. A .sdf file was created with the probe molecules for clustering of external databases.
0 references
molecular structure
0 references
hierarchical clustering
0 references
algorithm
0 references
classification of antibiotics
0 references