Hierarchical clustering of large databases and classification of antibiotics at high noise levels (Q1662434)
scientific article; zbMATH DE number 6920412
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Hierarchical clustering of large databases and classification of antibiotics at high noise levels | scientific article; zbMATH DE number 6920412 | |
Statements
Hierarchical clustering of large databases and classification of antibiotics at high noise levels (English)
20 August 2018
Summary: A new algorithm for divisive hierarchical clustering of chemical compounds based on 2D structural fragments is proposed. The algorithm is deterministic: it produces the same clustering regardless of the order of the input, and it can process a database of up to 2 million records on a standard PC. The algorithm was used to classify 1,183 antibiotics mixed with 999,994 random chemical structures. The similarity threshold at which active and inactive compounds were best separated was estimated as 0.6. At this threshold, 85.7% of the antibiotics were successfully classified, with 0.4% of inaccurate compounds. An .sdf file with the probe molecules was created for clustering external databases.
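The summary does not state which similarity measure or fragment encoding the article uses. As a rough illustration only, the sketch below assumes Tanimoto similarity over sets of fragment identifiers and shows how a 0.6 threshold could be applied when matching a compound against probe molecules. The names `tanimoto` and `assign_to_probe` and the fragment strings are hypothetical and are not taken from the article.

```python
from typing import Dict, FrozenSet, Optional

Fragments = FrozenSet[str]


def tanimoto(a: Fragments, b: Fragments) -> float:
    """Tanimoto (Jaccard) similarity of two fragment sets (assumed measure)."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def assign_to_probe(
    query: Fragments,
    probes: Dict[str, Fragments],
    threshold: float = 0.6,  # threshold value reported in the summary
) -> Optional[str]:
    """Return the most similar probe at or above the threshold, else None.

    Probes are visited in sorted name order, so the result does not depend
    on the order in which they were supplied.
    """
    best_name: Optional[str] = None
    best_sim = 0.0
    for name in sorted(probes):
        sim = tanimoto(query, probes[name])
        if sim >= threshold and sim > best_sim:
            best_name, best_sim = name, sim
    return best_name


# Hypothetical fragment sets, for illustration only.
probes = {
    "probe_a": frozenset({"C=O", "N", "c1ccccc1"}),
    "probe_b": frozenset({"S", "Cl"}),
}
hit = assign_to_probe(frozenset({"C=O", "N", "c1ccccc1", "O"}), probes)
miss = assign_to_probe(frozenset({"S", "Br", "F", "I"}), probes)
```

Here `hit` shares 3 of 4 fragments with `probe_a` (similarity 0.75) and is assigned to it, while `miss` stays below the threshold for every probe and is left unassigned.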
molecular structure
hierarchical clustering
algorithm
classification of antibiotics