Fast structural alignment of biomolecules using a hash table, n-grams and string descriptors (Q1662475)

From MaRDI portal





scientific article; zbMATH DE number 6920452
Language Label Description Also known as
default for all languages
No label defined
    English
    Fast structural alignment of biomolecules using a hash table, n-grams and string descriptors
    scientific article; zbMATH DE number 6920452

      Statements

      Fast structural alignment of biomolecules using a hash table, n-grams and string descriptors (English)
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      20 August 2018
      0 references
      Summary: This work presents a generalized approach for the fast structural alignment of thousands of macromolecular structures. The method uses string representations of a macromolecular structure and a hash table that stores n-grams of a certain size for searching. To this end, macromolecular structure-to-string translators were implemented for protein and RNA structures. A query against the index is performed in two hierarchical steps to unite speed and precision. In the first step the query structure is translated into n-grams, and all target structures containing these n-grams are retrieved from the hash table. In the second step all corresponding n-grams of the query and each target structure are subsequently aligned, and after each alignment a score is calculated based on the matching n-grams of query and target. The extendable framework enables the user to query and structurally align thousands of protein and RNA structures on a commodity machine and is available as open source from \url{http://lajolla.sf.net}.
      0 references
      structural alignment
      0 references
      protein
      0 references
      RNA
      0 references
      hash table
      0 references
      n-gram
      0 references
      torsion angles
      0 references
      0 references
      0 references
      0 references
      0 references

      Identifiers