Extreme value theory in some statistical analysis of genomic sequences (Q2463683)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Extreme value theory in some statistical analysis of genomic sequences |
scientific article |
Statements
Extreme value theory in some statistical analysis of genomic sequences (English)
0 references
16 December 2007
0 references
The authors consider the problem of multiple alignments of protein-coding DNA sequences. A concept of multiple alignment profiles for a domain of protein sequences is introduced. It is assumed that the profiles are random and possibly with gaps. Statistical significance of the profile scores is estimated by deriving the distribution of their maximum. It is shown that the tail distributions of the scores behave like a mixed normal distribution. An application to the Immunoglobulin domain (Ig) and results of simulations are presented.
0 references
maximum profile scores
0 references
protein profiles
0 references
sequence alignments
0 references