Poisson, compound Poisson and process approximations for testing statistical significance in sequence comparisons (Q1194425)
From MaRDI portal
Property / describes a project that uses: FASTA (normal rank)
Revision as of 12:57, 29 February 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Poisson, compound Poisson and process approximations for testing statistical significance in sequence comparisons | scientific article | |
Statements
Poisson, compound Poisson and process approximations for testing statistical significance in sequence comparisons (English)
27 September 1992
Approximate distributions, under the null hypothesis of sequence independence, are obtained for certain scores that arise when DNA or protein sequences are compared. Two sequences are compared along a diagonal; deletions and insertions are not considered. The scores studied are the total number of quality \(q/k\) \(k\)-word matches on a diagonal (the number of \(k\)-words with \(q\) or more letter agreements), the number of clumps on a diagonal (clumps of such matches are defined in the paper), and the maximum clump size over all diagonals. The approximating distributions are Poisson, compound Poisson, and integerized extreme value. The theorems also give bounds on the approximation errors. The paper concludes with some data analysis (including significance tests) and references to open problems.
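The first score described above can be illustrated with a minimal Python sketch. It is not the paper's code: the function names, the `offset` convention for choosing a diagonal, and the i.i.d. agreement probability `p` are assumptions made here for illustration. The sketch counts quality \(q/k\) \(k\)-word matches on one gap-free diagonal, computes the expected count under an i.i.d. null by linearity of expectation, and evaluates a generic Poisson upper tail. The Poisson mean that is appropriate for a given score, and the definition of a clump, must be taken from the paper's theorems and are not reproduced here.

```python
from math import comb, exp


def diagonal_indicators(a, b, offset=0):
    """Return 0/1 letter-agreement indicators along one diagonal (no gaps).

    Assumed convention: offset >= 0 shifts the start of `a`,
    offset < 0 shifts the start of `b`.
    """
    if offset >= 0:
        pairs = zip(a[offset:], b)
    else:
        pairs = zip(a, b[-offset:])
    return [1 if x == y else 0 for x, y in pairs]


def count_quality_matches(indicators, k, q):
    """Count length-k windows that contain at least q letter agreements."""
    n = len(indicators)
    if n < k:
        return 0
    window = sum(indicators[:k])          # agreements in the first window
    count = 1 if window >= q else 0
    for i in range(k, n):
        # Slide the window one position to the right.
        window += indicators[i] - indicators[i - k]
        if window >= q:
            count += 1
    return count


def expected_quality_matches(n, k, q, p):
    """Expected number of q/k matches on a diagonal of length n.

    Assumes (hypothetically) that letter agreements are i.i.d. with
    probability p; the result follows from linearity of expectation.
    """
    if n < k:
        return 0.0
    tail = sum(comb(k, j) * p**j * (1 - p) ** (k - j) for j in range(q, k + 1))
    return (n - k + 1) * tail


def poisson_upper_tail(x, lam):
    """Return P(N >= x) for N ~ Poisson(lam), with x a non-negative integer."""
    term = exp(-lam)                      # P(N = 0)
    cdf = 0.0
    for i in range(x):
        cdf += term
        term *= lam / (i + 1)             # P(N = i + 1) from P(N = i)
    return max(0.0, 1.0 - cdf)
```

For example, `count_quality_matches(diagonal_indicators("ACGTACGT", "ACGAACGT"), k=4, q=3)` counts the 4-letter windows with at least three agreements on the main diagonal. Overlapping windows tend to occur in clumps, which is why the paper treats the clump count and the total match count with different approximations.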
DNA sequence comparisons
extreme value distribution
Poisson approximation
word matches
distribution of order statistics
longest exact matching word
dynamic programming approach
null hypothesis of sequence independence
protein sequence comparisons
approximate distributions
compound Poisson
significance tests