A phase transition for the score in matching random sequences allowing deletions (Q1327613): Difference between revisions
From MaRDI portal
Set profile property. |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1214/aoap/1177005208 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2041976742 / rank | |||
Normal rank |
Latest revision as of 21:36, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | A phase transition for the score in matching random sequences allowing deletions |
scientific article |
Statements
A phase transition for the score in matching random sequences allowing deletions (English)
0 references
2 April 1995
0 references
The paper examines a sequence matching problem (known as string matching in the computer science literature) involving the optional alignment score for contiguous subsequences, rewarding matches and penalizing for deletions and mismatches. This score is used by biologists comparing pairs of DNA or protein sequences. It is proved that for two sequences of length \(n\), as \(n \to \infty\), there is a phase transition between linear growth in \(n\), when the penalty parameters are small, and logarithmic growth in \(n\), when the penalties are large. The results are valid for independent sequences with iid or Markov letters. The phase transition is also established for more general scoring schemes allowing general letter-to-letter alignment penalties and block deletion penalties. A general method for applying the bounded increments martingale method to Lipschitz functionals of marked processes is given.
0 references
longest common subsequence
0 references
large deviations
0 references
Azuma-Hoeffding
0 references
percolation
0 references
sequence matching problem
0 references
string matching
0 references
optional alignment score
0 references
contiguous subsequences
0 references
rewarding matches
0 references
deletions
0 references
mismatches
0 references
DNA
0 references
protein sequences
0 references
phase transition
0 references
linear growth
0 references
logarithmic growth
0 references
general letter-to-letter alignment penalties
0 references
block deletion penalties
0 references
bounded increments martingale method
0 references
Lipschitz functionals of marked processes
0 references