A phase transition for the score in matching random sequences allowing deletions (Q1327613)

From MaRDI portal
scientific article
Language Label Description Also known as
English
A phase transition for the score in matching random sequences allowing deletions
scientific article

    Statements

    A phase transition for the score in matching random sequences allowing deletions (English)
    0 references
    0 references
    0 references
    2 April 1995
    0 references
    The paper examines a sequence matching problem (known as string matching in the computer science literature) involving the optional alignment score for contiguous subsequences, rewarding matches and penalizing for deletions and mismatches. This score is used by biologists comparing pairs of DNA or protein sequences. It is proved that for two sequences of length \(n\), as \(n \to \infty\), there is a phase transition between linear growth in \(n\), when the penalty parameters are small, and logarithmic growth in \(n\), when the penalties are large. The results are valid for independent sequences with iid or Markov letters. The phase transition is also established for more general scoring schemes allowing general letter-to-letter alignment penalties and block deletion penalties. A general method for applying the bounded increments martingale method to Lipschitz functionals of marked processes is given.
    0 references
    0 references
    0 references
    0 references
    0 references
    longest common subsequence
    0 references
    large deviations
    0 references
    Azuma-Hoeffding
    0 references
    percolation
    0 references
    sequence matching problem
    0 references
    string matching
    0 references
    optional alignment score
    0 references
    contiguous subsequences
    0 references
    rewarding matches
    0 references
    deletions
    0 references
    mismatches
    0 references
    DNA
    0 references
    protein sequences
    0 references
    phase transition
    0 references
    linear growth
    0 references
    logarithmic growth
    0 references
    general letter-to-letter alignment penalties
    0 references
    block deletion penalties
    0 references
    bounded increments martingale method
    0 references
    Lipschitz functionals of marked processes
    0 references
    0 references