Minimum common string partition problem: hardness and approximations (Q2571293)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Minimum common string partition problem: hardness and approximations
scientific article

    Statements

    Minimum common string partition problem: hardness and approximations (English)
    0 references
    0 references
    0 references
    0 references
    1 November 2005
    0 references
    Summary: String comparison is a fundamental problem in computer science, with applications in areas such as computational biology, text processing and compression. We address the minimum common string partition problem, a string comparison problem with tight connection to the problem of sorting by reversals with duplicates, a key problem in genome rearrangement. A partition of a string \(A\) is a sequence \({\mathcal P}=(P_1,P_2,\dots,P_m)\) of strings, called the blocks, whose concatenation is equal to \(A\). Given a partition \({\mathcal P}\) of a string \(A\) and a partition \({\mathcal Q}\) of a string \(B\), we say that the pair \(\langle {\mathcal P},{\mathcal Q}\rangle\) is a common partition of \(A\) and \(B\) if \({\mathcal Q}\) is a permutation of \({\mathcal P}\). The minimum common string partition problem (MCSP) is to find a common partition of two strings \(A\) and \(B\) with the minimum number of blocks. The restricted version of MCSP where each letter occurs at most \(k\) times in each input string, is denoted by \(k\)-MCSP. In this paper, we show that 2-MCSP (and therefore MCSP) is NP-hard and, moreover, even APX-hard. We describe a 1.1037-approximation for 2-MCSP and a linear time 4-approximation algorithm for 3-MCSP. We are not aware of any better approximations.
    0 references
    String comparison
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references