Sublinear space algorithms for the longest common substring problem

From MaRDI portal




Abstract: Given m documents of total length n, we consider the problem of finding a longest string common to at least dgeq2 of the documents. This problem is known as the emph{longest common substring (LCS) problem} and has a classic O(n) space and O(n) time solution (Weiner [FOCS'73], Hui [CPM'92]). However, the use of linear space is impractical in many applications. In this paper we show that for any trade-off parameter 1leqauleqn, the LCS problem can be solved in O(au) space and O(n2/au) time, thus providing the first smooth deterministic time-space trade-off from constant to linear space. The result uses a new and very simple algorithm, which computes a au-additive approximation to the LCS in O(n2/au) time and O(1) space. We also show a time-space trade-off lower bound for deterministic branching programs, which implies that any deterministic RAM algorithm solving the LCS problem on documents from a sufficiently large alphabet in O(au) space must use Omega(nsqrtlog(n/(aulogn))/loglog(n/(aulogn)) time.









This page was built for publication: Sublinear space algorithms for the longest common substring problem

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2921446)