On the string consensus problem and the Manhattan sequence consensus problem

From MaRDI portal
Publication:1698720

DOI10.1016/J.TCS.2017.03.022zbMATH Open1387.68311arXiv1407.6144OpenAlexW2599391063MaRDI QIDQ1698720FDOQ1698720


Authors: Tomasz Kociumaka, Jakub W. Pachocki, Jakub Radoszewski, Wojciech Rytter, Tomasz Waleń Edit this on Wikidata


Publication date: 16 February 2018

Published in: Theoretical Computer Science (Search for Journal in Brave)

Abstract: In the Manhattan Sequence Consensus problem (MSC problem) we are given k integer sequences, each of length l, and we are to find an integer sequence x of length l (called a consensus sequence), such that the maximum Manhattan distance of x from each of the input sequences is minimized. For binary sequences Manhattan distance coincides with Hamming distance, hence in this case the string consensus problem (also called string center problem or closest string problem) is a special case of MSC. Our main result is a practically efficient O(l)-time algorithm solving MSC for kle5 sequences. Practicality of our algorithms has been verified experimentally. It improves upon the quadratic algorithm by Amir et al. (SPIRE 2012) for string consensus problem for k=5 binary strings. Similarly as in Amir's algorithm we use a column-based framework. We replace the implied general integer linear programming by its easy special cases, due to combinatorial properties of the MSC for kle5. We also show that for a general parameter k any instance can be reduced in linear time to a kernel of size k!, so the problem is fixed-parameter tractable. Nevertheless, for kge4 this is still too large for any naive solution to be feasible in practice.


Full work available at URL: https://arxiv.org/abs/1407.6144




Recommendations




Cites Work


Cited In (3)

Uses Software





This page was built for publication: On the string consensus problem and the Manhattan sequence consensus problem

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1698720)