Cache Oblivious Algorithms for Computing the Triplet Distance Between Trees

Publication:5111707

DOI: 10.4230/LIPICS.ESA.2017.21; zbMATH Open: 1442.68289; arXiv: 1706.10284; OpenAlex: W2963351835; MaRDI QID: Q5111707; FDO: Q5111707

Konstantinos Mampentzidis, Gerth Stølting Brodal

Publication date: 27 May 2020

Abstract: We study the problem of computing the triplet distance between two rooted unordered trees with $n$ labeled leaves. Introduced by Dobson in 1975, the triplet distance is the number of leaf triples that induce different topologies in the two trees. The current theoretically best algorithm is an $\mathrm{O}(n \log n)$ time algorithm by Brodal et al. (SODA 2013). Recently Jansson and Rajaby proposed a new algorithm that, while slower in theory, requiring $\mathrm{O}(n \log^3 n)$ time, outperforms the theoretically faster $\mathrm{O}(n \log n)$ algorithm in practice. Neither algorithm scales to external memory. We present two cache oblivious algorithms that combine the best of both worlds. The first algorithm handles the case where both input trees are binary, and the second is a generalized algorithm for two input trees of arbitrary degree. Analyzed in the RAM model, both algorithms require $\mathrm{O}(n \log n)$ time, and in the cache oblivious model $\mathrm{O}(\frac{n}{B} \log_2 \frac{n}{M})$ I/Os. Their relative simplicity and the fact that they scale to external memory make them achieve the best practical performance. We note that these are the first algorithms that scale to external memory, both in theory and practice, for this problem.
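
To make the definition of the triplet distance concrete, the following is a minimal brute-force sketch that checks every leaf triple directly. It is not one of the algorithms described in the abstract: it takes cubic time in the number of leaves and is meant only as an illustration of what is being counted. The nested-tuple tree encoding and the function names (leaf_paths, lca_depth, outlier, triplet_distance) are assumptions made for this example.

```python
from itertools import combinations


def leaf_paths(tree):
    """Map each leaf label to the tuple of child indices on its root-to-leaf path.

    Assumed encoding: an internal node is a tuple of children, a leaf is a string.
    """
    paths = {}

    def walk(node, path):
        if isinstance(node, tuple):
            for i, child in enumerate(node):
                walk(child, path + (i,))
        else:
            paths[node] = path

    walk(tree, ())
    return paths


def lca_depth(p, q):
    """Depth of the lowest common ancestor = length of the common path prefix."""
    depth = 0
    for x, y in zip(p, q):
        if x != y:
            break
        depth += 1
    return depth


def outlier(paths, a, b, c):
    """Return the leaf that is split off first, or None for an unresolved triple."""
    ab = lca_depth(paths[a], paths[b])
    ac = lca_depth(paths[a], paths[c])
    bc = lca_depth(paths[b], paths[c])
    if ab > ac:   # a and b share the deepest ancestor, so the topology is ab|c
        return c
    if ac > ab:   # topology ac|b
        return b
    if bc > ab:   # topology bc|a
        return a
    return None   # all three LCAs coincide (only possible in non-binary trees)


def triplet_distance(t1, t2):
    """Count the leaf triples whose induced topology differs between t1 and t2."""
    p1, p2 = leaf_paths(t1), leaf_paths(t2)
    if set(p1) != set(p2):
        raise ValueError("both trees must have the same leaf labels")
    return sum(
        outlier(p1, a, b, c) != outlier(p2, a, b, c)
        for a, b, c in combinations(sorted(p1), 3)
    )
```

For example, triplet_distance((("a", "b"), ("c", "d")), (("a", "c"), ("b", "d"))) counts all four triples as different, while reordering the children of a node leaves the distance at zero, since the trees are unordered.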


Full work available at URL: https://arxiv.org/abs/1706.10284




