scispace - formally typeset
Open AccessPosted Content

An Even Faster and More Unifying Algorithm for Comparing Trees via Unbalanced Bipartite Matchings

Reads0
Chats0
TLDR
In this article, the authors presented an algorithm for comparing trees that are labeled in an arbitrary manner, which is faster than the previous algorithms and is at the core of their maximum agreement subtree algorithm.
Abstract
A widely used method for determining the similarity of two labeled trees is to compute a maximum agreement subtree of the two trees. Previous work on this similarity measure is only concerned with the comparison of labeled trees of two special kinds, namely, uniformly labeled trees (i.e., trees with all their nodes labeled by the same symbol) and evolutionary trees (i.e., leaf-labeled trees with distinct symbols for distinct leaves). This paper presents an algorithm for comparing trees that are labeled in an arbitrary manner. In addition to this generality, this algorithm is faster than the previous algorithms. Another contribution of this paper is on maximum weight bipartite matchings. We show how to speed up the best known matching algorithms when the input graphs are node-unbalanced or weight-unbalanced. Based on these enhancements, we obtain an efficient algorithm for a new matching problem called the hierarchical bipartite matching problem, which is at the core of our maximum agreement subtree algorithm.

read more

Citations
More filters
Journal ArticleDOI

Tree Edit Distance and Maximum Agreement Subtree

TL;DR: It is shown that, for an arbitrary tree edit distance metric that is a derivative of the Tai tree edits distance metric, there exists a MAST-like pattern extraction problem, named Mostly Adjusted Sub-Forest (MASF) problem, such that computing a distance between trees x and y is equivalent to finding an optimal pattern shared betweenx and y.
Proceedings ArticleDOI

Clustering of Positions in Nucleotide Sequences by Trim Distance

TL;DR: This paper forms another trim distance based on the MAST distance, in contrast to the previous trim distancebased on the LCA-preserving distance, and applies a group average method in agglomerative hierarchical clustering to the positions in nucleotide sequences by the trim distances toucleotide sequences of influenza A (H1N1) viruses.
Journal ArticleDOI

On the approximability of the Maximum Agreement SubTree and Maximum Compatible Tree problems

TL;DR: The aim of this paper is to give a complete picture of approximability for two tree consensus problems which are of particular interest in computational biology: Maximum Agreement SubTree (MAST) and Maximum Compatible Tree (MCT).
Journal ArticleDOI

An efficient strategy for generating all descendant subtree patterns from phylogenetic trees with its implementation

TL;DR: This paper efficiently solve two subtree-comparison problems on a set of phylogenetic trees which have practical applications to analyze the evolution and co-evolution genes clustering of genomic sequences.
Journal ArticleDOI

A Faster Algorithm for Computing the Kernel of Maximum Agreement Subtrees

TL;DR: The construction of the kernel agreement subtree (KAST) is the focus of this paper and an time algorithm for computing the KAST.
References
More filters
Book

Introduction to Algorithms

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.
Journal ArticleDOI

Faster scaling algorithms for network problems

TL;DR: This paper presents algorithms for the assignment problem, the transportation problem, and the minimum- cost flow problem of operations research that find a minimum-cost solution, yet run in time close to the best-known bounds for the corresponding problems without costs.
Journal ArticleDOI

Comparing multiple RNA secondary structures using tree comparisons

TL;DR: This paper presents another approach to the problem of comparing many secondary structures by utilizing a very efficient tree-matching algorithm that will compare two trees in O([T1] X [T2] X L1 X L2) in the worst case and very close to O[T1?] for average trees representing secondary structures.
Journal ArticleDOI

Obtaining common pruned trees

TL;DR: The tree obtained by regrafting branches on to a largest common pruned tree is shown to contain all the classes present in the strict consensus tree.
Related Papers (5)