Literature DB >> 16901235

Designing an A* algorithm for calculating edit distance between rooted-unordered trees.

Yair Horesh1, Ramit Mehr, Ron Unger.   

Abstract

Tree structures are useful for describing and analyzing biological objects and processes. Consequently, there is a need to design metrics and algorithms to compare trees. A natural comparison metric is the "Tree Edit Distance," the number of simple edit (insert/delete) operations needed to transform one tree into the other. Rooted-ordered trees, where the order between the siblings is significant, can be compared in polynomial time. Rooted-unordered trees are used to describe processes or objects where the topology, rather than the order or the identity of each node, is important. For example, in immunology, rooted-unordered trees describe the process of immunoglobulin (antibody) gene diversification in the germinal center over time. Comparing such trees has been proven to be a difficult computational problem that belongs to the set of NP-Complete problems. Comparing two trees can be viewed as a search problem in graphs. A* is a search algorithm that explores the search space in an efficient order. Using a good lower bound estimation of the degree of difference between the two trees, A* can reduce search time dramatically. We have designed and implemented a variant of the A* search algorithm suitable for calculating tree edit distance. We show here that A* is able to perform an edit distance measurement in reasonable time for trees with dozens of nodes.

Mesh:

Substances:

Year:  2006        PMID: 16901235     DOI: 10.1089/cmb.2006.13.1165

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  Complexity of the human memory B-cell compartment is determined by the versatility of clonal diversification in germinal centers.

Authors:  Bettina Budeus; Stefanie Schweigle de Reynoso; Martina Przekopowitz; Daniel Hoffmann; Marc Seifert; Ralf Küppers
Journal:  Proc Natl Acad Sci U S A       Date:  2015-08-31       Impact factor: 11.205

2.  A clique-based method using dynamic programming for computing edit distance between unordered trees.

Authors:  Tomoya Mori; Takeyuki Tamura; Daiji Fukagawa; Atsuhiro Takasu; Etsuji Tomita; Tatsuya Akutsu
Journal:  J Comput Biol       Date:  2012-10       Impact factor: 1.479

3.  A clique-based method for the edit distance between unordered trees and its application to analysis of glycan structures.

Authors:  Daiji Fukagawa; Takeyuki Tamura; Atsuhiro Takasu; Etsuji Tomita; Tatsuya Akutsu
Journal:  BMC Bioinformatics       Date:  2011-02-15       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.