Literature DB >> 23057820

A clique-based method using dynamic programming for computing edit distance between unordered trees.

Tomoya Mori1, Takeyuki Tamura, Daiji Fukagawa, Atsuhiro Takasu, Etsuji Tomita, Tatsuya Akutsu.   

Abstract

Many kinds of tree-structured data, such as RNA secondary structures, have become available due to the progress of techniques in the field of molecular biology. To analyze the tree-structured data, various measures for computing the similarity between them have been developed and applied. Among them, tree edit distance is one of the most widely used measures. However, the tree edit distance problem for unordered trees is NP-hard. Therefore, it is required to develop efficient algorithms for the problem. Recently, a practical method called clique-based algorithm has been proposed, but it is not fast for large trees. This article presents an improved clique-based method for the tree edit distance problem for unordered trees. The improved method is obtained by introducing a dynamic programming scheme and heuristic techniques to the previous clique-based method. To evaluate the efficiency of the improved method, we applied the method to comparison of real tree structured data such as glycan structures. For large tree-structures, the improved method is much faster than the previous method. In particular, for hard instances, the improved method achieved more than 100 times speed-up.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23057820      PMCID: PMC3469208          DOI: 10.1089/cmb.2012.0133

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  6 in total

1.  A general edit distance between RNA structures.

Authors:  Tao Jiang; Guohui Lin; Bin Ma; Kaizhong Zhang
Journal:  J Comput Biol       Date:  2002       Impact factor: 1.479

2.  Designing an A* algorithm for calculating edit distance between rooted-unordered trees.

Authors:  Yair Horesh; Ramit Mehr; Ron Unger
Journal:  J Comput Biol       Date:  2006 Jul-Aug       Impact factor: 1.479

3.  System for the analysis and visualization of large 3D anatomical trees.

Authors:  Kun-Chang Yu; Erik L Ritman; William E Higgins
Journal:  Comput Biol Med       Date:  2007-07-31       Impact factor: 4.589

4.  KCaM (KEGG Carbohydrate Matcher): a software tool for analyzing the structures of carbohydrate sugar chains.

Authors:  Kiyoko F Aoki; Atsuko Yamaguchi; Nobuhisa Ueda; Tatsuya Akutsu; Hiroshi Mamitsuka; Susumu Goto; Minoru Kanehisa
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

5.  A clique-based method for the edit distance between unordered trees and its application to analysis of glycan structures.

Authors:  Daiji Fukagawa; Takeyuki Tamura; Atsuhiro Takasu; Etsuji Tomita; Tatsuya Akutsu
Journal:  BMC Bioinformatics       Date:  2011-02-15       Impact factor: 3.169

6.  KEGG for representation and analysis of molecular networks involving diseases and drugs.

Authors:  Minoru Kanehisa; Susumu Goto; Miho Furumichi; Mao Tanabe; Mika Hirakawa
Journal:  Nucleic Acids Res       Date:  2009-10-30       Impact factor: 16.971

  6 in total
  1 in total

1.  PhISCS: a combinatorial approach for subperfect tumor phylogeny reconstruction via integrative use of single-cell and bulk sequencing data.

Authors:  Salem Malikic; Farid Rashidi Mehrabadi; Simone Ciccolella; Md Khaledur Rahman; Camir Ricketts; Ehsan Haghshenas; Daniel Seidman; Faraz Hach; Iman Hajirasouliha; S Cenk Sahinalp
Journal:  Genome Res       Date:  2019-10-18       Impact factor: 9.043

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.