An efficient and accurate distance based algorithm to reconstruct tandem duplication trees.

Olivier Elemento, Olivier Gascuel.

Abstract

The problem of reconstructing the duplication tree of a set of tandemly repeated sequences, which are assumed to have arisen through unequal recombination, was first introduced by Fitch (1977, Genetics, 86, 93-104) and has recently received a lot of attention. In this paper, we describe DTSCORE, a fast, statistically consistent, distance-based algorithm for reconstructing tandem duplication trees. As a cousin of the ADDTREE algorithm (Sattath and Tversky, 1977, Psychometrika, 42, 319-345), the raw DTSCORE has a time complexity in O(n^5), where n is the number of observed repeated sequences. Through a series of algorithmic refinements, we improve its complexity to O(n^4) in the worst case, but stress that the refined DTSCORE algorithm should perform faster with real data. We assess the topological accuracy of DTSCORE using simulated data sets and compare it to existing reconstruction methods. The results clearly show that DTSCORE is more accurate than all the other methods we studied. Finally, we report the results of DTSCORE on a real dataset. SUPPLEMENTARY INFORMATION: http://www.lirmm.fr/w3ifa/MAAS/
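
The abstract does not spell out DTSCORE's scoring rule, so the following Python sketch is not the authors' implementation. It only illustrates the four-point quartet counting that ADDTREE-style methods use to choose which pair to agglomerate. The function names (`four_point_score`, `best_adjacent_pair`) and the restriction of candidates to adjacent positions in the tandem array are assumptions made for illustration.

```python
from itertools import combinations


def four_point_score(d, i, j):
    """Count the quartets {i, j, k, l} whose distances favour grouping i with j.

    `d` is a symmetric distance matrix given as a list of lists.
    A quartet supports the pair (i, j) when d[i][j] + d[k][l] is strictly
    smaller than both alternative pairings (the four-point criterion).
    """
    n = len(d)
    others = [x for x in range(n) if x != i and x != j]
    score = 0
    for k, l in combinations(others, 2):
        ij_kl = d[i][j] + d[k][l]
        ik_jl = d[i][k] + d[j][l]
        il_jk = d[i][l] + d[j][k]
        if ij_kl < min(ik_jl, il_jk):
            score += 1
    return score


def best_adjacent_pair(d):
    """Return the adjacent pair (i, i + 1) with the highest score.

    Assumption for illustration: only neighbours in the tandem array are
    candidates. Ties are resolved in favour of the leftmost pair.
    """
    n = len(d)
    candidates = ((i, i + 1) for i in range(n - 1))
    return max(candidates, key=lambda pair: four_point_score(d, *pair))
```

Scoring a single pair inspects O(n^2) other pairs. If every one of the O(n^2) possible pairs is scored at each of roughly n agglomeration steps, the total cost is O(n^5), which matches the figure quoted for the raw algorithm. How the refined algorithm reaches O(n^4) is not described in the abstract.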

Year:  2002        PMID: 12385990     DOI: 10.1093/bioinformatics/18.suppl_2.s92

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Algebraic Dynamic Programming over general data structures.

Authors:  Christian Höner zu Siederdissen; Sonja J Prohaska; Peter F Stadler
Journal:  BMC Bioinformatics       Date:  2015-12-16       Impact factor: 3.169

2.  TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

Authors:  Marco Pellegrini; M Elena Renda; Alessio Vecchio
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

3.  Estimation of duplication history under a stochastic model for tandem repeats.

Authors:  Farzad Farnoud; Moshe Schwartz; Jehoshua Bruck
Journal:  BMC Bioinformatics       Date:  2019-02-06       Impact factor: 3.169

4.  Tandemly arrayed genes in vertebrate genomes.

Authors:  Deng Pan; Liqing Zhang
Journal:  Comp Funct Genomics       Date:  2008

5.  Evolution of C2H2-zinc finger genes and subfamilies in mammals: species-specific duplication and loss of clusters, genes and effector domains.

Authors:  Hamsa D Tadepally; Gertraud Burger; Muriel Aubry
Journal:  BMC Evol Biol       Date:  2008-06-18       Impact factor: 3.260

6.  Fast NJ-like algorithms to deal with incomplete distance matrices.

Authors:  Alexis Criscuolo; Olivier Gascuel
Journal:  BMC Bioinformatics       Date:  2008-03-26       Impact factor: 3.169
