Literature DB >> 19407347

The gene-duplication problem: near-linear time algorithms for NNI-based local searches.

Mukul S Bansal1, Oliver Eulenstein, André Wehe.   

Abstract

The gene-duplication problem is to infer a species supertree from a collection of gene trees that are confounded by complex histories of gene-duplication events. This problem is NP-complete and thus requires efficient and effective heuristics. Existing heuristics perform a stepwise search of the tree space, where each step is guided by an exact solution to an instance of a local search problem. A classical local search problem is the {\tt NNI} search problem, which is based on the nearest neighbor interchange operation. In this work, we 1) provide a novel near-linear time algorithm for the {\tt NNI} search problem, 2) introduce extensions that significantly enlarge the search space of the {\tt NNI} search problem, and 3) present algorithms for these extended versions that are asymptotically just as efficient as our algorithm for the {\tt NNI} search problem. The exceptional speedup achieved in the extended {\tt NNI} search problems makes the gene-duplication problem more tractable for large-scale phylogenetic analyses. We verify the performance of our algorithms in a comparison study using sets of large randomly generated gene trees.

Mesh:

Year:  2009        PMID: 19407347     DOI: 10.1109/TCBB.2009.7

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  4 in total

1.  Are the duplication cost and Robinson-Foulds distance equivalent?

Authors:  Yu Zheng; Louxin Zhang
Journal:  J Comput Biol       Date:  2014-07-02       Impact factor: 1.479

2.  Robinson-Foulds supertrees.

Authors:  Mukul S Bansal; J Gordon Burleigh; Oliver Eulenstein; David Fernández-Baca
Journal:  Algorithms Mol Biol       Date:  2010-02-24       Impact factor: 1.405

3.  Efficient genome-scale phylogenetic analysis under the duplication-loss and deep coalescence cost models.

Authors:  Mukul S Bansal; J Gordon Burleigh; Oliver Eulenstein
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

4.  Triplet supertree heuristics for the tree of life.

Authors:  Harris T Lin; J Gordon Burleigh; Oliver Eulenstein
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.