Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 The gene-duplication problem: near-linear time algorithms for NNI-based local searches.

Literature DB >> 19407347

The gene-duplication problem: near-linear time algorithms for NNI-based local searches.

Mukul S Bansal¹, Oliver Eulenstein, André Wehe.

Abstract

The gene-duplication problem is to infer a species supertree from a collection of gene trees that are confounded by complex histories of gene-duplication events. This problem is NP-complete and thus requires efficient and effective heuristics. Existing heuristics perform a stepwise search of the tree space, where each step is guided by an exact solution to an instance of a local search problem. A classical local search problem is the {\tt NNI} search problem, which is based on the nearest neighbor interchange operation. In this work, we 1) provide a novel near-linear time algorithm for the {\tt NNI} search problem, 2) introduce extensions that significantly enlarge the search space of the {\tt NNI} search problem, and 3) present algorithms for these extended versions that are asymptotically just as efficient as our algorithm for the {\tt NNI} search problem. The exceptional speedup achieved in the extended {\tt NNI} search problems makes the gene-duplication problem more tractable for large-scale phylogenetic analyses. We verify the performance of our algorithms in a comparison study using sets of large randomly generated gene trees.

Mesh：

Year: 2009 PMID： 19407347 DOI： 10.1109/TCBB.2009.7

Source DB: PubMed Journal: IEEE/ACM Trans Comput Biol Bioinform ISSN： 1545-5963 Impact factor: 3.710

Keyword Cloud
Cited

4 in total

The gene-duplication problem: near-linear time algorithms for NNI-based local searches.

1. Are the duplication cost and Robinson-Foulds distance equivalent?

2. Robinson-Foulds supertrees.

3. Efficient genome-scale phylogenetic analysis under the duplication-loss and deep coalescence cost models.

4. Triplet supertree heuristics for the tree of life.