| Literature DB >> 23394478 |
Abstract
BACKGROUND: It is generally admitted that the species tree cannot be inferred from the genetic sequences of a single gene because the evolution of different genes, and thus the gene tree topologies, may vary substantially. Gene trees can differ, for example, because of horizontal transfer events or because some of them correspond to paralogous instead of orthologous sequences. A variety of methods has been proposed to tackle the problem of the reconciliation of gene trees in order to reconstruct a species tree. When the taxa in all the trees are identical, the problem can be stated as a consensus tree problem.Entities:
Mesh:
Year: 2013 PMID: 23394478 PMCID: PMC3599424 DOI: 10.1186/1471-2105-14-46
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Five -trees of 7 leaves.
The whole set of bipartitions in the trees of Figure 1
| 1 | x | x | | | | 1 2 | 3 4 5 6 7 |
| 2 | x | x | | | x | 1 2 3 | 4 5 6 7 |
| 3 | x | | x | x | x | 1 2 3 4 5 | 6 7 |
| 4 | x | | | | x | 1 2 3 6 7 | 4 5 |
| 5 | | x | | | | 1 2 3 4 6 | 5 7 |
| 6 | | x | | | | 1 2 3 5 7 | 4 6 |
| 7 | | | x | | x | 1 3 | 2 4 5 6 7 |
| 8 | | | x | x | | 1 3 5 | 2 4 6 7 |
| 9 | | | x | x | | 1 2 3 5 | 4 6 7 |
| 10 | x | 1 5 | 2 3 4 6 7 |
Figure 2The majority consensus tree and an extended one .
Figure 3Consensus trees and of classes and .
Figure 4The Robinson-Foulds similarity without the common denominator (equal to 8) and its average linkage dendrogram.
The generalized score value of partitions of merging two classes
| 20 | | | | 35 | | | 28 | ||||
| 16 | 12 | | | 32 | 16 | | 28 | 39 | |||
| 16 | 12 | 24 | | 26 | 16 | 28 | | | | ||
| 24 | 16 | 20 | 13 |
Figure 5The final tree on genes given by the method.
Results on bootstrap trees
| UR | 8 | 7 | 2623 | 1311500 | 2 | 1304768 |
| trpB | 28 | 15 | 6248 | 3124000 | 2 | 3114271 |
| trpA | 45 | 9 | 3824 | 1912000 | 3 | 1900390 |
| putP | 57 | 17 | 6608 | 3304000 | 2 | 2508400 |
| polB | 119 | 14 | 5331 | 2665500 | 2 | 2639187 |
| icd | 69 | 15 | 5681 | 2840500 | 2 | 2929008 |
| HPI | 76 | 13 | 4971 | 2485500 | 2 | 2467626 |
| pabB | 57 | 8 | 3667 | 1833500 | 2 | 1827846 |
| DR | 12 | 8 | 2685 | 1342500 | 2 | 1335146 |
For each set of bootstrap gene trees, the number of bipartitions (BiP), the number of majority bipartitions (Maj), the weight of the consensus tree (W), its generalized score (), the number of classes of the best multiple consensus (NbClas) and the generalized score of this partition () are indicated.
Generalized scores of the genes for all possible numbers of classes
| 144 | 150 | 174 | 147 | 154 | 139 | 120 | 130 | 140 | |
| 144 | 150 | 135 | 159 | 169 | 136 | 146 | 129 | 140 | |
| 144 | 168 | 147 | 160 | 145 | 155 | 130 | 140 |
The two first rows correspond to the average linkage (AL) algorithm on both similarity indices and the third one corresponds to greedy algorithm (GA) merging the two classes maximizing the score function.