| Literature DB >> 11734060 |
Y I Wolf1, I B Rogozin, N V Grishin, R L Tatusov, E V Koonin.
Abstract
BACKGROUND: The availability of multiple complete genome sequences from diverse taxa prompts the development of new phylogenetic approaches, which attempt to incorporate information derived from comparative analysis of complete gene sets or large subsets thereof. Such attempts are particularly relevant because of the major role of horizontal gene transfer and lineage-specific gene loss, at least in the evolution of prokaryotes.Entities:
Mesh:
Substances:
Year: 2001 PMID: 11734060 PMCID: PMC60490 DOI: 10.1186/1471-2148-1-8
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Distribution of conserved gene pairs among 31 clades of prokaryotes. Closely related genomes: E. coli-Buchnera sp., H. influenzae-P. mutocida, C. trachomatis-C. pneumoniae, P. horikoshii-P. abyssi, M. genitalium-M. pneumoniae-U. urealyticum., H. pyroli – C. jejuni, T. acidophilum-T. volcanium, were treated as a single clade. Nis the total number of conserved gene pairs.
Shared gene pairs in prokaryotic genomes.
| Aer | Sus | Arf | Pyh | Pya | Mej | Met | Has | Tha | Thv | Esc | Vic | Hai | Pam | Buc | Psa | Xyf | Nem | Cac | Mel | Rip | Hep | Caj | Bas | Bah | Lal | Sta | Stp | Myp | Myg | Urn | Myt | SyP | Der | Bob | Trp | Chp | Cht | Aqa | Thm | |
| Aepre | 298 | 263 | 207 | 241 | 123 | 172 | 238 | 212 | 227 | 198 | 61 | 102 | 112 | 138 | 209 | 93 | 92 | 146 | 219 | 53 | 87 | 116 | 190 | 200 | 128 | 149 | 96 | 55 | 54 | 51 | 164 | 82 | 172 | 66 | 64 | 63 | 65 | 75 | 148 | |
| Sulso | 30 | 333 | 242 | 775 | 159 | 211 | 795 | 353 | 352 | 795 | 87 | 124 | 145 | 186 | 333 | 137 | 109 | 234 | 332 | 70 | 106 | 152 | 310 | 313 | 194 | 233 | 126 | 53 | 52 | 46 | 281 | 107 | 241 | 60 | 54 | 63 | 63 | 91 | 219 | |
| Arcfu | 26 | 27 | 260 | 302 | 205 | 277 | 331 | 281 | 293 | 273 | 64 | 130 | 162 | 223 | 327 | 125 | 109 | 218 | 300 | 68 | 115 | 164 | 250 | 267 | 157 | 188 | 130 | 49 | 46 | 57 | 196 | 134 | 222 | 84 | 74 | 65 | 67 | 111 | 190 | |
| Pyrho | 26 | 23 | 26 | 434 | 170 | 221 | 219 | 195 | 207 | 178 | 56 | 96 | 105 | 139 | 170 | 95 | 84 | 112 | 167 | 44 | 77 | 100 | 167 | 172 | 103 | 132 | 91 | 40 | 40 | 44 | 120 | 84 | 151 | 74 | 74 | 53 | 51 | 68 | 162 | |
| Pyrab | 28 | 25 | 28 | 66 | 205 | 250 | 252 | 221 | 237 | 225 | 66 | 116 | 130 | 179 | 220 | 119 | 87 | 140 | 205 | 48 | 96 | 140 | 217 | 215 | 145 | 178 | 99 | 51 | 48 | 53 | 141 | 99 | 179 | 78 | 72 | 62 | 60 | 68 | 196 | |
| Metja | 17 | 16 | 22 | 25 | 27 | 225 | 147 | 134 | 142 | 108 | 43 | 63 | 75 | 85 | 108 | 68 | 54 | 79 | 105 | 35 | 52 | 81 | 99 | 94 | 71 | 88 | 62 | 35 | 33 | 37 | 86 | 74 | 96 | 44 | 46 | 39 | 34 | 62 | 106 | |
| Metth | 20 | 19 | 28 | 28 | 29 | 35 | 224 | 180 | 196 | 162 | 65 | 89 | 108 | 141 | 162 | 110 | 77 | 115 | 147 | 50 | 74 | 104 | 162 | 159 | 120 | 136 | 81 | 42 | 40 | 43 | 132 | 113 | 136 | 58 | 62 | 49 | 49 | 78 | 173 | |
| Halsp | 24 | 24 | 29 | 22 | 24 | 16 | 22 | 270 | 274 | 252 | 75 | 135 | 159 | 220 | 335 | 129 | 123 | 245 | 334 | 74 | 112 | 155 | 284 | 284 | 180 | 222 | 142 | 70 | 63 | 65 | 236 | 142 | 260 | 91 | 84 | 67 | 68 | 86 | 168 | |
| Theac | 23 | 34 | 25 | 21 | 22 | 16 | 19 | 25 | 494 | 238 | 67 | 102 | 108 | 156 | 285 | 102 | 98 | 198 | 273 | 68 | 103 | 122 | 234 | 229 | 147 | 181 | 102 | 46 | 45 | 40 | 213 | 95 | 197 | 53 | 57 | 60 | 62 | 75 | 147 | |
| Thevo | 25 | 33 | 27 | 22 | 24 | 17 | 21 | 26 | 66 | 243 | 69 | 99 | 109 | 148 | 283 | 111 | 104 | 188 | 272 | 73 | 102 | 123 | 222 | 224 | 144 | 177 | 100 | 47 | 47 | 40 | 219 | 100 | 202 | 54 | 54 | 60 | 60 | 80 | 147 | |
| Escco | 8 | 12 | 11 | 7 | 9 | 4 | 7 | 10 | 10 | 10 | 415 | 700 | 826 | 1178 | 1368 | 634 | 491 | 734 | 1000 | 191 | 263 | 378 | 783 | 721 | 452 | 566 | 303 | 136 | 123 | 107 | 544 | 282 | 478 | 198 | 173 | 165 | 159 | 209 | 409 | |
| Vibch | 6 | 7 | 5 | 6 | 6 | 5 | 7 | 6 | 6 | 6 | 20 | 241 | 274 | 362 | 346 | 262 | 216 | 196 | 214 | 113 | 122 | 145 | 213 | 206 | 125 | 177 | 91 | 84 | 81 | 73 | 186 | 83 | 134 | 112 | 100 | 99 | 97 | 101 | 176 | |
| Haein | 8 | 8 | 8 | 7 | 8 | 5 | 6 | 9 | 7 | 7 | 32 | 22 | 684 | 648 | 632 | 358 | 335 | 343 | 418 | 135 | 172 | 231 | 359 | 347 | 273 | 331 | 216 | 105 | 99 | 108 | 252 | 136 | 236 | 140 | 116 | 132 | 126 | 113 | 241 | |
| Pasmu | 7 | 8 | 9 | 7 | 8 | 5 | 7 | 9 | 6 | 6 | 37 | 22 | 54 | 794 | 738 | 415 | 370 | 401 | 482 | 140 | 189 | 277 | 423 | 418 | 286 | 365 | 222 | 110 | 100 | 104 | 268 | 172 | 264 | 145 | 135 | 142 | 138 | 132 | 278 | |
| Bucsp | 6 | 8 | 10 | 6 | 8 | 4 | 6 | 10 | 7 | 6 | 48 | 20 | 34 | 41 | 1256 | 594 | 467 | 648 | 780 | 180 | 231 | 345 | 677 | 648 | 372 | 490 | 286 | 126 | 113 | 110 | 420 | 271 | 403 | 202 | 174 | 166 | 162 | 192 | 372 | |
| Pseae | 8 | 12 | 11 | 6 | 8 | 4 | 6 | 12 | 10 | 10 | 47 | 14 | 24 | 28 | 46 | 704 | 537 | 997 | 1297 | 224 | 268 | 397 | 926 | 849 | 447 | 589 | 330 | 135 | 126 | 122 | 691 | 380 | 624 | 220 | 184 | 181 | 179 | 248 | 419 | |
| Xylfa | 7 | 9 | 8 | 7 | 8 | 5 | 8 | 8 | 7 | 7 | 28 | 24 | 25 | 27 | 30 | 28 | 345 | 461 | 471 | 154 | 175 | 217 | 375 | 355 | 227 | 297 | 162 | 84 | 82 | 85 | 312 | 178 | 271 | 134 | 118 | 125 | 128 | 154 | 239 | |
| Neime | 8 | 7 | 8 | 7 | 7 | 5 | 6 | 9 | 8 | 8 | 22 | 23 | 26 | 76 | 24 | 21 | 27 | 332 | 383 | 151 | 178 | 238 | 306 | 300 | 193 | 233 | 157 | 85 | 85 | 79 | 258 | 138 | 225 | 105 | 117 | 123 | 120 | 128 | 192 | |
| Caucr | 8 | 11 | 11 | 6 | 7 | 4 | 6 | 13 | 10 | 10 | 27 | 11 | 17 | 19 | 26 | 36 | 25 | 18 | 1020 | 206 | 196 | 289 | 638 | 605 | 311 | 432 | 228 | 96 | 92 | 88 | 561 | 295 | 456 | 149 | 139 | 135 | 128 | 186 | 302 | |
| Meslo | 9 | 13 | 12 | 7 | 8 | 4 | 6 | 14 | 12 | 11 | 34 | 9 | 17 | 19 | 27 | 43 | 20 | 16 | 43 | 225 | 220 | 337 | 850 | 792 | 430 | 527 | 300 | 103 | 103 | 107 | 691 | 369 | 582 | 177 | 161 | 164 | 157 | 208 | 400 | |
| Ricpr | 6 | 6 | 6 | 5 | 5 | 5 | 6 | 7 | 7 | 8 | 9 | 17 | 12 | 11 | 10 | 9 | 14 | 17 | 13 | 11 | 91 | 116 | 175 | 165 | 108 | 138 | 99 | 71 | 70 | 63 | 149 | 100 | 139 | 82 | 71 | 81 | 82 | 86 | 109 | |
| Helpy | 10 | 9 | 10 | 9 | 10 | 7 | 8 | 11 | 11 | 11 | 12 | 16 | 15 | 14 | 12 | 10 | 15 | 19 | 12 | 10 | 14 | 250 | 203 | 217 | 147 | 183 | 113 | 76 | 73 | 88 | 172 | 100 | 171 | 110 | 105 | 90 | 89 | 131 | 162 | |
| Camje | 11 | 12 | 13 | 10 | 13 | 9 | 10 | 13 | 11 | 11 | 17 | 16 | 18 | 20 | 18 | 15 | 17 | 22 | 16 | 15 | 14 | 33 | 329 | 312 | 202 | 240 | 134 | 81 | 77 | 80 | 239 | 177 | 232 | 113 | 118 | 100 | 94 | 150 | 222 | |
| Bacsu | 9 | 13 | 11 | 8 | 10 | 4 | 7 | 13 | 10 | 10 | 26 | 10 | 15 | 17 | 24 | 29 | 16 | 14 | 25 | 29 | 9 | 10 | 16 | 1234 | 615 | 931 | 486 | 178 | 175 | 164 | 621 | 284 | 530 | 223 | 197 | 162 | 154 | 186 | 473 | |
| Bacha | 10 | 14 | 12 | 8 | 10 | 4 | 7 | 13 | 11 | 10 | 25 | 10 | 15 | 18 | 24 | 27 | 16 | 14 | 24 | 28 | 9 | 11 | 16 | 56 | 575 | 869 | 460 | 173 | 166 | 166 | 594 | 282 | 522 | 213 | 193 | 158 | 149 | 191 | 491 | |
| Lacla | 9 | 12 | 10 | 7 | 10 | 5 | 9 | 12 | 10 | 10 | 18 | 10 | 17 | 16 | 16 | 15 | 14 | 13 | 15 | 17 | 9 | 12 | 15 | 29 | 28 | 534 | 473 | 150 | 137 | 124 | 363 | 158 | 314 | 129 | 114 | 111 | 107 | 115 | 325 | |
| Staau | 9 | 12 | 10 | 8 | 10 | 5 | 8 | 12 | 10 | 10 | 21 | 11 | 18 | 18 | 20 | 19 | 16 | 13 | 19 | 19 | 9 | 12 | 14 | 44 | 42 | 32 | 434 | 182 | 169 | 169 | 480 | 215 | 395 | 170 | 140 | 160 | 146 | 141 | 384 | |
| Strpy | 8 | 9 | 9 | 8 | 8 | 6 | 7 | 11 | 8 | 8 | 12 | 8 | 15 | 14 | 13 | 12 | 11 | 12 | 11 | 12 | 10 | 11 | 11 | 24 | 24 | 40 | 28 | 150 | 137 | 128 | 246 | 128 | 240 | 138 | 120 | 110 | 106 | 92 | 235 | |
| Mycpn | 8 | 5 | 5 | 5 | 6 | 6 | 6 | 8 | 5 | 5 | 6 | 14 | 10 | 9 | 7 | 5 | 8 | 10 | 6 | 4 | 14 | 13 | 10 | 9 | 10 | 14 | 14 | 18 | 203 | 138 | 119 | 63 | 100 | 95 | 74 | 66 | 66 | 61 | 110 | |
| Mycge | 8 | 5 | 5 | 6 | 6 | 6 | 5 | 7 | 5 | 6 | 6 | 14 | 10 | 8 | 6 | 5 | 8 | 10 | 6 | 5 | 15 | 13 | 10 | 9 | 9 | 13 | 13 | 17 | 87 | 133 | 118 | 62 | 97 | 93 | 72 | 67 | 67 | 62 | 109 | |
| Ureur | 7 | 4 | 6 | 6 | 7 | 7 | 6 | 7 | 5 | 5 | 5 | 12 | 11 | 9 | 6 | 5 | 8 | 9 | 5 | 5 | 13 | 16 | 11 | 9 | 9 | 12 | 13 | 16 | 47 | 48 | 117 | 62 | 100 | 89 | 75 | 65 | 64 | 57 | 113 | |
| Myctu | 10 | 16 | 11 | 7 | 8 | 5 | 8 | 14 | 13 | 13 | 20 | 12 | 13 | 13 | 17 | 24 | 17 | 15 | 27 | 28 | 10 | 12 | 15 | 26 | 26 | 20 | 24 | 14 | 9 | 9 | 9 | 255 | 444 | 125 | 127 | 133 | 132 | 163 | 289 | |
| SynPC | 7 | 8 | 10 | 8 | 8 | 8 | 11 | 12 | 8 | 8 | 12 | 8 | 10 | 11 | 13 | 14 | 13 | 11 | 16 | 16 | 11 | 10 | 17 | 13 | 14 | 11 | 12 | 10 | 8 | 8 | 8 | 16 | 255 | 76 | 83 | 72 | 69 | 140 | 165 | |
| Deira | 13 | 15 | 14 | 11 | 12 | 7 | 9 | 18 | 13 | 14 | 19 | 10 | 14 | 14 | 17 | 23 | 16 | 15 | 23 | 24 | 11 | 13 | 17 | 23 | 24 | 19 | 21 | 16 | 8 | 8 | 9 | 25 | 18 | 136 | 118 | 124 | 122 | 156 | 269 | |
| Borbu | 8 | 5 | 8 | 9 | 9 | 7 | 7 | 9 | 6 | 6 | 9 | 17 | 13 | 11 | 11 | 9 | 12 | 11 | 9 | 8 | 14 | 17 | 14 | 12 | 12 | 11 | 12 | 15 | 20 | 21 | 20 | 9 | 8 | 11 | 191 | 104 | 100 | 87 | 152 | |
| Trepa | 8 | 5 | 7 | 10 | 8 | 7 | 8 | 9 | 6 | 6 | 8 | 15 | 10 | 10 | 9 | 7 | 11 | 13 | 8 | 7 | 12 | 17 | 15 | 10 | 10 | 10 | 9 | 13 | 15 | 16 | 17 | 9 | 9 | 9 | 43 | 94 | 94 | 93 | 161 | |
| Chlpn | 9 | 6 | 6 | 7 | 7 | 6 | 6 | 7 | 7 | 7 | 8 | 16 | 13 | 12 | 9 | 7 | 12 | 14 | 8 | 8 | 16 | 15 | 13 | 8 | 9 | 10 | 11 | 12 | 15 | 16 | 16 | 10 | 8 | 10 | 21 | 19 | 245 | 75 | 123 | |
| Chltr | 9 | 6 | 7 | 7 | 7 | 5 | 6 | 7 | 7 | 7 | 7 | 15 | 12 | 11 | 9 | 7 | 12 | 14 | 8 | 7 | 16 | 15 | 12 | 8 | 8 | 9 | 10 | 12 | 15 | 16 | 16 | 10 | 8 | 10 | 20 | 19 | 87 | 73 | 121 | |
| Aquae | 8 | 8 | 10 | 7 | 7 | 8 | 9 | 8 | 7 | 8 | 9 | 12 | 9 | 9 | 10 | 9 | 13 | 12 | 11 | 9 | 12 | 18 | 17 | 9 | 10 | 9 | 9 | 8 | 10 | 10 | 9 | 11 | 15 | 12 | 13 | 14 | 12 | 11 | 163 | |
| Thema | 13 | 16 | 14 | 14 | 16 | 10 | 15 | 12 | 11 | 11 | 17 | 16 | 16 | 17 | 17 | 15 | 16 | 14 | 15 | 17 | 10 | 15 | 19 | 22 | 25 | 23 | 23 | 18 | 12 | 12 | 12 | 17 | 13 | 17 | 15 | 17 | 13 | 13 | 15 |
The diagonal (bold) shows the total number of conserved gene pairs in each genome. The upper triangle of the matrix shows the raw number of gene pairs shared by each pair of genomes and the lower triangle shows the value of 100*Oij; Oij is the Jaquard coefficient calculated as Qij = Cij/(Ni+Nj-Cij) where Cij is the number of gene pairs shared by genomes i and j, and Ni and Nj are the total numbers of conserved gene pairs in genome pairs in genomes i and j, respectively [51].
Figure 2Distribution of identity percentage between probable orthologs in genome pairs. The distributions are for the sets of probable orthologs detected with an e-value cut-off of 0.001. For species name abbreviations, see Materials and Methods.
Figure 3Maximum parsimony tree (Dollo parsimony) based on absence-presence of genomes in orthologous gene sets. The tree is unrooted. The circles indicate the level of bootstrap support, with the following color coding: red: 90–100%, yellow: 80–90%, green: 70–80%, blue: 60–70%, magenta: 40–60%. The nodes with <40% support are unmarked.
Figure 4Maximum parsimony tree (Dollo parsimony) of prokaryotes based on presence-absence of gene pairs in genomes. The designations are as in Fig. 3.
Figure 5Distance tree constructed using the median of the percent identity distribution between probable orthologs for evolutionary distance calculation. An E-value cut-off of 0.001 was used to identify bidirectional best hits between proteins encoded in all pairs of genomes. Distances were calculated using the logarithmic formula. The designations are as in Fig. 3.
Symmetric distances between genome-trees constructed with different methods.
| Gene presence-absence | Conserved gene pairs | Identity distributions | |
| Symmetric distancea | |||
| Gene presence-absence | |||
| Conserved gene pairs | 52 | ||
| Identity distributions | 54 | 44 | |
| Concatenated ribosomal | 56 | 44 | 38 |
| proteins | |||
aNumber of different partitions of the total of 74 partitions.
Figure 6Maximum-likelihood tree produced from concatenated alignments of the universal subset of ribosomal proteins. The designations are as in Fig. 3.
Figure 7The Kishino-Hasegawa test for the "1" indicates the original position of the tested clade in the concatenated ribosomal proteins tree (Fig. 6). The remaining numbers show the alternative positions tested for each of these species (in green ovals for Aquifex and blue for Thermotoga). For the likelihood values and RELL bootstrap values for each of the corresponding topologies, see Table 3A.
Figure 8The Kishino-Hasegawa test for the The identical scheme of producing alternative topologies was used for each of the three species. For example for Deinococcus (see Table 4) the green ovals (## 2 to 13) indicate alternative placements of Deinococcus with Mycobacterium and Synechocystis occupying the original position and the blue ovals (## 14 to 25) indicate alternative placements of the Mycobacterium-Synechocystis pair with Deinococcus left in the original position. The same was done with Mycobacterium versus Deinococcus-Synechocystis pair (Table 5) and Synechocystis versus Deinococcus-Mycobacterium pair (Table 6).
Testing non-trivial groupings from the concatenated ribosomal protein tree with the Kishino-Hasegawa testa
| (A) | ||||
| # | Likelihood | ΔLb | σΔLc | RELL-BPd |
| 1 | -242983.7 | best | N/A | 0.9251 |
| 2 | -243174.6 | -190.9 | 38.5 | 0.0000 |
| 3 | -243185.0 | -201.3 | 38.0 | 0.0000 |
| 4 | -243146.1 | -162.5 | 32.1 | 0.0000 |
| 5 | -243267.6 | -283.9 | 49.0 | 0.0000 |
| 6 | -243293.3 | -309.7 | 49.0 | 0.0000 |
| 7 | -243218.8 | -235.2 | 41.8 | 0.0000 |
| 8 | -243301.0 | -317.3 | 45.7 | 0.0000 |
| 9 | -243315.4 | -331.8 | 45.0 | 0.0000 |
| 10 | -243242.8 | -259.1 | 40.0 | 0.0000 |
| 11 | -243005.7 | -22.0 | 12.2 | 0.0227 |
| 12 | -243196.1 | -212.4 | 39.2 | 0.0000 |
| 13 | -243201.5 | -217.9 | 38.8 | 0.0000 |
| 14 | -243157.9 | -174.3 | 32.2 | 0.0000 |
| 15 | -243318.8 | -335.1 | 49.8 | 0.0000 |
| 16 | -243355.8 | -372.1 | 48.3 | 0.0000 |
| 17 | -243247.1 | -263.4 | 42.1 | 0.0000 |
| 18 | -243236.5 | -252.8 | 51.2 | 0.0000 |
| 19 | -243232.0 | -248.3 | 51.1 | 0.0000 |
| 20 | -243207.0 | -223.3 | 45.0 | 0.0000 |
| 21 | -243002.6 | -19.0 | 12.8 | 0.0522 |
| (B) | ||||
| # | Likelihood | ΔL | σΔL | RELL-BP |
| 1 | -242983.7 | best | N/A | 0.8239 |
| 2 | -243091.1 | -107.4 | 40.8 | 0.0002 |
| 3 | -243122.6 | -138.9 | 43.0 | 0.0000 |
| 4 | -243135.8 | -152.1 | 43.1 | 0.0000 |
| 5 | -243088.1 | -104.5 | 36.3 | 0.0000 |
| 6 | -243037.3 | -53.7 | 41.0 | 0.0775 |
| 7 | -243064.0 | -80.4 | 40.2 | 0.0020 |
| 8 | -243024.5 | -40.9 | 31.8 | 0.0574 |
| 9 | -243030.9 | -47.3 | 19.0 | 0.0011 |
| 10 | -243017.6 | -33.9 | 20.8 | 0.0090 |
| 11 | -243052.1 | -68.4 | 30.3 | 0.0010 |
| 12 | -243070.6 | -86.9 | 37.3 | 0.0013 |
| 13 | -243066.1 | -82.5 | 40.4 | 0.0122 |
| 14 | -243143.1 | -159.4 | 39.6 | 0.0000 |
| 15 | -243151.3 | -167.7 | 43.3 | 0.0000 |
| 16 | -243186.7 | -203.0 | 42.2 | 0.0000 |
| 17 | -243102.9 | -119.3 | 36.6 | 0.0001 |
| 18 | -243167.6 | -184.0 | 37.8 | 0.0000 |
| 19 | -243155.4 | -171.7 | 38.9 | 0.0000 |
| 20 | -243065.3 | -81.7 | 29.6 | 0.0007 |
| 21 | -243017.6 | -33.9 | 20.8 | 0.0121 |
| 22 | -243030.9 | -47.3 | 19.0 | 0.0006 |
| 23 | -243068.3 | -84.7 | 29.9 | 0.0009 |
| 24 | -243103.9 | -120.3 | 36.7 | 0.0000 |
| 25 | -243135.2 | -151.5 | 39.0 | 0.0000 |
| (C) | ||||
| # | Likelihood | ΔL | σΔL | RELL-BP |
| 1 | -242983.7 | best | N/A | 0.8589 |
| 2 | -243160.7 | -177.0 | 46.5 | 0.0000 |
| 3 | -243192.5 | -208.9 | 48.9 | 0.0000 |
| 4 | -243216.2 | -232.5 | 48.6 | 0.0000 |
| 5 | -243140.3 | -156.6 | 44.1 | 0.0000 |
| 6 | -243146.6 | -163.0 | 48.1 | 0.0000 |
| 7 | -243153.9 | -170.3 | 48.0 | 0.0000 |
| 8 | -243071.4 | -87.7 | 41.0 | 0.0013 |
| 9 | -243023.4 | -39.7 | 34.2 | 0.0443 |
| 10 | -243037.2 | -53.5 | 33.3 | 0.0052 |
| 11 | -243098.4 | -114.7 | 39.6 | 0.0000 |
| 12 | -243126.1 | -142.4 | 44.5 | 0.0000 |
| 13 | -243146.5 | -162.8 | 46.9 | 0.0000 |
| 14 | -243087.0 | -103.3 | 52.5 | 0.0010 |
| 15 | -243128.5 | -144.8 | 54.7 | 0.0000 |
| 16 | -243150.8 | -167.2 | 54.1 | 0.0000 |
| 17 | -243079.5 | -95.9 | 49.1 | 0.0014 |
| 18 | -243136.6 | -153.0 | 50.5 | 0.0000 |
| 19 | -243152.6 | -168.9 | 49.7 | 0.0000 |
| 20 | -243062.9 | -79.3 | 41.5 | 0.0012 |
| 21 | -243037.2 | -53.5 | 33.3 | 0.0059 |
| 22 | -243023.4 | -39.7 | 34.2 | 0.0327 |
| 23 | -243047.8 | -64.1 | 43.2 | 0.0209 |
| 24 | -243062.5 | -78.8 | 49.7 | 0.0192 |
| 25 | -243076.6 | -93.0 | 51.9 | 0.0080 |
| (D) | ||||
| Synechocystis sp. | ||||
| # | Likelihood | ΔL | σΔL | RELL-BP |
| 1 | -242983.7 | best | N/A | 0.9617 |
| 2 | -243118.5 | -134.8 | 47.5 | 0.0000 |
| 3 | -243077.9 | -94.3 | 51.4 | 0.0265 |
| 4 | -243115.8 | -132.1 | 50.9 | 0.0000 |
| 5 | -243084.6 | -101.0 | 46.4 | 0.0031 |
| 6 | -243184.5 | -200.8 | 46.3 | 0.0000 |
| 7 | -243208.1 | -224.4 | 45.5 | 0.0000 |
| 8 | -243135.7 | -152.1 | 38.1 | 0.0000 |
| 9 | -243072.3 | -88.6 | 32.1 | 0.0006 |
| 10 | -243083.7 | -100.0 | 31.6 | 0.0000 |
| 11 | -243099.4 | -115.7 | 40.6 | 0.0000 |
| 12 | -243102.5 | -118.8 | 45.2 | 0.0003 |
| 13 | -243097.2 | -113.5 | 47.6 | 0.0030 |
| 14 | -243204.5 | -220.8 | 48.0 | 0.0000 |
| 15 | -243279.8 | -296.2 | 49.3 | 0.0000 |
| 16 | -243288.4 | -304.7 | 49.4 | 0.0000 |
| 17 | -243194.3 | -210.7 | 42.9 | 0.0000 |
| 18 | -243180.8 | -197.1 | 49.5 | 0.0000 |
| 19 | -243177.5 | -193.8 | 49.4 | 0.0000 |
| 20 | -243090.1 | -106.4 | 41.4 | 0.0038 |
| 21 | -243083.7 | -100.0 | 31.6 | 0.0000 |
| 22 | -243072.3 | -88.6 | 32.1 | 0.0010 |
| 23 | -243129.2 | -145.5 | 38.5 | 0.0000 |
| 24 | -243165.6 | -181.9 | 45.1 | 0.0000 |
| 25 | -243195.5 | -211.9 | 47.6 | 0.0000 |
Figure 9The Kishino-Hasegawa test for the unification of the See Table 7.
| (E) | ||||
| The Demococcus-Mycobacterium-Synechocystis clade | ||||
| # | Likelihood | ΔL | σΔL | RELL-BP |
| 1 | -242983.7 | 0.0 | <-best | 0.7280 |
| 2 | -243065.3 | -81.7 | 34.7 | 0.0000 |
| 3 | -243122.4 | -138.8 | 37.2 | 0.0000 |
| 4 | -243148.7 | -165.1 | 35.8 | 0.0000 |
| 5 | -243053.8 | -70.1 | 28.9 | 0.0001 |
| 6 | -243103.1 | -119.5 | 33.6 | 0.0000 |
| 7 | -243096.4 | -112.7 | 34.1 | 0.0001 |
| 8 | -243003.1 | -19.4 | 23.2 | 0.1697 |
| 9 | -243010.5 | -26.9 | 21.5 | 0.0560 |
| 10 | -243028.9 | -45.3 | 31.2 | 0.0419 |
| 11 | -243054.3 | -70.7 | 34.7 | 0.0042 |
Figure 10The Kishino-Hasegawa test for the Spirochete-Chlamydia clade. Green ovals: chlamydia, blue ovals: spirochetes. See Table 8.
| (F) | ||||
| The spirochaete-chlamydia clade | ||||
| # | Likelihood | ΔL | σΔL | RELL-BP |
| 1 | -242983.7 | best | N/A | 0.6173 |
| 2 | -243055.2 | -71.5 | 21.5 | 0.0000 |
| 3 | -243050.7 | -67.1 | 34.7 | 0.0078 |
| 4 | -243096.8 | -113.2 | 33.0 | 0.0000 |
| 5 | -243045.5 | -61.9 | 25.0 | 0.0007 |
| 6 | -243066.5 | -82.8 | 32.8 | 0.0012 |
| 7 | -243072.2 | -88.5 | 32.4 | 0.0006 |
| 8 | -243049.0 | -65.3 | 25.2 | 0.0005 |
| 9 | -243036.7 | -53.1 | 21.7 | 0.0016 |
| 10 | -243057.4 | -73.7 | 21.9 | 0.0000 |
| 11 | -242998.3 | -14.6 | 40.2 | 0.3605 |
| 12 | -243086.4 | -102.7 | 36.2 | 0.0000 |
| 13 | -243024.8 | -41.1 | 28.0 | 0.0071 |
| 14 | -243146.2 | -162.5 | 31.4 | 0.0000 |
| 15 | -243130.7 | -147.0 | 32.9 | 0.0000 |
| 16 | -243077.2 | -93.6 | 23.4 | 0.0000 |
| 17 | -243036.9 | -53.3 | 22.1 | 0.0027 |
| (G) | ||||
| ε-proteobacteria | ||||
| # | Likelihood | ΔL | σΔL | RELL-BP |
| 1 | -242983.7 | best | N/A | 0.5482 |
| 2 | -243093.9 | -110.3 | 32.7 | 0.0000 |
| 3 | -243009.8 | -26.1 | 39.6 | 0.0417 |
| 4 | -242991.7 | -8.0 | 41.2 | 0.3788 |
| 5 | -243007.7 | -24.0 | 34.1 | 0.0308 |
| 6 | -243121.1 | -137.4 | 30.3 | 0.0000 |
| 7 | -243112.4 | -128.7 | 31.1 | 0.0000 |
| 8 | -243076.4 | -92.8 | 22.0 | 0.0000 |
| 9 | -243071.1 | -87.4 | 29.7 | 0.0000 |
| 10 | -243055.0 | -71.4 | 33.4 | 0.0005 |
Figure 11The Kishino-Hasegawa test for the unification of ε-proteobacteria with the rest of Proteobacteria. See Table 9.
| (H) | ||||
| Crenarchaeota and Euryarchaeota | ||||
| # | Likelihood | ΔL | σΔL | RELL-BP |
| 1 | -242983.7 | best | N/A | 0.5840 |
| 2 | -242993.2 | -9.5 | 33.7 | 0.4160 |
aThe numbers correspond to those in Fig. 5bThe likelihood difference with the first (original) topology cThe standard deviation of the above dThe bootstrap probability of the given topology estimated with RELL method [49].
COGs used for the comparative analysis of Maximum Likelihood trees for individual protein families
| COG | speca | protb | Name |
| COG0012 | 40 | 41 | Predicted GTPase |
| COG0013 | 40 | 40 | Alanyl-tRNA synthetase |
| COG0016 | 40 | 40 | Phenylalanyl-tRNA synthetase alpha submit |
| COG0018 | 40 | 41 | Arginyl-tRNA synthetase |
| COG0020 | 37 | 40 | Undecaprenyl pyrophosphate synthase |
| COG0048 | 39 | 39 | Ribosomal protein S12 |
| COG0049 | 40 | 41 | Ribosomal protein S7 |
| COG0051 | 40 | 40 | Ribosomal protein S10 |
| COG0052 | 40 | 40 | Ribosomal protein S2 |
| COG0060 | 40 | 40 | Isoleucyl-tRNA synthetase |
| COG0061 | 37 | 40 | Predicted kinase |
| COG0064 | 30 | 30 | Asp-tRNAAsn/Glu-tRNAGIn amidotransferase B subunit (PET 112 homolog) |
| COG0072 | 40 | 40 | Phenylalanyl-tRNA synthetase beta subunit |
| COG0080 | 40 | 41 | Ribosomal protein LI 1 |
| COG0081 | 40 | 40 | Ribosomal protein LI |
| COG0082 | 33 | 33 | Chorismate synthase |
| COG0085 | 40 | 40 | DNA-directed RNA polymerase beta subunit/140 kD subunit (split gene in |
| Mjan, Mthe, Aful) | |||
| COG0087 | 40 | 40 | Ribosomal protein L3 |
| COG0088 | 40 | 40 | Ribosomal protein L4 |
| COG0090 | 40 | 40 | Ribosomal protein L2 |
| COG0091 | 40 | 40 | Ribosomal protein L22 |
| COG0092 | 40 | 40 | Ribosomal protein S3 |
| COG0093 | 39 | 39 | Ribosomal protein LI 4 |
| COG0094 | 40 | 40 | Ribosomal protein L5 |
| COG0096 | 40 | 40 | Ribosomal protein S8 |
| COG0097 | 40 | 40 | Ribosomal protein L6 |
| COG0098 | 40 | 40 | Ribosomal protein S5 |
| COG0099 | 40 | 40 | Ribosomal protein S13 |
| COG0100 | 39 | 39 | Ribosomal protein S11 |
| COG0101 | 38 | 38 | Pseudouridylate synthase (tRNA psi55) |
| COG0102 | 40 | 40 | Ribosomal protein LI 3 |
| COG0103 | 40 | 40 | Ribosomal protein S9 |
| COG0104 | 31 | 31 | Adenylosuccinate synthase |
| COG0105 | 33 | 33 | Nucleoside diphosphate kinase |
| COG0126 | 39 | 39 | 3-phosphoglycerate kinase |
| COG0127 | 34 | 35 | Xanthosine triphosphate pyrophosphatase |
| COG0128 | 33 | 35 | 5-enolpyruvylshikimate-3-phosphate synthase |
| COG0130 | 37 | 37 | Pseudouridine synthase |
| COG0134 | 30 | 30 | Indole-3-glycerol phosphate synthase |
| COG0135 | 30 | 30 | Phosphoribosylanthranilate isomerase |
| COG0143 | 40 | 41 | Methionyl-tRNA synthetase |
| COG0148 | 39 | 43 | Enolase |
| COG0149 | 39 | 41 | Triosephosphate isomerase |
| COG0151 | 30 | 30 | Phosphoribosylamine-glycine ligase |
| COG0152 | 30 | 32 | Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase |
| COG0159 | 31 | 31 | Tryptophan synthase alpha chain |
| COG0162 | 40 | 43 | Tyrosyl-tRNA synthetase |
| COG0164 | 35 | 35 | Ribonuclease HII |
| COG0166 | 35 | 35 | Glucose-6-phosphate isomerase |
| COG0167 | 32 | 37 | Dihydroorotate dehydrogenase |
| COG0169 | 33 | 39 | Shikimate 5-dehydrogenase |
| COG0171 | 35 | 37 | NAD synthase |
| COG0172 | 40 | 40 | Seryl-tRNA synthetase |
| COG0173 | 30 | 30 | Aspartyl-tRNA synthetase |
| COG0178 | 31 | 33 | Excinuclease ATPase submit |
| COG0180 | 40 | 43 | Tryptophanyl-tRNA synthetase |
| COG0190 | 33 | 33 | 5,10-methylene-tetrahydrofolate dehydrogenase |
| COG0193 | 30 | 30 | Peptidyl-tRNA hydrolase |
| COG0197 | 40 | 40 | Ribosomal protein L16/L10E |
| COG0198 | 40 | 40 | Ribosomal protein L24 |
| COG0200 | 40 | 40 | Ribosomal protein LI 5 |
| COG0201 | 40 | 41 | Preprotein translocase subunit SecY |
| COG0202 | 40 | 40 | DNA-directed RNA polymerase alpha subunit/40 kD subunit |
| COG0203 | 30 | 30 | Ribosomal protein LI 7 |
| COG0215 | 38 | 39 | Cysteinyl-tRNA synthetase |
| COG0216 | 30 | 30 | Protein chain release factor A |
| COG0221 | 31 | 31 | Inorganic pyrophosphatase |
| COG0222 | 30 | 30 | Ribosomal protein L7/L12 |
| COG0223 | 30 | 34 | Methionyl-tRNA formyltransferase |
| COG0231 | 40 | 46 | Translation elongation factor P/translation initiation factor eIF-5A |
| COG0233 | 30 | 30 | Ribosome recycling factor |
| COG0237 | 39 | 41 | Dephospho-CoA kinase |
| COG0242 | 30 | 35 | N-formylmethionyl-tRNA deformylase |
| COG0244 | 40 | 40 | Ribosomal protein L10 |
| COG0250 | 40 | 43 | Transcription antiterminator |
| COG0256 | 40 | 40 | Ribosomal protein LI 8 |
| COG0258 | 40 | 47 | 5'-3' exonuclease (including N-terminal domain of Poll) |
| COG0261 | 30 | 30 | Ribosomal protein L21 |
| COG0264 | 30 | 30 | Translation elongation factor Ts |
| COG0272 | 30 | 31 | NAD-dependent DNA ligase (contains BRCT domain type II) |
| COG0275 | 30 | 30 | Predicted S-adenosylmethionine-dependent methyltransferase involved in cell |
| envelope biogenesis | |||
| COG0284 | 32 | 32 | Orotidine-5'-phosphate decarboxylase |
| COG0290 | 30 | 30 | Translation initiation factor IF3 |
| COG0292 | 30 | 30 | Ribosomal protein L20 |
| COG0294 | 33 | 36 | Dihydropteroate synthase |
| COG0305 | 30 | 31 | Replicative DNA helicase |
| COG0313 | 30 | 31 | Predicted methyltransferases |
| COG0319 | 30 | 30 | Predicted metal-dependent hydrolase |
| COG0335 | 30 | 30 | Ribosomal protein LI 9 |
| COG0336 | 30 | 30 | tRNA-(guanine-N1)-methyltransferase |
| COG0340 | 32 | 34 | Biotin-(acetyl-CoA carboxylase) ligase |
| COG0343 | 35 | 36 | Queuine/archaeosine tRNA-ribosyltransferase |
| COG0351 | 31 | 34 | Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase |
| COG0359 | 30 | 30 | Ribosomal protein L9 |
| COG0441 | 40 | 43 | Threonyl-tRNA synthetase |
| COG0442 | 40 | 40 | Prolyl-tRNA synthetase |
| COG0452 | 32 | 32 | Phosphopantothenoylcysteine synthetase/decarboxylase |
| COG0461 | 33 | 34 | Orotate phosphoribosyltransferase |
| COG0462 | 37 | 40 | Phosphoribosylpyrophosphate synthetase |
| COG0481 | 30 | 30 | Membrane GTPase LepA |
| COG0495 | 40 | 41 | Leucyl-tRNA synthetase |
| COG0504 | 38 | 38 | CTP synthase (UTPammonia lyase) |
| COG0519 | 33 | 33 | GMP synthase – PP-ATPase domain |
| COG0522 | 40 | 40 | Ribosomal protein S4 and related proteins |
| COG0525 | 40 | 40 | Valyl-tRNA synthetase |
| COG0528 | 40 | 40 | Uridylate kinase |
| COG0532 | 40 | 40 | Translation initiation factor 2 (GTPase) |
| COG0533 | 40 | 40 | Metal-dependent proteases with possible chaperone activity |
| COG0536 | 30 | 30 | Predicted GTPase |
| COG0540 | 30 | 30 | Aspartate carbamoyltransferase, catalytic chain |
| COG0541 | 40 | 40 | Signal recognition particle GTPase |
| COG0544 | 30 | 30 | FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) |
| COG0547 | 30 | 35 | Anthranilate phosphoribosyltransferase |
| COG0552 | 40 | 40 | Signal recognition particle GTPase |
| COG0556 | 31 | 31 | Helicase subunit of the DNA excision repair complex |
| COG0571 | 30 | 30 | dsRNA-specific ribonuclease |
| COG0573 | 30 | 34 | ABC-type phosphate transport system, permease component |
| COG0576 | 34 | 35 | Molecular chaperone GrpE (heat shock protein) |
| COG0581 | 30 | 34 | ABC-type phosphate transport system, permease component |
| COG0587 | 30 | 35 | DNA polymerase III alpha subunit |
| COG0597 | 30 | 30 | Lipoprotein signal peptidase |
| COG0653 | 30 | 32 | Preprotein translocase subunit SecA (ATPase, RNA helicase) |
| COG0682 | 30 | 30 | Prolipoprotein diacylglyceryltransferase |
| COG0691 | 30 | 30 | tmRNA-binding protein |
| COG0706 | 30 | 34 | Preprotein translocase subunit YidC |
| COG0781 | 30 | 30 | Transcription termination factor |
| COG0858 | 30 | 30 | Ribosome-binding factor A |
| COG1160 | 30 | 30 | Predicted GTPases |
| COG1214 | 30 | 30 | Inactive homologs of metal-dependent proteases, putative molecular chaperones |
| COG1466 | 30 | 30 | DNA polymerase III delta subunit |
| COG1488 | 32 | 35 | Nicotinic acid phosphoribosyltransferase |
| COG2812 | 30 | 30 | DNA polymerase III, gamma/tau subunits |
aNumber of represented species. bNumber of proteins.
Figure 13A census of the topologies of maximum-likelihood trees for individual protein families.Thermotoga and Aquifex. In each panel, the left top icon shows the grouping tested and the remaining icons show the most common alternative topologies for the given species/group. Dotted lines indicate optional presence of (possibly several) members of the indicated group (e.g. "proteo" with several dotted lines leading to it means that any number and combination of proteobacterial proteins could be present on the given branch). For each icon, the number of COG trees with the given topology (upper number) and the size of the subset supported by at least 70% bootstrap values (lower number) are indicated. Uncertain topologies (lacking clearly defined taxonomic units on the other side of the subtree or those without bootstrap support) are indicated by multiple dotted lines without indication of the neighbor. Abbreviations: TA – Thema and/or Aquae; DMS – any combination of Deira, Myctu and SynPC. Note that, in some cases, which involve taxonomic clades rather than single organisms (e.g. spirochetes), failure of the corresponding species to form a clade in the given tree may lead to asymmetrical counts of topologies. For example, if a particular tree has a (Deira,(Trepa, Borbu)) branch, this tree will be included in both the Deira-spiro and spiro-Deira tallies. If, however, the subtree ((Deira, Trepa),(Aquae, Borbu)) is present, then the Deira-spiro and Aquae-spiro tallies gain one count each, but the spiro-Deira and spiro-Aquae tallies do not; instead, a case of spirochete polyphyly is registered.
Figure 14A census of the topologies of maximum-likelihood trees for individual protein families.Deinococcus, Mycobacterium and Synechocystis. The designations are as in Fig. 3.
Figure 15A census of the topologies of maximum-likelihood trees for individual protein families. Spirochetes, chlamydia and epsilon-protoebacteria. The designations are as in Fig. 3.