| Literature DB >> 18045498 |
Abstract
BACKGROUND: Alpha proteobacteria are one of the largest and most extensively studied groups within bacteria. However, for these bacteria as a whole and for all of its major subgroups (viz. Rhizobiales, Rhodobacterales, Rhodospirillales, Rickettsiales, Sphingomonadales and Caulobacterales), very few or no distinctive molecular or biochemical characteristics are known.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18045498 PMCID: PMC2241609 DOI: 10.1186/1471-2180-7-106
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Genome sizes, protein numbers and GC contents of sequenced alpha-proteobacteria
| Species Name | Order | Family | Genome Size (Mb) | GC content (%) | Protein Number | Reference |
| Rhizobiales | Bartonellaceae | 1.4 | 38.2 | 1283 | TIGR | |
| Rhizobiales | Bartonellaceae | 1.93 | 38.2 | 1488 | [7] | |
| Rhizobiales | Bartonellaceae | 1.58 | 38.8 | 1142 | [7] | |
| 9.1 | 64.1 | 8317 | [5] | |||
| 8.53 | 64.8 | 7622 | DOE-JGI | |||
| 7.5 | 65.5 | 6717 | Genoscope | |||
| 5.01 | 61.6 | 4326 | DOE-JGI | |||
| 3.4 | 2.5 | 3122 | [42] | |||
| 5.51 | 64.4 | 4878 | DOE-JGI | |||
| 5.51 | 65.0 | 4886 | DOE-JGI | |||
| 4.89 | 64.8 | 4397 | DOE-JGI | |||
| 5.47 | 65.0 | 4820 | [41] | |||
| 5.33 | 66.0 | 4683 | DOE-JGI | |||
| 3.29 | 57.2 | 3085 | [69] | |||
| 3.29 | 57.2 | 3198 | [38] | |||
| 3.28 | 57.2 | 3034 | [39] | |||
| 3.32 | 57.3 | 2123 | [40] | |||
| 3.3 | 57.2 | 2892 | TIGR | |||
| 7.6 | 62.5 | 7372 | [6] | |||
| 4.94 | 61.1 | 4543 | DOE-JGI | |||
| Rhizobiales | Rhizobiaceae | 5.67 | 59.0 | 4661 | [37] | |
| 6.53 | 61.0 | 5963 | [35] | |||
| 7.75 | 61.0 | 7263 | [81] | |||
| 6.69 | 62.2 | 6205 | [36] | |||
| 4.02 | 67.2 | 3737 | [51] | |||
| 3.71 | 61.9 | 3505 | [46] | |||
| 3.37 | 62.7 | 3063 | DOE-JGI | |||
| 4.4 | 62.2 | 4212 | DOE-JGI | |||
| 5.25 | 66.8 | 5077 | DOE-JGI | |||
| 4.45 | 68.8 | 4242 | DOE-JGI | |||
| 4.54 | 68.2 | 4333 | DOE-JGI | |||
| 4.42 | 69.0 | 4132 | DOE-JGI | |||
| 4.3 | 58.9 | 4129 | [44] | |||
| 4.6 | 64.1 | 4252 | [45] | |||
| 4.15 | 60.1 | 3864 | DOE-JGI | |||
| 2.92 | 60.8 | 2664 | [56] | |||
| 2.7 | 59.1 | 2437 | Rocky Mountain Lab | |||
| 3.97 | 67.1 | 3564 | DOE-JGI | |||
| 4.97 | 65.1 | 4559 | [57] | |||
| 4.41 | 65.4 | 3841 | DOE-JGI | |||
| 1.3 | 29.7 | 1354 | [2] | |||
| Rickettsiales | Anaplasmataceae | 1.2 | 49.8 | 949 | [70] | |
| Rickettsiales | Anaplasmataceae | 1.47 | 41.6 | 1264 | [71] | |
| 1.32 | 29.0 | 925 | DOE-JGI | |||
| 1.18 | 30.1 | 1105 | [71] | |||
| 1.5 | 27.5 | 950 | [72] | |||
| 1.52 | 27.5 | 888 | [73] | |||
| 0.86 | 41.1 | 932 | [71] | |||
| 1.52 | 31.6 | 1429 | [74] | |||
| 1.27 | 32.4 | 1374 | [75] | |||
| 1.59 | 32.5 | 1512 | [76] | |||
| 1.11 | 29.0 | 835 | [77] | |||
| 1.11 | 28.9 | 838 | [78] | |||
| 1.27 | 35.2 | 1195 | [79] | |||
| Rickettsiales | 1.08 | 34.2 | 805 | [80] | ||
| 3.05 | 63.1 | 3011 | GBM Foundation | |||
| 3.56 | 65.2 | 3937 | DOE-JGI | |||
| Sphingomonadales | 3.37 | 65.5 | 3195 | DOE-JGI | ||
| Sphingomonadales | 5.93 | 67.9 | 5345 | DOE-JGI | ||
| 2.06 | 46.3 | 1998 | [53] |
* indicates species that were used in blast searches
+ Although these species are classified as Rhodobacterales, results presented here and elsewhere [29,47] suggest their placement in the Caulobacterales.
DOE-JGI, Department of Energy, Joint Genome Institute; GBM, The Gordon and Betty Moore Foundation Marine Microbiology Initiative; TIGR, The Institute for Genome Research.
Figure 1A neighbour-joining distance tree based on concatenated sequences for 12 conserved proteins. The numbers on the nodes indicate bootstrap scores (out of 100) observed in the neighbour-joining (NJ), maximum parsimony (MP) and maximum-likelihood (ML) analyses (NJ/MP/ML). The species marked with * are presently not part of the Caulobacterales order, but the results of phylogenetic and phylogenomic studies presented here suggest their placement in this group.
Figure 2Character compatibility cliques showing the two largest cliques of mutually compatible characters based on the two states sites in the concatenated sequence alignment for 12 conserved proteins. The cliques consisted of 350 mutually compatible characters. The numbers of characters that distinguished different clades are indicated on the nodes. Rooting was done using the sequences for Helicobacter pylori and Campylobacter jejuni. *, as in Figure 1.
Figure 3Summary of the phylogenomic analyses showing the species distribution of various α-proteobacteria-specific proteins and the suggested evolutionary stages where the genes for these proteins have likely evolved. The genes IDs for some proteins described in earlier work are indicated [17]. The information for all other proteins can be found in the indicated Tables or Additional files. A large numbers of conserved indels that are specific for different groups or clades within α-proteobacteria shown here have also been identified in our earlier work [16] (not shown here). The branching order of α-proteobacteria relative to other bacteria has been established in earlier work [32,58].
Proteins specific for the Rhizobiaceae and related species
| Gene ID | Accession Number | Function | Gene ID | Accession Number | Function |
| mll00621 | Hypothetical | mlr07891 | Hypothetical | ||
| mll40681 | Hypothetical | mlr3016 | Omp10 | ||
| mll77911,4 | Hypothetical | msl65261 | Hypothetical | ||
| mlr0777 1 | Hypothetical | ||||
| mll0122 1,4 | Hypothetical | mll5001 1,4 | Hypothetical | ||
| mll1268 2 | Hypothetical | mll8359 1,4 | Hypothetical | ||
| mll2847 2,3 | Hypothetical | mlr1823 1 | Hypothetical | ||
| mll2898 4 | Hypothetical | mlr0094 1,4 | MhpC (COG0596) | ||
| mll4298 1,2 | Hypothetical | ||||
| mll0080 | Hypothetical | mll67034 | Hypothetical | ||
| mll0867 | Hypothetical | mlr1904 | Hypothetical | ||
| mll9619 | Hypothetical | mlr3274 | Hypothetical | ||
| mlr5174 4 | Hypothetical | mlr4951 | NodF | ||
| mll6303 | Hypothetical | ||||
| mll0459 | Hypothetical | mll2007 | Hypothetical | ||
| mll1779 | Hypothetical | mlr1999 | Hypothetical | ||
| mll6195 | Hypothetical | mlr2029 | Hypothetical | ||
| mll8758 | Hypothetical | mlr6601 | Hypothetical | ||
| mlr3037 | Transcriptional regulator | ||||
1 Missing in Bartonella
2 Missing in Agrobacterium
3 Missing in Rhizobium.
4 Also found in Stappia aggregata
Proteins that are specific for the Bradyrhizobiaceae group
| Gene ID | Accession Number | Function | Gene ID | Accession Number | Function |
| bll60141,2 | Putative general secretion pathway protein M | Nwi_2179 | Hypothetical | ||
| Nwi_1093 | Hypothetical | Nwi_24321 | Hypothetical | ||
| Nwi_1227 | Hypothetical | Nwi_24761 | Putative bacterioferritin | ||
| Nwi_17861 | Hypothetical | Nwi_2572 | Hypothetical | ||
| Nwi_1788 | Hypothetical | Nwi_2623 | Hypothetical | ||
| Nwi_21471,3 | Hypothetical | Nwi_2707 | Hypothetical | ||
| bll58992 | Hypothetical | Nwi_2021 | Hypothetical | ||
| blr61061,2 | Hypothetical | Nwi_2063 | Hypothetical | ||
| Nwi_0278 | Hypothetical | Nwi_2064 | Hypothetical | ||
| Nwi_0503 | Hypothetical | Nwi_2163 | Hypothetical | ||
| Nwi_0528 | Hypothetical | Nwi_2173 | Hypothetical | ||
| Nwi_06051 | Hypothetical | Nwi_21831 | Hypothetical | ||
| Nwi_07101, | Hypothetical | Nwi_22083 | Hypothetical | ||
| Nwi_0925 | Hypothetical | Nwi_2244 | Hypothetical | ||
| Nwi_09661, | Hypothetical | Nwi_2247 | Hypothetical | ||
| Nwi_1084 | Hypothetical | Nwi_2379 | Hypothetical | ||
| Nwi_1092 | Hypothetical | Nwi_23812 | Hypothetical | ||
| Nwi_11073 | Hypothetical | Nwi_24143 | Hypothetical | ||
| Nwi_1108 | Hypothetical | Nwi_2489 | Hypothetical | ||
| Nwi_1336 | Hypothetical | Nwi_249213 | Hypothetical | ||
| Nwi_1139 | Hypothetical | Nwi_2500 | Hypothetical | ||
| Nwi_12473 | Hypothetical | Nwi_25063 | Hypothetical | ||
| Nwi_1270 | Hypothetical | Nwi_25093 | Hypothetical | ||
| Nwi_12753 | Hypothetical | Nwi_2531 | Hypothetical | ||
| Nwi_14541 | Hypothetical | Nwi_2575 | Hypothetical | ||
| Nwi_1498 | Hypothetical | Nwi_2577 | Hypothetical | ||
| Nwi_1512 | Hypothetical | Nwi_258813 | Hypothetical | ||
| Nwi_15813 | Hypothetical | Nwi_2630 | Hypothetical | ||
| Nwi_1582 | Hypothetical | Nwi_2676 | Hypothetical | ||
| Nwi_1586 | Dihydrofolate reductase | Nwi_26773 | Hypothetical | ||
| Nwi_16491,3 | Hypothetical | Nwi_2769 | Hypothetical | ||
| Nwi_1674 | Hypothetical | Nwi_27891 | Hypothetical | ||
| Nwi_1705 | Hypothetical | Nwi_29843 | Hypothetical | ||
| Nwi_1711 | Hypothetical | Nwi_2959 | Hypothetical | ||
| Nwi_1785 | Hypothetical | Nwi_3035 | Hypothetical | ||
| Nwi_1793 | Hypothetical | Nwi_3140 | Hypothetical | ||
| Nwi_18001,2 | Hypothetical | Nwi_31411 | Hypothetical | ||
1 Missing in one or more strains of Rhodopseudomonas
2 Missing in one or more species of Nitrobacter
3 Missing in Bradyrhizobium japonicum USDA 110
Proteins that are specific for the Rhodobacterales
| Gene ID | Accession Number | Function | Gene ID | Accession Number | Function |
| TM1040_00931 | Hypothetical | TM1040_1842 | Phasin, PhaP | ||
| TM1040_01842 | Hypothetical | TM1040_19673 | Putative CheA signal transduction | ||
| TM1040_02361,2 | Hypothetical | TM1040_1988 | Hypothetical | ||
| TM1040_0471 | Putative rod shape-determining protein MreD | TM1040_22632 | Hypothetical | ||
| TM1040_05862 | Hypothetical | TM1040_2370 | Hypothetical | ||
| TM1040_05872 | Hypothetical | TM1040_24253 | Hypothetical | ||
| TM1040_0697 | Hypothetical | TM1040_2466 | GCN5-related N-acetyltransferase COG045 | ||
| TM1040_07502,4 | Hypothetical | TM1040_24872 | Hypothetical | ||
| TM1040_07521 | Lipoprotein, putative | TM1040_25823 | Hypothetical | ||
| TM1040_1063 | Gene transfer agent | TM1040_2999 | Hypothetical | ||
| TM1040_1064 | Gene transfer agent | TM1040_30773 | Hypothetical | ||
| TM1040_12473 | Hypothetical | TM1040_3749 | Hypothetical | ||
| TM1040_13502 | Hypothetical | TM1040_3759 | Lipoprotein, putative | ||
| TM1040_1406 | Outer membrane chaperone Skp (OmpH) | TM1040_37643 | Putative transmembrane protein | ||
| TM1040_1567 | Hypothetical | ||||
| TM1040_15581 | Hypothetical | TM1040_21574 | Hypothetical | ||
| TM1040_17351 | Hypothetical | TM1040_24431,4 | Lipolytic enzyme, G-D-S-L | ||
| TM1040_18441,5,6 | Hypothetical | TM1040_26807 | Hypothetical | ||
| TM1040_10998 | Hypothetical | TM1040_3189 | |||
| TM1040_14238 | Hypothetical | TM1040_32028 | |||
| TM1040_1451 | Hypothetical | TM1040_32088 | |||
| TM1040_19868 | Hypothetical | TM1040_32268 | |||
| TM1040_2106 | Hypothetical | TM1040_35298 | |||
| TM1040_21398 | Hypothetical | TM1040_36268 | |||
| TM1040_30758 | Hypothetical | ||||
1 Missing in Loktanella vestfoldensis SKA53
2 Missing in one or more Rhodobacter sphaeroides strains
3 Missing in Paracoccus denitrificans PD12222
4 Missing in Rhodobacterales bacterium HTCC2654
5 Missing in one or more species of Roseovarius
6 Missing in Oceanicola batsensis HTCC2597
7 Missing in one or more species of Roseobacter
8 Missing in Silicibacter pomeroyi DSS-3
Proteins specific for the Caulobacter and related species
| Gene ID | Accession Number | Function | Gene ID | Accession Number | Function |
| CC04861 | Hypothetical | CC1066 | Hypothetical | ||
| CC24801 | Hypothetical | CC1586 | Hypothetical | ||
| CC27642 | Hypothetical | CC2207 | Hypothetical | ||
| CC3101 | Hypothetical | CC2628 | hfaA protein | ||
| CC0512 | Hypothetical | CC2639 | Hypothetical | ||
| CC1064 | Hypothetical | ||||
All of the proteins in this Table are present in C. crescentus as well as in O. alexandrii and M. maris.
1 These proteins are also present in H. neptunium and P. bermudensis.
2 Also found in P. bermudensis.
Proteins that are specific for the Sphingomonadales group of species
| Gene ID | Accession Number | Function | Gene ID | Accession Number | Function |
| Saro_0018 | Hypothetical | Saro_1291 | Hypothetical | ||
| Saro_00521 | Hypothetical | Saro_1378 | Hypothetical | ||
| Saro_0087 | Hypothetical | Saro_19141 | Hypothetical | ||
| Saro_0150 | Hypothetical | Saro_2130 | Hypothetical | ||
| Saro_0232 | Hypothetical | Saro_2788 | Hypothetical | ||
| Saro_0409 | Hypothetical | Saro_2958 | Hypothetical | ||
| Saro_1088 | Hypothetical | Saro_3138 | Hypothetical | ||
| Saro_1144 | Hypothetical | Saro_3213 | Hypothetical | ||
| Saro_0044 | Hypothetical | Saro_17482 | |||
| Saro_0154 | Hypothetical | Saro_1785 | |||
| Saro_04153 | Hypothetical | Saro_1972 | |||
| Saro_0458 | Hypothetical | Saro_2036 | |||
| Saro_1078 | Hypothetical | Saro_2037 | |||
| Saro_1126 | Hypothetical | Saro_2333 | |||
| Saro_1160 | Hypothetical | Saro_2548 | |||
| Saro_1163 | Hypothetical | ||||
1 Missing in Sphingomonas wittchii
2 Blast scores for C. crescentus and H. neptunium are also significant
3 Significant blast scores for C. crescentus
Figure 4Partial sequence alignments of DNA Gyrase B showing a 4 aa insert that is mainly specific for the Sphingomonadales species. A 4–5 aa insert present in some other α-proteobacteria could be due to either LGTs or taxonomic anomalies. The dashes (-) denote identity with the amino acid on the top line. Sequence information for other groups of bacteria (which do not contain this insert) is not shown.
Proteins that are specific for the Rhodospirillales group
| Gene ID | Accession Number | Function | Gene ID | Accession Number | Function |
| GOX06331 | Hypothetical | GOX0963 | Hypothetical | ||
| GOX06952 | Hypothetical | GOX12583 | Hypothetical | ||
| GOX0143 | Hypothetical | GOX1616 | Hypothetical | ||
| GOX0343 | ANK, ankyrin repeats | GOX2216 | Hypothetical | ||
| GOX1212 | Phage portal protein | GOX2275 | Hypothetical | ||
| GOX1215 | Putative phage protein | GOX2316 | Hypothetical | ||
| GOX1222 | Putative phage protein | GOX2452 | Hypothetical | ||
| GOX1224 | Putative phage protein | GOX2454 | Hypothetical | ||
| GOX1233 | Hypothetical | GOX2456 | Hypothetical | ||
| Rru_A0125 | Putative diguanylate phosphodiesterase | Rru_A2592 | Hypothetical | ||
| Rru_A0152 | Hypothetical | Rru_A2828 | Hypothetical | ||
| Rru_A0531 | Hypothetical | Rru_A3562 | Hypothetical | ||
| Rru_A1689 | Hypothetical | Rru_A3636 | Hypothetical | ||
| Rru_A1756 | Hypothetical | Rru_A3662 | Hypothetical | ||
| Rru_A2112 | Hypothetical | Rru_A3739 | Hypothetical | ||
| Rru_A2510 | Predicted transcriptional regulator | Rru_A3800 | Hypothetical | ||
1 Missing in Rhodospirillum
2 Missing in Acidiphilium
3 Missing in Magnetospirillum
Figure 5Partial sequence alignments of RNA polymerase β subunit (RpoB) showing a large insert (boxed) that is a distinctive characteristic of various Rhodospirillales species and not found in any other bacteria. There are two homologs of RpoB in Magentospirillum (Mag.) magneticum and only one of these contains the insert. The dashes (-) denote identity with the amino acid on the top line.
Protein that are specific for the Rickettsiales group of species
| Gene ID | Accession Number | Function | Gene ID | Accession Number | Function |
| WD0161 | Hypothetical | WD07711 | Hypothetical | ||
| WD0715 | Hypothetical | ||||
| WD0083 | Hypothetical | WD0827 | Hypothetical | ||
| WD0157 | Hypothetical | WD0863 | Hypothetical | ||
| WD0821 | Hypothetical | ||||
| WD0148 | Hypothetical | WD0772 | Hypothetical | ||
| WD0412 | Hypothetical | WD1025 | Hypothetical | ||
| WD0467 | Preprotein translocase, SecG | WD1056 | Hypothetical | ||
| WD0757 | Hypothetical | WD1220 | Hypothetical | ||
| WD0764 | Hypothetical | WD1230 | Hypothetical | ||
1 Missing in Neorickettsia and Orientia