| Literature DB >> 23908650 |
Radhey S Gupta1, Sharmeen Mahmood, Mobolaji Adeolu.
Abstract
The Spirochaetes species cause many important diseases including syphilis and Lyme disease. Except for their containing a distinctive endoflagella, no other molecular or biochemical characteristics are presently known that are specific for either all Spirochaetes or its different families. We report detailed comparative and phylogenomic analyses of protein sequences from Spirochaetes genomes to understand their evolutionary relationships and to identify molecular signatures for this group. These studies have identified 38 conserved signature indels (CSIs) that are specific for either all members of the phylum Spirochaetes or its different main clades. Of these CSIs, a 3 aa insert in the FlgC protein is uniquely shared by all sequenced Spirochaetes providing a molecular marker for this phylum. Seven, six, and five CSIs in different proteins are specific for members of the families Spirochaetaceae, Brachyspiraceae, and Leptospiraceae, respectively. Of the 19 other identified CSIs, 3 are uniquely shared by members of the genera Sphaerochaeta, Spirochaeta, and Treponema, whereas 16 others are specific for the genus Borrelia. A monophyletic grouping of the genera Sphaerochaeta, Spirochaeta, and Treponema distinct from the genus Borrelia is also strongly supported by phylogenetic trees based upon concatenated sequences of 22 conserved proteins. The molecular markers described here provide novel and more definitive means for identification and demarcation of different main groups of Spirochaetes. To accommodate the extensive genetic diversity of the Spirochaetes as revealed by different CSIs and phylogenetic analyses, it is proposed that the four families of this phylum should be elevated to the order level taxonomic ranks (viz. Spirochaetales, Brevinematales ord. nov., Brachyspiriales ord. nov., and Leptospiriales ord. nov.). It is further proposed that the genera Borrelia and Cristispira be transferred to a new family Borreliaceae fam. nov. within the order Spirochaetales.Entities:
Keywords: Borreliaceae; Brachyspiriales; Leptospiriales; Spirochaetaceae; Spirochaetes; Spirochaetes phylogeny and taxonomy; conserved signature indels; molecular signatures
Year: 2013 PMID: 23908650 PMCID: PMC3726837 DOI: 10.3389/fmicb.2013.00217
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Genome characteristics of the sequenced members of the phylum Spirochaetes.
| NC_017238 | 1.4 | 27.90 | 1 | 17 | Casjens et al., | |
| NC_015921 | 1.4 | 28.33 | 1 | 16 | Schutzer et al., | |
| NC_001318 | 1.52 | 28.18 | 1 | 21 | Zhong and Barbour, | |
| NC_017808 | 1.53 | 29.06 | 1 | 39 | Elbir et al., | |
| NC_011229 | 1.57 | 28.02 | 1 | 16 | Lescot et al., | |
| NC_006156 | 0.99 | 28.12 | 1 | 11 | Glöckner et al., | |
| NC_010673 | 0.93 | 29.81 | 1 | 2 | Dai et al., | |
| NC_011244 | 1.24 | 27.51 | 1 | 7 | Unité des Rickettsies | |
| NZ_ABJZ00000000 | 1.28 | 28.27 | 1 | 9 | Casjens et al., | |
| NZ_ABKB00000000 | 1.25 | 27.69 | – | 8 | Schutzer et al., | |
| NC_008710 | 0.92 | 29.10 | 1 | – | Rocky Mountain Laboratories | |
| NZ_ABCY00000000 | 0.35 | 25.83 | – | 11 | Schutzer et al., | |
| NZ_ARSY00000000 | 3.05 | 27.00 | 1 | 1 | DOE-JGI | |
| NC_017243 | 3.31 | 27.19 | 1 | 1 | Håfström et al., | |
| NC_014150 | 3.24 | 27.80 | 1 | – | Pati et al., | |
| NC_019908 | 2.56 | 27.90 | 1 | – | Lin et al., | |
| NZ_AHKT00000000 | 4.52 | 54.30 | – | – | DOE-JGI | |
| NC_010842 | 3.96 | 38.90 | 2 | 1 | Picardeau et al., | |
| NC_008509 | 3.93 | 40.20 | 2 | – | Bulach et al., | |
| NZ_AHMO00000000 | 4.49 | 42.90 | – | – | JCV | |
| NZ_AHMM00000000 | 4.57 | 44.50 | – | – | JCV | |
| NZ_AOVR00000000 | 4.6 | 35.00 | 2 | – | JCV | |
| NZ_AHMN00000000 | 4.4 | 35.90 | – | – | JCV | |
| NZ_AHMP00000000 | 4.48 | 44.70 | – | – | JCV | |
| NZ_AHOO00000000 | 4.21 | 35.90 | – | – | JCV | |
| NZ_AKXE00000000 | 4.19 | 38.00 | – | – | JCV | |
| NZ_ADOR00000000 | 3.88 | 41.80 | – | – | Chou et al., | |
| NZ_AKWV00000000 | 4.04 | 41.70 | – | – | JCV | |
| NZ_AFLV00000000 | 4.37 | 40.80 | – | – | JCV | |
| NC_015436 | 2.23 | 50.60 | 1 | – | Abt et al., | |
| NC_015152 | 3.32 | 48.90 | 1 | – | DOE-JGI | |
| NC_016633 | 3.59 | 46.20 | 1 | – | DOE-JGI | |
| NC_017098 | 3.29 | 57.80 | 1 | – | DOE-JGI | |
| NC_014364 | 4.65 | 49.00 | 1 | – | Mavromatis et al., | |
| NC_017583 | 2.56 | 60.90 | 1 | – | DOE-JGI | |
| NC_015577 | 3.86 | 49.80 | 1 | – | JCV | |
| NC_015500 | 3.06 | 51.50 | 1 | – | DOE-JGI | |
| NC_015732 | 3.24 | 45.60 | 1 | – | Abt et al., | |
| NC_002967 | 2.84 | 37.90 | 1 | 1 | Seshadri et al., | |
| NC_000919 | 1.14 | 52.80 | 1 | – | Fraser et al., | |
| NC_015714 | 1.13 | 52.70 | – | – | Smajs et al., | |
| NZ_AEFH00000000 | 2.83 | 40.10 | – | – | WUGSC | |
| NC_015578 | 4.06 | 50.80 | 1 | – | JCV | |
| NZ_AGRW00000000 | 3.45 | 53.20 | – | – | DOE-JGI | |
| NZ_AJGU00000000 | 3.03 | 40.30 | – | – | CSIRO | |
| NC_015385 | 2.9 | 39.17 | 1 | 1 | Han et al., | |
| NZ_ACYH00000000 | 2.51 | 45.70 | – | – | JCV | |
| NC_018020 | 4.41 | 53.60 | 1 | 1 | DOE-JGI |
Genomic information was collected from: http://www.ncbi.nlm.nih.gov/genomes/
Unité des Rickettsies: Genome sequenced by Unité des Rickettsies at Center National de Référence.
Rocky Mountain Laboratories: Genome sequenced by the Laboratory of Human Bacterial Pathogenesis at Rocky Mountain Laboratories.
DOE-JGI: Genome sequenced by the United States Department of Energy Joint Genome Institute.
JCV: Genome sequenced by the J. Craig Venter Institute.
WUGSC: Genome sequenced by the Washington University Genome Sequencing Center.
CSIRO: Genome sequenced by the Commonwealth Scientific and Industrial Research Organization.
Type strain.
Figure 2A ML tree based on the 16S rRNA gene sequences of representative species from cultured genera within the phylum Spirochaetes. Bootstrap values are shown at branch nodes. The different families of the phylum Spirochaetes are marked. The letterT refers to the type strain of the species. The accession numbers of the 16S rRNA gene sequences used in this analysis are provided in Supplemental Table 1.
Figure 1A phylogenetic tree of genome sequenced members of the phylum Spirochaetes based on the concatenated amino acid sequences of 22 conserved proteins. The tree shown is a maximum-likelihood (ML) distance tree. Bootstrap values are shown at branch nodes for both maximum-likelihood and neighbor-joining tree construction methods as ML/NJ. The different sequenced families and two main clades of the family Spirochaetaceae supported by the tree are marked. The letter T refers to the type strain of the species.
Figure 3A partial sequence alignment of the flagellar basal-body rod protein FlgC, showing a CSI (boxed) that is uniquely present in all members of the phylum Spirochaetes. Sequence information for only a limited number of species from the Spirochaetes and other bacteria is shown here, but unless otherwise indicated similar CSIs were detected in all members of the indicated group and not detected in any other bacterial species in the top 250 Blastp hits. The dashes (−) in the alignments indicate identity with the residue in the top sequence. GenBank identification (GI) numbers for each sequence are indicated in the second column. Sequence homologs for this protein were not identified from members of the genus Sphaerochaeta.
Figure 4A partial sequence alignment of the protein alanyl-tRNA synthetase showing a two amino acid insertion (boxed) identified in homologs from the family . Sequence information for other Spirochaetaceae specific CSIs is presented in Supplemental Figures 3–6 and summarized in Table 2.
Conserved signature Indels that are specific for members of the family .
| Phosphoribosylpyrophosphate synthetase | prsA | 496158147 | Figure | 15 aa ins | 97–143 |
| Alanyl-tRNA synthetase | alaS | 386859446 | Supplemental Figure | 2 aa ins | 277–306 |
| Phosphoribosylpyrophosphate synthetase | prsA | 387827445 | Supplemental Figure | 8 aa ins | 256–297 |
| Preprotein translocase | secY | 15639201 | Supplemental Figure | 1 aa del | 340–373 |
| Peptide chain release factor 2 | prfB | 257457828 | Supplemental Figure | 1 aa del | 137–176 |
| DNA mismatch repair protein MutS | mutS | 224532424 | Supplemental Figure | 2 aa del | 720–751 |
| DNA mismatch repair protein MutL | mutL | 338706271 | Supplemental Figure | 4 aa del | 494–520 |
Figure 5Partial sequence alignments of (A) Flagellar hook-associated protein FlgK and (B) DNA polymerase I, showing two CSIs that are specific for the family . Sequence homologs for flagellar hook-associated protein FlgK were not identified from members of the genus Sphaerochaeta. Sequence information for other Brachyspiraceae specific CSIs is presented in Supplemental Figures 7–10 and summarized in Table 3.
Conserved signature Indels that are specific for members of the family .
| Flagellar hook-associated protein FlgK | flgK | 225620569 | Figure | 1 aa ins | 62–104 |
| DNA polymerase I | polA | 296127550 | Figure | 1 aa ins | 810–852 |
| Valyl-tRNA synthetase | valS | 300871449 | Supplemental Figure | 1 aa ins | 225–263 |
| Valyl-tRNA synthetase | valS | 300871449 | Supplemental Figure | 2 aa del | 660–703 |
| ATP-dependent protease La | lon | 225620632 | Supplemental Figure | 1 aa ins | 760–793 |
| Glutamyl-tRNA amidotransferase subunit B | gatB | 300871379 | Supplemental Figure | 1 aa ins | 325–361 |
Figure 6Partial sequence alignments of (A) 50S Ribosomal protein L14 and (B) Alanyl-tRNA synthetase, showing two CSIs that are specific for the family . Sequence information for other Leptospiraceae specific CSIs is presented in Supplemental Figures 11–13 and summarized in Table 4.
Conserved signature Indels that are specific for members of the family .
| 50S Ribosomal protein L14 | rplN | 5163214 | Figure | 8 aa ins | 36–73 |
| Alanyl-tRNA synthetase | alaS | 45656657 | Figure | 4 aa ins | 165–211 |
| 30S Ribosomal protein S2 | rpsB | 116330588 | Supplemental Figure | 2 aa ins | 108–141 |
| Flagellar filament core protein FlaB | flaB | 12657818 | Supplemental Figure | 4 aa del | 130–168 |
| Flagellar basal-body rod protein FlgG | flgG | 294828153 | Supplemental Figure | 1 aa ins | 80–123 |
Figure 7(A) Partial sequence alignment of the protein 6-phosphofructokinase (pyrophosphate) containing a 1 amino acid insert in a conserved region that is specifically present in the species from the genera Treponema, Spirochaeta, and Sphaerochaeta, but not found in any other sequenced bacteria. (B) Partial sequence alignment of phosphofructokinase containing a 6 amino acid insert that is specific for the genera Borrelia. Sequence information for other CSIs showing similar specificities is provided in Table 5 and in Supplemental Figures 14–30.
Conserved Signature Indels that are specific for groups within the family .
| 6-phosphofructokinase (pyrophosphate) | pfp | 15639102 | Figure | 1 aa ins | 148–184 | |
| Bifunctional Hpr kinase/phosphatase | hprK | 3322886 | Supplemental Figure | 1 aa ins | 183–221 | |
| 30S ribosomal protein S13 | rpsM | 302337499 | Supplemental Figure | 1 aa del | 1–39 | |
| Phosphofructokinase | pfk | 219685531 | Figure | 6 aa ins | 275–319 | |
| 50S ribosomal protein L4 | rplD | 224534698 | Supplemental Figure | 1 aa ins | 103–136 | |
| tRNA pseudouridine 55 synthase | truB | 203284699 | Supplemental Figure | 2 aa ins | 143–178 | |
| Translation elongation factor Tu | tuf | 203284386 | Supplemental Figure | 1 aa del | 330–369 | |
| Histidyl-tRNA synthetase | hisS | 187918014 | Supplemental Figure | 1 aa del | 273–301 | |
| Seryl-tRNA synthetase | serS | 187918098 | Supplemental Figure | 1 aa del | 231–264 | |
| Spoiiij-associtated protein | jag | 219684344 | Supplemental Figure | 3 aa ins | 114–154 | |
| Nicotinate phosphoribosyltransferase | pncB | 187918492 | Supplemental Figure | 1 aa del | 134–159 | |
| Ribose 5-phosphate isomerase | rpiA | 119953435 | Supplemental Figure | 1 aa ins | 86–110 | |
| Ribonuclease Z | rnz | 195941574 | Supplemental Figure | 2 aa ins | 64–94 | |
| Hypothetical protein BGAFAR04_0762 | – | 386859948 | Supplemental Figure | 1 aa ins | 206–236 | |
| Signal recognition particle, subunit FFH/SRP54 | – | 119953471 | Supplemental Figure | 1 aa ins | 374–412 | |
| Hypothetical protein BSV1_0075 | – | 15594416 | Supplemental Figure | 1 aa del | 52–97 | |
| Aspartyl/glutamyl-tRNA amidotransferase subunit A | gatA | 119953137 | Supplemental Figure | 1 aa ins | 364–402 | |
| Ribosomal RNA methyltransferase | rlmE | 203284234 | Supplemental Figure | 1 aa ins | 15–48 | |
| LysM domain/M23/M37 peptidase domain protein | – | 224534310 | Supplemental Figure | 1 aa ins | 320–365 |
Figure 8A summary diagram depicting the distribution of identified CSIs and the proposed reclassification of the groups within the phylum Spirochaetes. A representative strain is listed for each genome sequenced species. The letter T refers to the type strain of the species.