| Literature DB >> 31454394 |
Abstract
In a recent publication in PLOS ONE, Gabriele Margos and colleagues have questioned the division of the genus Borrelia into two genera on the basis that the differences in percentage of conserved proteins (POCP) between these two groups is >50%, which an earlier study has suggested as the threshold for differentiating prokaryotic genera. However, the POCP threshold is a poorly characterized and rarely used criterion for establishing distinction among prokaryotic genera. Detailed evaluation of the intergeneric POCP values for 37 genera from 3 different families (viz. Enterobacteriaceae- 24 genera, Morganellaceae-8 genera and Cystobacteraceae-5 genera) presented here shows that the POCP values for all genera within each of these families exceeded >58%. Thus, the suggested POCP threshold is not a useful criterion for delimitation of genus boundary and the objection by Margos et al. on this ground is invalid. Additionally, Margos et al. have questioned the specificities of ~15-20% of the conserved signature indels (CSIs) described in our work. However, as shown here, this concern is due to misunderstanding of the results and the CSIs in question are still highly-specific characteristics of the members of these genera and they provide important information regarding the evolutionary relationships of two new reptiles-echidna-related species viz. Borrelia turcica and Candidatus Borrelia tachyglossi to other Borrelia species. Results presented here show that both these species are deeper-branching members of the genus Borrelia and their placement within this genus is strongly supported by phylogenetic analyses and multiple uniquely shared CSIs with the other Borrelia species. Based on the large body of evidence derived from phylogenetic, genomic, molecular, phenotypic and clinical features, it is contended that the characteristics clearly distinguishing the Borrelia and Borreliella genera are far more numerous and of different kinds than those discerning most (all) other neighbouring genera of prokaryotes. Thus, the placement of these two groups of microorganisms into distinct genera, Borrelia and Borreliella, which clearly recognizes the differences among them, is highly appropriate and it should lead to a better understanding of the clinical, molecular and biological differences between these two important groups of microbes.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31454394 PMCID: PMC6711536 DOI: 10.1371/journal.pone.0221397
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A comparison matrix showing the averages of the percentage of conserved proteins (POCP) within and between different genera of the family Enterobacteriaceae.
POCP was determined for all genome sequenced species from the family Enterobacteriaceae detailed in our earlier work [26]. The values along the diagonal shows the average POCP values for different species within a given genus (i.e. interspecies values), whereas all other values represent average intergeneric POCP values for different genera within this family. The blank cells indicate that only a single species was available for these genera and hence their interspecies values could not be calculated.
Fig 2A pair-wise comparison matrix based on percentage of conserved proteins (POCP) in chromosomal genes from different genome sequenced Borreliaceae species.
The matrix was constructed using an internally developed pipeline [13,14]. Genome pairs sharing higher POCP are shaded more darkly (red). Based on their POCP values, species belonging to the family Borreliaceae form two main groups, with one group containing all of the LD and related species (or Borreliella), and the other encompassing RF group of species together with the reptile-and echidna- associated species B. turcica and Candidatus Borrelia tachyglossi (genus Borrelia).
Fig 3Phylogenetic trees showing the branching of Borreliaceae species.
(A) A maximum-likelihood (ML) tree based on concatenated sequences of 703 core proteins found in the genomes of Borreliaceae species; (B) A tree based on sequence alignment for the RNA polymerase β’- subunit (RpoC protein). (C) A ML tree for Borreliaceae species based on 16S rRNA gene sequences.
Fig 4A summary diagram showing the species specificities of different CSIs reported in our earlier work [6].
The CSIs described in our earlier work were of two kinds. Panels (A) and (B) present the results for CSIs, where sequence information for outgroup species was available, whereas panels (C) and (D) show results for CSIs which are found in proteins that are limited to the Borreliaceae species (i.e. no homologs in any outgroup species). Panels (A) and (C) show the results as reported earlier [6], whereas panels (B) and (D) show how the observed specificities of the CSIs have been affected upon inclusion of sequences for B. turcica and Candidatus Borrelia tachyglossi. The asterisks (*) marks the CSIs whose specificities have been questioned by Margos et al. [10]. As shown here and as discussed in the text, these CSIs remain specific for the RF group (genus Borrelia) in addition to providing important information regarding the branching or phylogenetic placement of B. turcica and Candidatus Borrelia tachyglossi within the genus Borrelia and family Borreliaceae.
Conserved signature indels (CSIs) found in widely distributed proteins that are specific for the members of the Lyme disease Borrelia (i.e. genus Borreliella).
| Protein Name | GI Number | Indel Size | Indel Position | |
|---|---|---|---|---|
| Recombinase A | 492960118 | BB_0131 | 1 aa ins | 228–272 |
| Trigger factor Tig | 386854012 | BB_0347 | 2 aa ins | 106–142 |
| Chemotaxis protein CheY | 15594760 | BB_0415 | 1 aa del | 197–231 |
| DNA polymerase III subunit beta | 410679212 | BB_0438 | 1 aa del | 135–176 |
| Translation factor Sua5 | 15595079 | BB_0610 | 2 aa ins | 149–182 |
| Ferrous iron transporter A | 51598605 | BB_0730 | 1 aa del | 88–126 |
| Glucose-6-phosphate isomerase | 493478887 | BB_0734 | 1 aa ins | 81–134 |
These 7 CSIs described in our earlier work [6] are uniquely shared by different genome-sequenced Borreliella (or LD group) species. Updating of sequence information for these CSIs show that they are not found in any species from the genus Borrelia (RF group) including Borrelia turcica and Candidatus Borrelia tachyglossi. Sequence alignments for these CSIs have been presented in earlier work [6].
Fig 5Partial sequence alignments of two CSIs in proteins with outgroup species that were previously reported as specific for the RF clade.
Panel (A) shows a 6 aa insert in a hypothetical protein BDU327 (BB_0326) that is specifically found in all members of the genus Borrelia including B. turcica and Candidatus Borrelia tachyglossi. (B) This panel shows a 1 aa insert in the L-lactate permease protein, which is only shared by all RF clade species but is absent in the B. turcica and Candidatus Borrelia tachyglossi homologs, which are deeper branching members of the genus Borrelia (see Figs 3 and 4). Dashes (-) in all alignments shows sequence identity with the amino acids on the top line.
Conserved signature indels in proteins that are specific for either all members of the Genus Borrelia or those lacking in the deeper branching Borrelia turcica and Candidatus Borrelia tachyglossi.
| Protein Name | GI Number | Indel Size | Indel Position | |
|---|---|---|---|---|
| Hypothetical protein BRE16 | 203287484 | 3 aa ins | BB_0011 | 64–98 |
| Hypothetical protein BDU327 | 203284245 | 6 aa ins | BB_0326 | 866–907 |
| 1-phosphofructokinase | 203288064 | 1 aa del | BB_0630 | 101–139 |
| GTP-binding protein | 203288075 | 2 aa ins | BB_0643 | 42–87 |
| Nicotinamide-nucleotide adenylyltransferase | 187918635 | 1 aa del | BB_0782 | 31–61 |
| Hypothetical protein BT04711 | 119953261 | 1 aa del | BB_0471 | 216–261 |
| L-lactate permease | 386859838 | 1 aa ins | BB_0604 | 195–239 |
| Sodium/panthothenate symporter | 119953591 | 1 aa ins | BB_0814 | 421–454 |
These 8 CSIs were described as specific for the RF clade of Borrelia species in our earlier work [6]. Upon inclusion of sequence information for including Borrelia turcica and Candidatus Borrelia tachyglossi, 4 of these CSIs are also uniquely shared by these two species, whereas the other four CSIs listed below are absent in these two deeper branching species. These CSIs provide evidence that both Borrelia turcica and Candidatus Borrelia tachyglossi are members of the genus Borrelia and constitute deeper branching lineages of this genus (see summary Fig 4). Sequence alignment of one CSIs’ of each kind is presented in Fig 5.
Conserved signature indels in Borreliaceae-specific proteins distinguishing Borrelia and Borreliella and showing deeper branching of the RR group of species within the genus Borrelia.
| Protein Name | Indel Size | RF Clade | RR Species | LD Clade | Interpretation | |
|---|---|---|---|---|---|---|
| Hypothetical protein | BB_0028 | 2 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein | BB_0028 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BRE47 | BB_0044 | 5 aa | - | - | + | Distinguishes two genera |
| L-proline transport system ATP-binding protein | BB_0146 | 1 aa | - | - | + | Distinguishes two genera |
| Penicillin-binding protein | BB_0136 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein Q7M131 | BB_0125 | 1 aa | + | + | - | Distinguishes two genera |
| Hypothetical protein BT0110 | BB_0110 | 2 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BT0110 | BB_0110 | 2 aa | - | N/A | + | Distinguishes two genera |
| Glutamate racemase | BB_0100 | 6 aa | - | - | + | Distinguishes two genera |
| RNA methyltransferase RsmE | BB_0062 | 1 aa | + | + | - | Distinguishes two genera |
| DNA mismatch repair protein mutL | BB_0211 | 4 aa | + | + | - | Distinguishes two genera |
| Hypothetical protein BRE314 | BB_0227 | 1 aa | + | + | - | Distinguishes two genera |
| Methylgalactoside ABC transporter ATP-binding protein | BB_0318 | 1 aa | - | - | + | Distinguishes two genera |
| Sensory transduction histidine kinase | BB_0420 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein Q7M860 | BB_0455 | 2 aa | + | + | - | Distinguishes two genera |
| Hypothetical protein KK90081 | BB_0083 | 1 aa | - | - | + | Distinguishes two genera |
| Outer membrane protein | BB_0167 | 1 aa | - | - | + | Distinguishes two genera |
| Transglycosylase SLT domain-containing protein | BB_0259 | 1 aa | - | - | + | Distinguishes two genera |
| Cell division protein FtsZ | BB_0299 | 1 aa | - | - | + | Distinguishes two genera |
| Excinuclease ABC subunit C | BB_0457 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BG0519 | BB_0507 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BBIDN1270545 | BB_0543 | 4 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BBUN400354 | BB_0354 | 3 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BBUZS70553 | BB_0543 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BB0554 | BB_0554 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BB0554 | BB_0554 | 2 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein BBUCA803285 | BB_0664 | 1 aa | - | - | + | Distinguishes two genera |
| Chemotaxis protein | BB_0681 | 1 aa | - | - | + | Distinguishes two genera |
| Hypothetical protein L14403475 | BB_0707 | 1 aa | - | - | + | Distinguishes two genera |
| Membrane protein | BB_0234 | 1 aa | + | - | Insertion occurred after the branching of | |
| DNA polymerase III subunit -ta | BB_0455 | 2 aa | - | + | Insertion occurred after the branching of | |
| Hypothetical protein BB0838 | BB_0838 | 3 aa | - | + | + | Deletion in the RF clade |
| Putative lipoprotein | BB_0227 | 3 aa | + | - | - | Insertion in the RF Clade |
| Hypothetical protein BRE355 | BB_0353 | 1 aa | + | - | - | Insertion in the RF Clade |
| Hypothetical protein Q7M140 | BB_0134 | 2 aa | + | - | - | Insertion in the RF Clade |
| Hypothetical protein BG0159 | BB_0161 | 1 aa | - | + | + | Deletion in the RF clade |
| Methyl-accepting chemotaxis protein | BB_0681 | 2 aa | + | - | - | Insertion in the RF Clade |
| Chemotaxis protein | BB_0681 | 1 aa | - | + | + | Deletion in the RF clade |
These CSIs were previously indicated to differentiate members of the genus Borrelia and Borreliella [6]. With the inclusion of sequence information for B. turcica and Candidadus Borrelia tachyglossi, these CSIs still differentiate the members of these two genera; however, some of them also show the deep branching of the RR group of species in comparison to the other Borrelia species. Abbreviations: RR–refer to the repitles- and echidna- related species B. turcica and Candidadus Borrelia tachyglossi RF–Relapsing Fever Clade; LD–Lyme Disease Clade; + = presence of insert;— = absence of insert
# Candidadus Borrelia tachyglossi contains the insert but it is absent in B. turcica.
Fig 6Partial sequence alignments of three CSIs in proteins found only in the Borreliaceae species providing differentiation among members of the genera Borrelia and Borreliella.
(A) This panel shows a 2 aa CSI in a hypothetical protein BT0110 that differentiates the members of the genera Borrelia and Borreliella. Twenty nine other CSIs also show a similar species distribution (Table 3). Due to the absence of outgroup species it is difficult to infer whether this CSI is an insert in the genus Borrelia or a deletion in the genus Borreliella. (B) A 3 aa CSI in a putative lipoprotein that is specific for the RF clade of species. Due to the absence of this CSI in the LD clade as well as in B. turcica and Candidatus Borrelia tachyglossi homologs this CSI is an insert in the RF clade of species (see Fig 4). (C) A 2 aa CSI in DNA polymerase III subunit delta, which is commonly shared by the LD clade of species and Cand. Borrelia tachyglossi, but absent in B. turcica and the RF group of species. Based on its species distribution, this CSI is inferred to be an insert in a common ancestor of the RF clade and B. turcica (see Fig 4 for additional information).
Clinical, molecular and phenotypic differences between members of the RF-RR group (genus Borrelia) and the Lyme disease group (genus Borreliella).
| Characteristics | Genus | Genus | References |
|---|---|---|---|
| Relapsing fever causing bacteria | Encompasses All | None | See [ |
| Lyme-disease causing bacteria | None | Encompasses All | See [ |
| Branching in phylogenetic trees based on 16S rRNA and other genes/proteins sequences | In all phylogenetic trees, members of the RF+RR group and the LD group form strongly-supported clades clearly separated from each other. | [ | |
| Average Nucleotide Identity (ANI) Matrix based on Genomes | Members of the RF+RR group and the LD group are clearly differentiated based on higher similarity seen between the members of each group. | [ | |
| Average Amino Acid Identity (AAI) Matrix based on Genomes | Members of the RF+RR group and the LD group are clearly differentiated based on higher similarity seen between the members of each group. | [ | |
| Percentage of Conserved Protein (POCP) Matrix based on Genome Sequences | Members of the RF+RR group and the LD group are clearly differentiated based on higher similarity seen between the members of each group. | Present study | |
| Conserved Signature Indels (CSIs) in widely-distributed proteins | 4 CSIs exclusively found in different members of this group. 4 other CSIs are also specific for the RF clade but lacking in deeper branching RR group of species. | 7 CSIs exclusively found in the LD group differentiating it from the RF+RR group of species. | [ |
| Conserved Signature Indels (CSIs) in Borreliaceae-specific proteins | 29 CSIs provide clear differentiation between these two groups of species + 9 CSIs specific for the RF clade and show deeper branching of the RR group of species. | Present study | |
| Conserved Signature Proteins | 4 CSPs exclusively found in most members of this group. | 17 CSPs exclusively (or mainly) found in members of this group. | [ |
| Arthropod vectors | Argasid ticks, prostriate and metastriate ixodid ticks and human body louse | Primarily prostriate ticks of the genus | See [ |
| Density of Spirochetes in blood of infected humans/animals | High | Low | See [ |
| Average number of flagella at one end of cells | Mostly in the range of 15–20 | Generally in the range of 7–11 | See [ |
In addition to the characteristics noted in this Table, some other molecular and phenotypic differences between members of these two genera have been summarized by Barbour [5].
# Based on updated sequence information.
* only B. recurrentis is transmitted via a louse.