| Literature DB >> 30760200 |
Agustín Estrada-Peña1, Alejandro Cabezas-Cruz2.
Abstract
BACKGROUND: The evolutionary history of a species is frequently derived from molecular sequences, and the resulting phylogenetic trees do not include explicit functional information. Here, we aimed to assess the functional relationships among bacteria in the Spirochaetes phylum, based on the biological processes of 42,489 proteins in reference proteomes of 34 Spirochaetes species. We tested the hypothesis that the species in the genus Borrelia might be sufficiently different to warrant splitting them into two separate genera.Entities:
Keywords: Borrelia; Borreliella; Functional analysis; Phyloproteomics; Unsupported split
Mesh:
Substances:
Year: 2019 PMID: 30760200 PMCID: PMC6375133 DOI: 10.1186/s12862-019-1379-2
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
The list of species and strains included in this study, with reference to the identification number of the proteome and the organism in the UniProt database, as well as the number of proteins annotated
| Organism | Group | Proteome ID | Organism ID | Protein count |
|---|---|---|---|---|
| Lb | UP000005216 | 390,236 | 1394 | |
|
| Rf | UP000019262 | 1,313,293 | 855 |
| Lb | UP00000276 | 290,434 | 1262 | |
| Lb | UP000183624 | 64,897 | 1794 | |
| Lb | UP000001634 | 521,010 | 1403 | |
| Lb | UP000001807 | 224,326 | 1290 | |
| Lb | UP000006901 | 445,985 | 1223 | |
| Lb | UP000006162 | 498,740 | 1518 | |
| Rf | UP000019330 | 1,313,292 | 895 | |
| Rf | UP000005212 | 1,155,096 | 1469 | |
| Rf | UP000000612 | 1,293,575 | 845 | |
| Rf | UP000000611 | 412,419 | 1287 | |
| Rf | UP000019148 | 1,432,657 | 1995 | |
| Lb | UP000006103 | 498,743 | 1346 | |
| Rf | UP000075229 | 140 | 1230 | |
| Rf | UP000019324 | 1,313,291 | 840 | |
| Lb | UP000185492 | 1,674,146 | 1131 | |
| Rf | UP000176410 | 47,466 | 1079 | |
| Rf | UP000019332 | 1,292,392 | 874 | |
| Rf | UP000019331 | 1,313,294 | 873 | |
| Rf | UP000000612 | 412,418 | 977 | |
| Rf | UP000019262 | 1,313,293 | 855 | |
| Lb | UP000030940 | 1,245,910 | 898 | |
| Lb | UP000006166 | 498,741 | 1300 | |
| Lb | UP000003481 | 498,742 | 1297 | |
| Lb | UP000006163 | 445,987 | 1349 | |
| LTB | UP000001803 | 565,034 | 2642 | |
| LTB | UP000007346 | 1,133,568 | 2637 | |
| LTB | UP000001847 | 456,481 | 3723 | |
| LTB | UP000011783 | 1,193,007 | 4773 | |
| LTB | UP000001408 | 57,678 | 3676 | |
| FL | UP000002318 | 573,413 | 4211 | |
| FL | UP000007939 | 760,011 | 1819 | |
| FL | UP000008466 | 158,189 | 3007 | |
| FL | UP000005632 | 158,190 | 3150 | |
| FL | UP000007383 | 889,378 | 2766 | |
|
| FL | UP000029692 | 1,480,694 | 2285 |
| FL | UP000001296 | 665,571 | 2199 | |
| LTB | UP000008212 | 243,275 | 2753 | |
| LTB | UP000014541 | 1,125,699 | 2287 | |
| LTB | UP000000811 | 243,276 | 1028 |
The organisms are alphabetically sorted and are included according to its standard denomination in UniProt. The column “group” indicates the four main groups of taxa considered in this study: Rf relapsing fever species of Borrelia, Lb Lyme borreliosis group of Borrelia spp., FL free-life species, LTB parasitic species without vector
Fig. 1Venn diagram displays shared biological processes among different bacteria species. The numbers indicate the numbers of cell processes that are unique (no overlap) or shared (intersecting areas) by the different groups of spirochaetes included in this study. Rf: Relapsing fever group; LTB: Leptospira spp., Treponema spp., and Brachyspira spp.; Lb: Lyme borreliosis group; FL: free-living species, including the genera Spirochaeta, Sphaerochaeta, and Sediminispirochaeta
Fig. 2Venn diagrams display shared proteins in the ten most important biological processes (BP) in the species of Spirochaetes targeted in this study. The numbers indicate the mean number of proteins per species involved in each BP. Heat maps included below the Venn diagram of each BP show the Sorensen’s dissimilarity index between the proteins involved in a given process for the different species of bacteria. Rf: Relapsing fever group; LTB: Leptospira spp., Treponema spp., and Brachyspira spp.; Lb: Lyme borreliosis group; FL: free-living species, including the genera Spirochaeta, Sphaerochaeta, and Sediminispirochaeta. The total number of proteins involved in each process is as follows: Carbohydrate metabolism: 263, DNA hydrolysis: 150, DNA repair: 99, Metabolism: 594, Phosphorylation: 358, Redox: 798, Regulation of transcription: 385, Translation: 174, Transmembrane transport: 359, Transport: 711
Fig. 3Detrended Canonical Analysis plot of all biological processes (BPs) found in 41 species of the phylum Spirochaetes displays the similarities between species or strains. Each small black point (unlabeled) represents a BP. The blue points are the species or strains of spirochaetes, placed along the two first axes of the DCA. At the scale shown, the species in the groups Rf and Lb (circled) cannot be separated into individual points; instead, they are all tightly clustered (grouped inside the black circle) and separated from the other taxa. Abbreviations: L: Leptospira, T: Treponema, Br: Brachyspira, Se: Sediminispirochaeta, Sp: Sphaerochaeta, S: Spirochaeta
Fig. 4A heat map displays Jaccard’s dissimilarity indexes, calculated from the presence/absence comparison of proteins of every species in seven genera of the phylum Spirochaetes. The index of dissimilarity values are color-coded (note the color scale). Cold colors indicate low dissimilarity and warm colors represent high dissimilarity. The resulting dendrogram, based on the inverse of the Jaccard’s dissimilarity index, is inserted at left of the heat map
Fig. 5Phyloproteomic network shows the shared proteins in the 41 species and strains of Spirochaetes. The main four groups of species are Rf: Recurrent fever group; Lb: Lyme borreliosis group; FL: Free life species of the genera Sediminispirochaeta, Spirochaeta and Sphaerochaeta; and LTB: Leptospira, Treponema, Brachyspira. The small, unlabeled circles represent proteins, and the four large, labeled circles represent the indicated species. The proteins are either clustered around a group of species (unique proteins) or they are linked (shared) with other groups. The links and circles located between groups are color-coded according to the cluster of origin (blue: FL; red: LTB; yellow: Rf and Lb). The size of the circle indicates the weighted degree of importance in the network, which is related to the number of times the protein is shared by the species of Spirochaetes; the smallest points indicate that the protein was only recorded in one group, the largest circles indicate that the protein is shared by all four groups of species