| Literature DB >> 29515524 |
Trestan Pillonel1, Claire Bertelli1, Gilbert Greub1.
Abstract
The Chlamydiae phylum exclusively encompasses bacteria sharing a similar obligate intracellular life cycle. Existing 16S rDNA data support a high diversity within the phylum, however genomic data remain scarce owing to the difficulty in isolating strains using culture systems with eukaryotic cells. Yet, Chlamydiae genome data extracted from large scale metagenomic studies might help fill this gap. This work compares 33 cultured and 27 environmental, uncultured chlamydial genomes, in order to clarify the phylogenetic relatedness of the new chlamydial clades and to investigate the genetic diversity of the Chlamydiae phylum. The analysis of published chlamydial genomes from metagenomics bins and single cell sequencing allowed the identification of seven new deeply branching chlamydial clades sharing genetic hallmarks of parasitic Chlamydiae. Comparative genomics suggests important biological differences between those clades, including loss of many proteins involved in cell division in the genus Similichlamydia, and loss of respiratory chain and tricarboxylic acid cycle in several species. Comparative analyses of chlamydial genomes with two proteobacterial orders, the Rhizobiales and the Rickettsiales showed that genomes of different Rhizobiales families are much more similar than genomes of different Rickettsiales families. On the other hand, the chlamydial 16S rRNAs exhibit a higher sequence conservation than their Rickettsiales counterparts, while chlamydial proteins exhibit increased sequence divergence. Studying the diversity and genome plasticity of the entire Chlamydiae phylum is of major interest to better understand the emergence and evolution of this ubiquitous and ancient clade of obligate intracellular bacteria.Entities:
Keywords: Chlamydiae; bacterial taxonomy; chlamydial metabolism; comparative genomics; endosymbiont evolution; obligate intracellular
Year: 2018 PMID: 29515524 PMCID: PMC5826181 DOI: 10.3389/fmicb.2018.00079
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
List of genomes included in the analysis, with GC content (%), size (bp) and completeness as evaluated using checkM.
| 2664102 | 2,246 | 55.76 | 1 | 100 | ||
| 1144377 | 935 | 39.87 | 1 | 98.28 | ||
| 1041170 | 842 | 36.92 | 1 | 94.83 | ||
| 1173390 | 968 | 39.22 | 1 | 98.28 | ||
| 1166239 | 966 | 39.38 | 1 | 98.28 | ||
| 1059583 | 905 | 37.94 | 1 | 96.55 | ||
| 1146066 | 939 | 38.32 | 4 | 96.55 | ||
| 1072950 | 900 | 40.34 | 1 | 98.28 | ||
| 1106197 | 934 | 41.08 | 1 | 100 | ||
| 1230230 | 1,029 | 40.58 | 1 | 98.28 | ||
| 1171660 | 983 | 39.06 | 1 | 98.28 | ||
| 1113233 | 933 | 38.54 | 1 | 98.28 | ||
| 2529957 | 2,075 | 32.42 | 98 | 96.55 | ||
| 1073507 | 886 | 42.03 | 13 | 98.28 | ||
| 1042519 | 887 | 41.31 | 1 | 98.28 | ||
| Chlamydiae bacterium Ga0074140 | 1724203 | 1,639 | 47.82 | 6 | 96.55 | |
| Chlamydiae bacterium GWA2_50_15 | 1177368 | 993 | 49.34 | 32 | 93.1 | |
| Chlamydiae bacterium GWC2_50_10 | 1172283 | 966 | 48.94 | 52 | 86.21 | |
| Chlamydiae bacterium GWF2_49_8 | 1019733 | 760 | 49.23 | 93 | 77.9 | |
| Chlamydiae bacterium RIFCSPHIGHO2_01_FULL_44_39 | 1569649 | 1,466 | 44.72 | 30 | 95.69 | |
| Chlamydiae bacterium RIFCSPHIGHO2_02_FULL_45_9 | 1342635 | 1,156 | 44.66 | 104 | 80.72 | |
| Chlamydiae bacterium RIFCSPHIGHO2_02_FULL_49_29 | 1387015 | 1,187 | 49.08 | 69 | 93.1 | |
| Chlamydiae bacterium RIFCSPHIGHO2_12_FULL_27_8 | 974360 | 817 | 27.43 | 98 | 72.1 | |
| Chlamydiae bacterium RIFCSPHIGHO2_12_FULL_44_59 | 1568548 | 1,470 | 44.72 | 30 | 95.69 | |
| Chlamydiae bacterium RIFCSPHIGHO2_12_FULL_49_11 | 1258484 | 1,065 | 48.45 | 71 | 81.27 | |
| Chlamydiae bacterium RIFCSPHIGHO2_12_FULL_49_32 | 1397302 | 1,190 | 48.91 | 72 | 89.66 | |
| Chlamydiae bacterium RIFCSPHIGHO2_12_FULL_49_9 | 1320387 | 1,166 | 48.59 | 94 | 71.24 | |
| Chlamydiae bacterium RIFCSPLOWO2_01_FULL_28_7 | 708526 | 572 | 28.07 | 75 | 58.21 | |
| Chlamydiae bacterium RIFCSPLOWO2_01_FULL_44_52 | 1537590 | 1,438 | 44.74 | 29 | 95.69 | |
| Chlamydiae bacterium RIFCSPLOWO2_02_FULL_45_22 | 1575869 | 1,475 | 44.7 | 30 | 95.69 | |
| Chlamydiae bacterium RIFCSPLOWO2_02_FULL_49_12 | 1413329 | 1,175 | 48.99 | 58 | 93.1 | |
| Chlamydiae bacterium RIFCSPLOWO2_12_FULL_45_20 | 1542182 | 1,443 | 44.75 | 28 | 95.69 | |
| Chlamydiae bacterium RIFCSPLOWO2_12_FULL_49_12 | 1418742 | 1,224 | 49.16 | 46 | 91.38 | |
| Chlamydiae bacterium SM23_39 | 1126604 | 986 | 26.23 | 67 | 96.55 | |
| Chlamydiales bacterium 38-26 | 2834110 | 2,327 | 38.12 | 8 | 98.28 | |
| Chlamydiales bacterium SCGC AB-751-O23 | 986924 | 715 | 35.45 | 89 | 46.24 | |
| Chlamydiales bacterium SCGC AG-110-M15 | 929815 | 711 | 41.8 | 54 | 40.52 | |
| Chlamydiales bacterium SCGC AG-110-P3 | 1299661 | 954 | 46.83 | 96 | 47.49 | |
| Criblamydia sequanensis CRIB-18 | 2969604 | 2,422 | 38.24 | 23 | 97.41 | |
| 2820195 | 2,202 | 48.23 | 29 | 97.41 | ||
| Neochlamydia sp. EPS4 | 2530677 | 1,882 | 38.09 | 112 | 97.41 | |
| Neochlamydia sp. S13 | 3187074 | 2,232 | 38.03 | 1,342 | 97.41 | |
| Neochlamydia sp. TUME1 | 2546323 | 1,879 | 38.02 | 254 | 97.41 | |
| 2999361 | 2,409 | 38.94 | 72 | 99.66 | ||
| 2971261 | 2,477 | 38.97 | 95 | 99.66 | ||
| 3008885 | 2,309 | 39.04 | 162 | 97.07 | ||
| 3072383 | 2,532 | 39.03 | 1 | 99.66 | ||
| 2307885 | 2,025 | 38.71 | 34 | 100 | ||
| 2397675 | 1,775 | 34.82 | 178 | 100 | ||
| 2414465 | 1,841 | 34.72 | 1 | 98.28 | ||
| 2864073 | 2,359 | 42.81 | 4 | 98.28 | ||
| 2885090 | 2,372 | 42.7 | 1 | 100 | ||
| 3423982 | 2,766 | 42.05 | 33 | 100 | ||
| Protochlamydia sp. R18 S13 | 2722699 | 2,006 | 34.78 | 795 | 100 | |
| Protochlamydia sp. W-9 | 2484573 | 1,815 | 34.48 | 402 | 100 | |
| 1830543 | 1,692 | 36.16 | 38 | 100 | ||
| 2701449 | 2,299 | 32.45 | 3 | 98.28 | ||
| 981540 | 913 | 39.52 | 169 | 70 | ||
| 2496337 | 2,229 | 41.78 | 1 | 100 | ||
| 2116312 | 1,832 | 43.78 | 1 | 98.28 |
Figure 1Diversity of the phylum Chlamydiae. (A) Phylogenetic tree of cultured and uncultured representative of the phylum Chlamydiae reconstructed based on the concatenated alignment of 108 single copy orthologs conserved in more than 90% of the genomes. Nodes with support values lower than 1 are indicated with black dots. The width of the black dots does not reflect a genetic distance (lines a shifted to accommodate the presence of the dots). *Genome sequenced using a shotgun metagenomics approach. +Single-cell amplified genomes (B) GC content, genome size and estimated genome completeness (C) quality measures: (i) number of missing core proteins (out of 108 proteins) (ii) number of duplicated core genes (iii) number of contigs in the assembly (iv) number of contigs without any CDS (v) number of contigs without any best PLAST hits against chlamydial sequences (RefSeq database version 81). (D) Described families of the Chlamydiae phylum.
Figure 2Circular representation of the uncultured genome RIFCSPHIGHO2_01_FULL_44_39 (accession: MGLR01000000). The outer circle indicates contig boundaries. The predicted open reading frames (ORFs) of the leading and lagging strands are reported in gray. Pink ORFs are proteins without a best hit against chlamydial sequences in the RefSeq database. Violet ORFs are proteins without any significant hit in the RefSeq database. The inner blue/red circles show the conservation of the closest identified orthologous protein (red scale) in the 37 other chlamydial species and one other genome of group 3 (Figure 1). Identity values were calculated based on the alignment of orthologous groups inferred using OrthoFinder. The absence of any ortholog is indicated in blue. Most contigs exhibit a majority of best PLAST hits against chlamydial sequences.
Figure 3Updated classification of the order Chlamydiales based on genomic data. Proposed new Candidatus genera, families and orders are indicated in bold. Only three genera (Chlamydia, Neochlamydia and Protochlamydia) have more than one species. Vertical blue lines indicate genomes of the same species. Numbers from 1 to 12 indicate new candidate species. Candidate species were not named. Eleven new Candidatus genera are proposed. Novochlamydia (from Latin novus, new), Pelagichlamydia (from the Latin world pelagus, sea), Abzuchlamydia (from Abzu, the underground waters of the Aquifer in the Sumerian mythology), Enkichlamydia (from the Sumerian god Enki, lord of the Abzu), Limichlamydia (from the Latin world limus, silt, alluvium), Alluviochlamydia (from the Latin world alluvio, alluvium), Arenachlamydia (from the Latin world Arena, sand), Hydrochlamydia (from the Greek prefix hydro-, relating to water), Kinetochlamydia (from the Greek prefix kineto-, motion; movement), Motilichlamydia (from Latin world motus, past participle of movere, to move), Thermochlamydia (from the Greek world thermos, warm, hot).
Figure 4Distribution of the identity of reciprocal best blast hit between pairs of genomes. (A) Protein sequences exhibit similar levels of divergence between different Parachlamydiaceae genera and Waddlia chondrophila (different family), with a median amino acid identity around 55%. The distribution is clearly shifted for Simkania negevensis, with a median identity inferior to 50%. (B) Distribution of protein identities between C. trachomatis and the closely related C. muridarum genome, C. pneumoniae, W. chondrophila, and two deep branching taxa (S. epinepheli and one groundwater metagenomic bin). The continuous decline in sequence identity with divergence time is clearly visible here. The most distant chlamydial lineages are extremely divergent and skewed, with a lot of lowly conserved proteins and some highly conserved ones (mode of 35% sequence identity). (C) Distribution of sequence identities of C. sequenanesis and representatives of four recognized families and the outgroup, A. muciniphila. (D) Identical analyses comparing the obligate intracellular bacteria Rickettsia prowazekii (alpha-proteobacteria) with Orientia tsutsugamushi (Rickettsieae), Anaplasma marginale (Anaplasmataceae), Ehrlichia chafeensis (Anaplasmataceae), Midichloria mitochondrii (“Candidatus Midichloriaceae”), and Agrobacterium fabrum (free-living member of the Rhizobiales order).
Figure 5Relationships between 16S rRNA and protein sequences conservation within three distinct bacterial orders: Rhizobiales, Chlamydiales, and Rickettsiales (indicated with colors). (A) Relationships between 16S rRNA identity and the median protein identity within the three bacterial orders and (B) for different families of the same order. (C) Relationships between 16S rRNA and the modal identity within the three bacterial orders and (D) between different families of the same order.
Figure 6Comparative genomics of the phylum Chlamydiae. Identified homologs of proteins involved in nucleotide uptake (ntt), part of molecular machines (i.e., T3SS, Flagellum) and part of selected metabolic pathway. For KEGG pathways/modules, number indicates non redundant count of KEGG orthologs part of the corresponding pathway/module. Reference locus tag, KEGG and Pfam accessions are indicated in Table S1.