| Literature DB >> 30202970 |
Alyce Taylor-Brown1, Trestan Pillonel2, Gilbert Greub2, Lloyd Vaughan3,4, Barbara Nowak5, Adam Polkinghorne1.
Abstract
Chlamydiae are an example of obligate intracellular bacteria that possess highly reduced, compact genomes (1.0-3.5 Mbp), reflective of their abilities to sequester many essential nutrients from the host that they no longer need to synthesize themselves. The Chlamydiae is a phylum with a very wide host range spanning mammals, birds, fish, invertebrates, and unicellular protists. This ecological and phylogenetic diversity offers ongoing opportunities to study intracellular survival and metabolic pathways and adaptations. Of particular evolutionary significance are Chlamydiae from the recently proposed Ca. Parilichlamydiaceae, the earliest diverging clade in this phylum, species of which are found only in aquatic vertebrates. Gill extracts from three Chlamydiales-positive Australian aquaculture species (Yellowtail kingfish, Striped trumpeter, and Barramundi) were subject to DNA preparation to deplete host DNA and enrich microbial DNA, prior to metagenome sequencing. We assembled chlamydial genomes corresponding to three Ca. Parilichlamydiaceae species from gill metagenomes, and conducted functional genomics comparisons with diverse members of the phylum. This revealed highly reduced genomes more similar in size to the terrestrial Chlamydiaceae, standing in contrast to members of the Chlamydiae with a demonstrated cosmopolitan host range. We describe a reduction in genes encoding synthesis of nucleotides and amino acids, among other nutrients, and an enrichment of predicted transport proteins. Ca. Parilichlamydiaceae share 342 orthologs with other chlamydial families. We hypothesize that the genome reduction exhibited by Ca. Parilichlamydiaceae and Chlamydiaceae is an example of within-phylum convergent evolution. The factors driving these events remain to be elucidated.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30202970 PMCID: PMC6171736 DOI: 10.1093/gbe/evy195
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Details of Gill Samples Utilized for Metagenome Sequencing and Resulting Metagenome and Genome Assemblies
| Sample | 76ST10 | Hat2 | 007YTK11 | |
|---|---|---|---|---|
| Gill Sample Details | Host | Striped Trumpeter ( | Barramundi ( | Yellowtail Kingfish ( |
| Collection date | Nov 2010 | Aug 2012 | May 2011 | |
| Collection location | Tasmania | South Australia | South Australia | |
| Chlamydial species | ||||
| Initial metagenome Assembly metrics | No. raw reads | 16,776,972a | 40,539,772a | 29,405,932b |
| No. trimmed reads | 11,06,014c | 25,744,042d | 24,551,544 | |
| No. contigs (≥0 bp) | 58,390 | 153,109 | 369,007 | |
| No. contigs (≥1,000 bp) | 2,016 | 10,469 | 58,677 | |
| Largest contig (bp) | 604,417 | 265,296 | 687,426 | |
| G+C content (%) | 40.31 | 38.98 | 40.33 | |
| N50 | 2,409 | 1,599 | 4,589 | |
| L50 | 519 | 5,011 | 12,065 | |
| No. unique 16S rRNA sequences | 2 | 1 | 3 | |
| Chlamydial genome assembly metrics | No. contigs | 8 | 28 | 7 |
| Largest contig (bp) | 419,991 | 97,196 | 415,258 | |
| Genome completenesse | 94.4% | 94.4% | 94.4% | |
| G + C content (%) | 37.45 | 43.73 | 38.83 | |
| N50 | 150,576 | 37,594 | 415,258 | |
| Mean read coverage | ∼692× | ∼46× | ∼49× | |
| % trimmed reads mapped | 38% | 1.7% | 1% |
aObtained on an Illumina NextSeq (11 samples per lane/flow cell).
bObtained on an Illumina HiSeq (9 samples per lane/flow cell).
cReads from single lane used for assembly.
dReads from two lanes used for assembly.
ePresence of conserved bacterial proteins.
Genome Characteristics for Members of the Phylum Chlamydiae
| Family (No. Genomes Used for Comparison) | Chromosome Size (Mbp) | No. Predicted CDS | Plasmid Size (kbp) (No. ORFs) | G + C Content (%) | No. rRNA Operons | No. tRNAs | Coding Percentage (%) |
|---|---|---|---|---|---|---|---|
| 0.771–0.981 | 704–943 | No evidence | 37.5–43.7 | 1 | 41–42 | 90.2–92.4 | |
| 2.417–3.424 | 1,986–3,042 | 39–145 (40–160) | 33.0–44.0 | 1–4 | 35–42 | 69.7 | |
| 2.116–2.141 | 1,934, 2,028 | 15.5 | 43 | 2 | 34–37 | 92.2–93 | |
| 2.496 | 2,519 | 132 (136) | 38 | 1 | 35 | 91.3 | |
| 2.820–2.969 | 2,213, 2,426 | 9.1 (15), 89.5 (92) | 38.0–48.0 | 1 | 40 | 86.0–89.7 | |
| 1.041–1.229 | 911–1,112 | 7.5 | 37.4–41.6 | 1–2 | 37–39 | 89.0–91.2 |
Note.—Ranges of values are given for each family, with the number of species or strains used for comparison in brackets.
W. chondrophila plasmid in WSU 86-1044 strain only.
Chlamydia plasmid not found in C. abortus or Ca. C. ibidis. Plasmid also not present in all strains of C. pecorum, C. pneumoniae, or C. trachomatis.
Coding percentage of 69.7% and 71.3% in Neochlamydia sp. TUME1 and Neochlamydia sp. s13; remainder of genomes have a minimum coding percentage of 82.0%.
Presence of Sets of Universal Marker Genes in Chlamydiae and Other Small-genome Bacteria
| Species | Core Minimal Bacterial Gene Set ( | Core Minimal Bacterial Gene Set—Subset ( | 208 Orthologues in Amoebal & Related Pathogens ( | 100 COGs Conserved in 99% of Bacteria ( | 100 COGs Lost from Intracellular Bacteria ( | Single-copy Marker Genes Present in 95% of Bacteria (Dupont et al. 2011; |
|---|---|---|---|---|---|---|
| 185/200 | 27/36 | 141/208 | 99/100 | 6/100 | 101/107 | |
| 186/200 | 24/36 | 208/208 | 99/100 | 30/100 | 106/107 | |
| 183/200 | 23/36 | 208/208 | 100/100 | 12/100 | 106/107 | |
| 24/36 | 123/208 | 98/100 | 101/107 | |||
| 24/36 | 155/208 | 99/100 | 105/107 | |||
| 20/36 | 136/208 | 97/100 | 105/107 |
Presence includes putative non-orthologous gene displacements.
Presence refers to how many of the “lost” genes are retained.
. 1.—Genome dynamics of Ca. Parilichlamydiaceae within the Chlamydiae. (a) Relationship between chromosome size and number of coding regions in Chlamydiae. Species are colored based on chlamydial family. (b) Phylogenetic tree depicting the relationship between the Ca. Parilichlamydiaceae (yellow background) and other classified chlamydial species with genomic representatives. A free-living relative of the Chlamydiae, Akkermansia muciniphila was used as an outgroup species to root the tree. The tree was constructed by FastTree with a Jones–Taylor–Thorton model (Price et al. 2009) based on a MAFFT alignment with a BLOSUM62 scoring matrix (Katoh and Standley 2013) of 29 concatenated core proteins (pgk, argS, cafA, efp, sigA, sufB, sufS, secA, htrA, IleS, tuf, valS, pgsA, rpoA, cysS, priA, gmk, EngD, fusA, serS, rluD, pgi, gatB, rnhB, truB, rbfA, infB, nusA, and rpe). Both were executed in Geneious (Kearse et al. 2012). Numbers on the nodes indicate branch support values. (c) Orthologous groups shared and unique to representative species of six chlamydial families. Shared and unique orthologous groups are represented by the shaded triangles. Protein sequences were clustered using OrthoFinder. (d) Number of coding regions for each genome (bar chart), and proportion (pie chart) of each genome that is unique (dark shading) or shared with at least one other species (light shading).
. 2.—Genomic synteny within the Ca. Parilichlamydiaceae. Genome maps with tblastx analysis for (a) the three Ca. Parilichlamydiaceae species characterized in this study (contigs are denoted by unshaded pointed rectangles); (b) Ca. S latridicola and Chlamydia trachomatis and (c) Ca. S latridicola and Protochlamydia amoebophila (b). Coding regions are shaded vertical blocks along the black horizontal lines. Grey diagonal lines indicate position of orthologs between different species.
. 3.—Major metabolic pathways inferred from genome sequences of three species of the Ca. Parilichlamydiaceae. Schematic diagram of selected metabolic processes in the reduced genomes of Ca. Parilichlamydiaceae species. The RB is represented within the inclusion in the host cell cytoplasm. Solid and dashed lines represent predicted or absent paths in the Ca. Parilichlamydiaceae, respectively. Substrates and transporters are colored as per the legend.