| Literature DB >> 25424998 |
Emilie E L Muller1, Nicolás Pinel2, Cédric C Laczny1, Michael R Hoopmann3, Shaman Narayanasamy1, Laura A Lebrun1, Hugo Roume1, Jake Lin1, Patrick May1, Nathan D Hicks4, Anna Heintz-Buschart1, Linda Wampach1, Cindy M Liu4, Lance B Price4, John D Gillece4, Cédric Guignard5, James M Schupp4, Nikos Vlassis1, Nitin S Baliga3, Robert L Moritz3, Paul S Keim4, Paul Wilmes1.
Abstract
Microbial communities are complex and dynamic systems that are primarily structured according to their members' ecological niches. To investigate how niche breadth (generalist versus specialist lifestyle strategies) relates to ecological success, we develop and apply an integrative workflow for the multi-omic analysis of oleaginous mixed microbial communities from a biological wastewater treatment plant. Time- and space-resolved coupled metabolomic and taxonomic analyses demonstrate that the community-wide lipid accumulation phenotype is associated with the dominance of the generalist bacterium Candidatus Microthrix spp. By integrating population-level genomic reconstructions (reflecting fundamental niches) with transcriptomic and proteomic data (realised niches), we identify finely tuned gene expression governing resource usage by Candidatus Microthrix parvicella over time. Moreover, our results indicate that the fluctuating environmental conditions constrain the accumulation of genetic variation in Candidatus Microthrix parvicella likely due to fitness trade-offs. Based on our observations, niche breadth has to be considered as an important factor for understanding the evolutionary processes governing (microbial) population sizes and structures in situ.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25424998 PMCID: PMC4263124 DOI: 10.1038/ncomms6603
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Microbial community dynamics and lipid accumulation from wastewater.
(a) Fractions of taxa identified across the communities sampled on four distinct dates (SD1–SD4). Roman numerals refer to the four biological replicates sampled per time point. The blue star indicates the representative sample from SD3 subjected to the integrated omic analysis. (b) Average genus-level abundances of the two dominant populations. The most abundant microbial population in winter was identified as Candidatus Microthrix spp., whereas a population tentatively identified (confidence level <0.8) as Perlucidibaca spp. was dominant in autumn. (c) Long-chain fatty acid intracellular accumulation per sampling date expressed as ratios between quantified intracellular and extracellular long-chain fatty acid abundances. (d) Genus-level alpha diversity and evenness. (b–d), error bars represent s.d. (n=4).
Figure 2Identification of genomic fragments derived from distinct populations.
(a) Binning of assembled contigs (≥1,000 bp in length) on the basis of pentanucleotide signatures and visualization using the BH-SNE algorithm followed by human-augmented clustering of composite genome (CG) groups. (b) Violin plots of the G+C percentage for contigs within each of the CG groups. (c) Percentage amino-acid identity of the two subpopulations in CG8 (CG8a and CG8b) compared with the two sequenced Candidatus Microthrix parvicella (Bio17-1 (ref. 16) and RN1 (ref. 17)) genomes. The values are median±s.d. and n is the number of putative orthologues identified as best BLAST hits. Boxplots represent the lower quartile, median and upper quartile. Whiskers are placed at × 1.5 interquartile range beyond the lower and upper quartiles.
Figure 3Population-resolved integrated omics.
(a) Circos plots63 of genome-wide gene expression levels for the 10 reconstructed composite genomes (CGs). Tracks (from the innermost concentric track to the outermost): log10 of metagenomic fragments per 1 kb of sequence per 106 mapped reads (FPKM46; dark grey), log10 of the numbers of detected SNPs per gene (black), log10 of metatranscriptomic FPKM (red), log2 of the protein expression levels as Normalized Spectral Indices (NSI; blue), and reconstructed contigs ordered by size. The track scales are identical across the plots. The sizes of the individual Circos plots are weighted according to the log10 of inferred population size (Methods). (b) The number of genes for each COG category encoded by the different CGs and their corresponding relative transcript levels (grey bars). COG categories are: A, RNA processing and modification; B, chromatin structure and dynamics; C, energy production and conversion; D, cell cycle control, cell division, chromosome partitioning; E, amino acid transport and metabolism; F, nucleotide transport and metabolism; G, carbohydrate transport and metabolism; H, coenzyme transport and metabolism; I, lipid transport and metabolism; J, translation, ribosomal structure and biogenesis; K, transcription; L, replication, recombination and repair; M, cell wall/membrane/envelope biogenesis; N, cell motility; O, posttranslational modification, protein turnover, chaperones; P, inorganic ion transport and metabolism; Q, secondary metabolites biosynthesis, transport and catabolism; R, general function prediction only; S, function unknown; T, signal transduction mechanism; U, intracellular trafficking, secretion and vesicular transport; V, defence mechanisms; Z, cytoskeleton; Multi−I, multiple COG category excluding I; Multi+I, multiple COG category including I; No, no COG category assigned.
Figure 4In situ fine-tuning of gene expression by Candidatus Microthrix parvicella.
(a) Dendrogram based on the amino acid similarities of 29 predicted long-chain fatty acid-CoA ligases encoded by the Candidatus Microthrix parvicella population (CG8b). Metagenomic (grey) and metatranscriptomic (red) data represented as fragments per 1 kb of sequence per 106 mapped reads (FPKM), the protein abundance data (blue) is represented as the log10 of the Normalized Spectral Indices (NSI). (b) Qualitative gene expression patterns of the five most prevalent homologous gene sets belonging to the COG category ‘I—lipid transport and metabolism’ encoded by Candidatus Microthrix parvicella (CG8b) and the CG5 population at the metatranscriptomic (red) and metaproteomic (blue) levels across four sampling time points.
The characteristics of composite genomes CG5 and CG8b at different sampling time points.
| Average composite genome coverage ( × ) | 9.65 | 23.36 | 30.02 | 45.74 | 20.54 | 65.57 | 35.06 | 81.81 |
| Proportion of total metagenomic reads mapped per composite genome (%) | 8.10 | 36.50 | 16.81 | 37.48 | 11.27 | 51.38 | 13.71 | 47.85 |
| Percentage of ORFs expressed at the RNA level | 92.7 | 45.8 | 78.9 | 25.1 | 85.3 | 32.0 | 87.0 | 36.8 |
| Number of detected variants (based on the metagenomic data) | 5,428 | 11,702 | 42,250 | 11,588 | 29,431 | 12,596 | 37,699 | 11,517 |
| Number of detected variants (based on the metatranscriptomic data) | 11,481 | 777 | 24,353 | 1,366 | 26,227 | 2,923 | 28,247 | 3,504 |
| Variant density per CG population | 2.34E−04 | 7.43E−05 | 8.78E−04 | 7.17E−05 | 9.13E−04 | 5.68E−05 | 9.61E−04 | 5.58E−05 |