| Literature DB >> 23405248 |
Christopher S Miller1, Kim M Handley, Kelly C Wrighton, Kyle R Frischkorn, Brian C Thomas, Jillian F Banfield.
Abstract
In microbial ecology, a fundamental question relates to how community diversity and composition change in response to perturbation. Most studies have had limited ability to deeply sample community structure (e.g. Sanger-sequenced 16S rRNA libraries), or have had limited taxonomic resolution (e.g. studies based on 16S rRNA hypervariable region sequencing). Here, we combine the higher taxonomic resolution of near-full-length 16S rRNA gene amplicons with the economics and sensitivity of short-read sequencing to assay the abundance and identity of organisms that represent as little as 0.01% of sediment bacterial communities. We used a new version of EMIRGE optimized for large data size to reconstruct near-full-length 16S rRNA genes from amplicons sheared and sequenced with Illumina technology. The approach allowed us to differentiate the community composition among samples acquired before perturbation, after acetate amendment shifted the predominant metabolism to iron reduction, and once sulfate reduction began. Results were highly reproducible across technical replicates, and identified specific taxa that responded to the perturbation. All samples contain very high alpha diversity and abundant organisms from phyla without cultivated representatives. Surprisingly, at the time points measured, there was no strong loss of evenness, despite the selective pressure of acetate amendment and change in the terminal electron accepting process. However, community membership was altered significantly. The method allows for sensitive, accurate profiling of the "long tail" of low abundance organisms that exist in many microbial communities, and can resolve population dynamics in response to environmental change.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23405248 PMCID: PMC3566076 DOI: 10.1371/journal.pone.0056018
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Sequencing bias for amplicon ends.
Shown are data determined by mapping reads for a representative library (index 2 subsample 3) against EMIRGE-reconstructed16S rRNA sequences. A Proportion of mapped library fragments (y-axis) that begin a given number of bp away from the nearest reconstructed amplicon end (x-axis), averaged across all reconstructed 16S rRNA amplicons. There is a strong preference for fragments to begin at position 0 or 1. B Total library base coverage plotted in terms of relative position within an amplicon. Average reconstructed amplicon length was 1464 bp.
Figure 2Principal coordinates analysis clusters the 48 subsample communities by biological sample.
EMIRGE-reconstructed rRNA genes were used to construct a phylogenetic tree. From this tree, pairwise distances were calculated between each of the 48 subsample communities using either abundance-weighted (A) or unweighted (B) Unifrac, and principal coordinates analysis was used to reduce the dimensionality of the resulting distance matrices for visualization. Percentage variation explained by each principal coordinate is shown for each axis. Subsample communities clearly separate by biological sample. Weighted Unifrac accounts for a larger fraction of the variance in the first two principle coordinates than unweighted unifrac, indicating that changes in abundances are particularly informative.
Figure 3Phylum-level abundances of the 48 EMIRGE-reconstructed communities.
Taxonomic assignments were made with the RDP classifier for each OTU with a confidence cutoff of 0.8, and abundances were summed to the phylum level and are shown as a log-scaled heatmap. The barcoding index for each sample is listed along the bottom. Hierarchical clustering of the abundance vectors separates each community by biological sample.
Figure 4Alpha diversity of communities inferred by full-length rRNA and hypervariable-regions.
Alpha diversity metrics are shown for EMIRGE-reconstructed full-length OTUs (A, C) and OTUs based on in silico-extracted V3 regions from the EMIRGE-reconstructed sequences (B, D). A and B show the total number of OTUs identified with increasing sequencing effort. C and D show the total tree phylogenetic distance (PD) observed with increasing sequencing effort. Plots show the mean and standard deviation of 10 samples per simulated sequencing effort. Blue: background; red: iron reduction; orange: sulfate reduction.
Figure 5Community structure at varying levels of taxonomic resolution.
Reconstructed full-length OTUs were assigned taxonomy by the RDP classifier, and relative abundances at 4 taxonomic levels are shown for each of the 48 subsample datasets. Indices from left to right in each panel are as in Figure 2. Select taxa are identified: 1. Proteobacteria, 2. Firmicutes, 3. Bacteroidetes, 4. Unassigned, 5. TM7, 6. Tenericutes, 7. Gammaproteobacteria, 8. Epsilonproteobacteria, 9. Deltaproteobacteria, 10. Betaproteobacteria, 11. Alphaproteobacteria, 12. Clostridia, 13. Bacilli, 14. Bacteroidia, 15. Unclassified Firmicute, 16. Pseudomonadales, 17. Burkholderiales, 18. Clostridiales, 19. Bacillales, 20. Bacteroidales, 21. Desulfuromonadales, 22. Rhodocyclales, 23. Methylophilales, 24. Desulfobacterales, 25. Unclassified Clostridia, 26. Pseudomonadaceae, 27. Rhodocyclaceae, 28. Methylophilaceae, 29. Comamonadaceae, 30. Unclassified Betaproteobacteria, 31. Bacillaceae, 32. Unclassified Bacteroidales, 33. Geobacteraceae, 34. Peptococcaceae, 35. Unclassified Clostridiales.