| Literature DB >> 29386622 |
Stefano Campanaro1, Laura Treu2, Panagiotis G Kougias3, Xinyu Zhu3, Irini Angelidaki3.
Abstract
In the past few years, many studies investigated the anaerobic digestion microbiome by means of 16S rRNA amplicon sequencing. Results obtained from these studies were compared to each other without taking into consideration the followed procedure for amplicons preparation and data analysis. This negligence was mainly due to the lack of knowledge regarding the biases influencing specific steps of the microbiome investigation process. In the present study, the main technical aspects of the 16S rRNA analysis were checked giving special attention to the approach used for high throughput sequencing. More specifically, the microbial compositions of three laboratory scale biogas reactors were analyzed before and after addition of sodium oleate by sequencing the microbiome with three different approaches: 16S rRNA amplicon sequencing, shotgun DNA and shotgun RNA. This comparative analysis revealed that, in amplicon sequencing, abundance of some taxa (Euryarchaeota and Spirochaetes) was biased by the inefficiency of universal primers to hybridize all the templates. Reliability of the results obtained was also influenced by the number of hypervariable regions under investigation. Finally, amplicon sequencing and shotgun DNA underestimated the Methanoculleus genus, probably due to the low 16S rRNA gene copy number encoded in this taxon.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29386622 PMCID: PMC5792648 DOI: 10.1038/s41598-018-20414-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Outline of data analysis process. Gray boxes and black arrows represent the analysis workflow, gray dotted lines represent comparisons between different approaches used for data analysis. (C1) Influence of the database used to train the Bayesian classifier; (C2) influence of the number of reads on taxonomic results; (C3) influence of paired-end reads merging on the taxonomy; (C4) comparison between different 16S rRNA sequencing approaches (with independent taxonomic analysis of forward and reverse paired-ends); (C5) comparison between different 16S rRNA sequencing approaches (using merged paired-ends).
Figure 2Relative abundance of the ten major phyla identified in CSTR01 sample. Results were obtained from (A) ~712,000 16S rRNA amplicons, (B) ~34,000 shotgun DNA and (C) ~976,000 shotgun RNA sequences aligned to the 16S rRNA gene. Numbers refer to the reads obtained after paired-end merging. Results were obtained after training the naive Bayesian classifier on different databases (RDP release 11, Greengenes 13 08 and SILVA release 128).
Figure 3Number of taxa identified with an increasing number of reads. Reads from amplicons (merged paired-end) were random resampled starting from a minimum number of 1000 sequences up to 700,000 sequences. After taxonomic analysis the number of phyla, orders, families, classes and genera was calculated and reported in y axes.
Figure 4Comparison between abundance of different taxa determined using three sequencing approaches. Results are reported as average of the results determined in the six samples examined (CSTR01a-03a; CSTR01b-03b). The log2 ratios of the abundances calculated comparing two different approaches are reported in y axes. Grey bars represent comparison between amplicon sequencing and shotgun RNA, red bars represent the comparison between amplicon sequencing and shotgun DNA. Taxa having higher abundance in amplicons in comparison to shotgun RNA sequencing are reported as gray bars with positive values, those having higher values in amplicons in comparison to shotgun DNA are reported as red bars with positive values. (A) Comparison at phylum level between amplicons, shotgun RNA and shotgun DNA (for and rev sequences analyzed separately); (B) comparison at phylum level between amplicons and shotgun RNA; (C) the same comparison reported in (A) at genus level; (D) the same as reported in (B) at genus level. In (B) and (D) analysis was performed on 700,000 sequences obtained after merging for and rev paired-ends.
Figure 5Abundance of different taxa calculated considering shotgun rRNA sequences assigned to different hypervariable regions. (A) Fraction of sequences assigned to different taxonomic levels and normalized considering the total number of sequences assigned to each hypervariable region. Number of reads assigned to different phyla (B) and genera (C) calculated considering reads assigned to different hypervariable regions. Notice the logarithmic scale on y axes (number of sequences) in (B) and (C). In (A) all the six samples are reported and variability is represented as standard deviation on each bar, in (C) results are reported for sample CSTR01a.
Important remarks for analyzing the microbiome of anaerobic reactors for biogas production.
| Method | 16S rRNA amplicon seq. | Shotgun DNA | Shotgun RNA |
|---|---|---|---|
| Number of reads needed for accurate taxonomic analysis. | Low (>10,000). All the sequences target the 16S rRNA gene and this allows reliable investigation of the main taxa with few reads. | Very high (>1,000,000). Number of reads assigned to the 16S rRNA gene is low. | Intermediate (>100,000). Loss of reads determined by the presence of transcripts other than 16S rRNA gene is quite limited. |
| Possible suggestions. | Increase the number of clade-specific marker genes other than 16S rRNA using dedicated software (e.g. MetaPhlAn) | ||
| Hypervariable regions. | Analysis targets one or two selected regions. This can reduce accuracy in calculating abundance of specific taxa (e.g. | Analysis targets all the hypervariable regions. This can increase both the efficiency of taxonomic analysis and the evaluation of abundance for most taxonomic groups. | Same as shotgun DNA. |
| Possible suggestions. | Increase the number of hypervariable regions under investigation with longer reads (e.g. using PacBio SMRT technology) or analyzing more than one amplicon. V1-V2 regions seem particularly promising to improve taxonomic results. | ||
| Universal primers. | Universal primers introduce biases (e.g. | No amplification step needed, this reduces biases in taxonomic investigation. | Same as shotgun DNA. |
| Possible suggestions. | Perform accurate check for potential biases in 16S rRNA gene amplification. Use more than one couple of universal primers. | ||
| Transcriptional activity. | This approach targets genomic DNA, transcriptional activity cannot be monitored and expression level of the 16S rRNA gene does not influence analysis. | Same as 16S rRNA amplicon seq. | This approach targets RNA molecules and provides insights in activity of specific taxa. Analysis can be inaccurate in determining abundance of taxa characterized by high or low activity. |
| Possible suggestions. | Combine different sequencing approaches to gain insights both on microbial abundance and on their activity. | Same as 16S rRNA amplicon seq. | Same as 16S rRNA amplicon seq. |
Figure 6Abundance ratio (log2) determined for the 100 most abundant genera before and after Na-oleate addition. x and y axes report the log2 ratios obtained by dividing abundance level of genera “after” and “before Na-oleate” addition. “Blue dots” represent the comparison between log2 ratio determined for amplicon sequencing (x-axes) and log2 ratio determined for shotgun RNA sequencing (y-axes) (60,000 subsampled sequences). “Red dots” represent the comparison between the log2 ratio determined for amplicon sequencing (x-axes) and log2 ratio determined for shotgun DNA sequencing (y-axes).