| Literature DB >> 28903732 |
Imane Allali1,2, Jason W Arnold1, Jeffrey Roach3, Maria Belen Cadenas1, Natasha Butz1, Hosni M Hassan4, Matthew Koci4, Anne Ballou4, Mary Mendoza4, Rizwana Ali4, M Andrea Azcarate-Peril5.
Abstract
BACKGROUND: Advancements in Next Generation Sequencing (NGS) technologies regarding throughput, read length and accuracy had a major impact on microbiome research by significantly improving 16S rRNA amplicon sequencing. As rapid improvements in sequencing platforms and new data analysis pipelines are introduced, it is essential to evaluate their capabilities in specific applications. The aim of this study was to assess whether the same project-specific biological conclusions regarding microbiome composition could be reached using different sequencing platforms and bioinformatics pipelines.Entities:
Keywords: 16S rRNA amplicon sequencing - microbiome analysis - microbiome - microbiome composition - next generation sequencing platforms; Bioinformatics pipeline; NGS bias
Mesh:
Substances:
Year: 2017 PMID: 28903732 PMCID: PMC5598039 DOI: 10.1186/s12866-017-1101-8
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Fig. 1Schematic of the experimental design of this study to test impact of library preparation methods and protocols on diversity and relative abundance of bacteria. Protocol steps are indicated on the left. Standard methods are in black boxes while non-standard methods with modified conditions are shown in grey boxes
Primer sequence information for all platforms
| Platform | Primer Name | Sequence (5′ -3′) | Targeting Region |
|---|---|---|---|
| Roche 454 | Roche Titanium Fusion Primer A | CCATCTCATCCCTGCGTGTCTCCGACTCAG | V1-V2 |
| Universal Bacterial Primer 8F | AGAGTTTGATCCTGGCTCAG | V1-V2 | |
| Roche Titanium Primer B | CCTATCCCCTGTGTGCCTTGGCAGTCTCAG | V1-V2 | |
| Reverse Bacterial Primer 338R | GCTGCCTCCCGTAGGAGT | V1-V2 | |
| Illumina MiSeq | Forward Primer | TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGAGAGTTTGATCCTGGCTCAG | V1-V2 |
| Reverse Primer | GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGCTGCCTCCCGTAGGAGT | V1-V2 | |
| Ion Torrent PGM | Forward Primer composed of Ion Torrent adapter A | CCATCTCATCCCTGCGTGTCTCCGACTCAG | V1-V2 |
| Universal Bacterial Primer 8F | AGAGTTTGATCCTGGCTCAG | V1-V2 | |
| Reverse Primer of Ion Torrent trP1 adapter | CCTCTCTATGGGCAGTCGGTGAT | V1-V2 | |
| Reverse Bacterial Primer 338R | GCTGCCTCCCGTAGGAGT | V1-V2 |
Fig. 2Evaluated bioinformatics pipelines using QIIME [36] and UPARSE [37] using two different OTU picking methods (QIIME only) either with or without chimera removal steps
Characteristics of samples used in this study
| Sample | Intestinal Location | Treatment | Chicken |
| Time point (week) |
|---|---|---|---|---|---|
| M848A | Cecum | Control | 7 | Yes | 7 |
| M480A | Cecum | Vaccinated | 1 | No | 6 |
| M736A | Cecum | Vaccinated | 2 | No | 9 |
| M572A | Cecum | Vaccinated | 3 | No | 7 |
| M576A | Cecum | Vaccinated | 4 | No | 7 |
| M988A | Cecum | Vaccinated | 2 | Yes | 8 |
| M908A | Cecum | Vaccinated | 3 | Yes | 7 |
| M368A | Cecum | Prebiotics | 1 | No | 5 |
| M452A | Cecum | Prebiotics | 1 | No | 6 |
| M620A | Cecum | Prebiotics | 1 | No | 8 |
| M704A | Cecum | Prebiotics | 1 | No | 9 |
| M540A | Cecum | Prebiotics | 2 | No | 7 |
| M464A | Cecum | Prebiotics | 4 | No | 6 |
| M884A | Cecum | Prebiotics | 4 | Yes | 7 |
GS FLX (Roche), PGM (Ion Torrent, Life Technologies) and MiSeq (Illumina) platform comparison. Data was obtained from the corresponding platform’s website
| Roche 454 | Ion Torrent | Illumina MiSeq | |
|---|---|---|---|
| Sequencing Kit | GS FLX Titanium XLR70 | PGM 400 Sequencing | MiSeq Reagent Kits v2 |
| Expected Read Length | Up to 600 bp | Up to 400 bp | MiSeq Reagent Kit v2: Up to 2 × 250 bp |
| Typical Throughput | 450 Mb | Ion 314™ Chip v2: Up to 100 Mb | Up to 8.5 Gb |
| Reads per Run | ~1000,000 shotgun, | Ion 314™ Chip v2: 400–550 thousand | ~15 million reads |
| Consensus Accuracy | 99.995% | 99% | 99% |
| Run Time | 10 h | Ion 314™ Chip v2: 2.3 | 4 h and |
| Sample Input | gDNA, cDNA, or amplicons (PCR products) | gDNA, cDNA, or amplicons (PCR products) | gDNA, cDNA, or amplicons (PCR products) |
| Weight | 532 lbs. (242 kg) | 65 lbs. (30 kg) | 120 lbs. (54.5 kg) |
| Instrument cost | ~$500 K | ~ $80 k | ~ $125 k |
Comparative summary of sequencing depth, reads length, and quality between GS FLEX, PGM, and MiSeq platforms
| Platform | Raw reads | Filtered readsa | Mean Length after filtering (bp) | Percentage of reads kept after filtering | Mean quality score before quality filtering | Number of identified OTUs |
|---|---|---|---|---|---|---|
| Roche 454 | 118,018 | 113,306 | 377 | 96% | 37.7 | 1028 |
| Ion Torrent PGM | 481,593 | 71,652 | 297 | 14.9% | 23.4 | 2747 |
| Illumina Miseq | 4,149,441 | 3,811,042 | 334 | 91.8% | 37.5 | 3731 |
aNumber of reads after filtering, reads of less than 200 nucleotides and with quality scores below 25 were removed
Fig. 3A comparison of phylogenetic diversity (PD) and species richness (S) between the 6 runs (GS FLX, MiSeq1, MiSeq2, PGM1, PGM 2 and PGM3) and in each pipeline a Phylogenetic diversity b Species Richness. Panels on the right show a matrix comparison between pipelines. Numbers within cells indicate P-values >0.05 < 0.1. *P < 0.01,**P < 0.001
Fig. 4a Principal Coordinates Analysis PCoA (Unweighted UniFrac) plots of data generated by the three different platforms, analyzed by different bioinformatics pipelines and colored according to treatment group (Prebiotics, control and Salmonella-vaccinated). PERMANOVA F and P values and ANOSIM R and P values are indicated. b Procrustes analysis of sequencing data from the different platforms analyzed with the QIIME2 (de novo OTU picking plus chimera depletion). M and P values are indicated in the figure
Fig. 5Selected differences in relative abundances of the most impacted taxa according to data generated by different platforms (indicated by different colors) and bioniformatic analysis pipelines (indicated across the top). The full figure can be seen in Additional file 2: Figure S2
Fig. 6Unique species identified by the different bioinformatic analysis schemes. Boxes indicate taxa not detected by open reference OTU picking (QIIME) and UPARSE methods, which may be of significance for the study
Fig. 7Comparisons made between the two different OTU/variant calling (either DADA2 or QIIME de novo OTU picking at 99% similarity) and the two different taxa assigment algorithms (DADA2 or QIIME using the Greengenes database). Labels are: QIIME.QIIME indicating QIIME was used for OTU picking and taxonomic assigment, QIIME.DADA2 indicating QIIME was used for OTU picking and DADA2 was used for taxonomic assignment, DADA2.QIIME indicating the DADA2 was used for sequencing error supression and QIIME was used for taxonomic assignment, and DADA2.DADA2 indicating that used for both sequencing error suppresion and taxonomic assignment. a Procrustes analysis. b A comparison of the number of OTUs identified by DADA2 (clear boxes) and QIIME de novo OTU picking at 99% similarity (shaded boxes).c Taxonomic profiles of samples grouped by treatment and bioinformatics pipeline. Only major taxa are indicated in the Figure. d Correlation analysis of relative abundances of bacterial taxa at species level. For a complete list see Additional file 3: Table S3