| Literature DB >> 30801027 |
Johanna B Holm1, Michael S Humphrys1, Courtney K Robinson1, Matthew L Settles2, Sandra Ott1, Li Fu1, Hongqiu Yang1, Pawel Gajer1, Xin He3, Elias McComb1, Patti E Gravitt4, Khalil G Ghanem5, Rebecca M Brotman1, Jacques Ravel1.
Abstract
Amplification, sequencing, and analysis of the 16S rRNA gene affords characterization of microbial community composition. As this tool has become more popular and amplicon-sequencing applications have grown in the total number of samples, growth in sample multiplexing is becoming necessary while maintaining high sequence quality and sequencing depth. Here, modifications to the Illumina HiSeq 2500 platform are described which produce greater multiplexing capabilities and 300-bp paired-end reads of higher quality than those produced by the current Illumina MiSeq platform. To improve the feasibility and flexibility of this method, a 2-step PCR amplification protocol is also described that allows for targeting of different amplicon regions, and enhances amplification success from samples with low bacterial bioburden. IMPORTANCE Amplicon sequencing has become a popular and widespread tool for surveying microbial communities. Lower overall costs associated with high-throughput sequencing have made it a widely adopted approach, especially for projects that necessitate sample multiplexing to eliminate batch effect and reduced time to acquire data. The method for amplicon sequencing on the Illumina HiSeq 2500 platform described here provides improved multiplexing capabilities while simultaneously producing greater quality sequence data and lower per-sample cost relative to those of the Illumina MiSeq platform without sacrificing amplicon length. To make this method more flexible for various amplicon-targeted regions as well as improve amplification from low-biomass samples, we also present and validate a 2-step PCR library preparation method.Entities:
Keywords: 16S RNA; PCR; amplicon sequencing; ultrahigh throughput
Year: 2019 PMID: 30801027 PMCID: PMC6381223 DOI: 10.1128/mSystems.00029-19
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1Illumina amplicon library preparation through 2-step PCR amplification. In the step 1 PCR, the target gene is amplified using primers that contain the heterogeneity space and the CS1 Illumina sequencing primers. The 2nd PCR targets the CS1 Illumina sequencing primer to add the indices and the Illumina flow-cell linker sequence. Sequencing proceeds wherein reads 1 and 4 contain the forward and reverse target gene sequence, respectively, and reads 2 and 3 contain the first and second barcode indices, respectively.
Summary of sequencing results for vaginal samples
| Library preparation method parameter | Result by PCR type | ||
|---|---|---|---|
| 1-step | 2-step | 2-step | |
| No. of samples subjected to amplification | 92 | 92 | 92 |
| No. of samples successfully amplified | 49 | 83 | 83 |
| Sequencing platform | MiSeq | MiSeq | HiSeq |
| Chimeric sequences detected (%) | 0.70 | 3.3 | 3.1 |
| Mean no. (±SE) of nonchimeric, assembled sequences per sample | 11,080 ± 1,506 | 14,282 ± 483 | 50,514 ± 4,427 |
| Median quality score per sample (Q1–Q3) | 36.2 | 34.9 | 37.1 |
Figure S4 summarizes the prequality filtering per-cycle quality scores.
Significant. Kruskal-Wallis, H = 187.85; P < 2.2 × 10−16.
FIG 2Heatmap of taxon relative abundances (rows) of samples (columns). Subject samples are separated by white lines, and samples are ordered by vaginal community state types and with the following color code: 1-step MiSeq, pink; 2-step HiSeq, blue; 2-step MiSeq, aqua. (B) tSNE representation of Jensen-Shannon distances between samples from the same subject. Samples primarily cluster by vaginal CST.
FIG 3Forward and reverse read quality profiles for 300 cycles on the Illumina HiSeq (1,536 samples) and MiSeq (444 samples) platforms. Amplicon libraries were prepared using a 2-step PCR method. Shown for each cycle are the mean quality score (green line), the median quality score (solid purple line), and the quartiles of the quality score distribution (dotted purple lines).
Sequencing run information for the MiSeq and HiSeq platforms
| Sequencing platform parameter | MiSeq | HiSeq2500 RR |
|---|---|---|
| Run details | 2× 300 bp PE | 2× 250 bp PE + 2× 50 bp PE |
| Mean no. (±SE) of assembled sequences per sample | 13,116 ± 479 | 49,851± 895 |
| No. of samples | 276 | 1,194 |
| Chimeric sequences detected (%) | 2.8 | 7.7 |
| Mean no. (±SE) of nonchimeric, assembled sequences per sample | 12,737 ± 463 | 45,988 ± 787 |
| Median quality score (Q25–Q75), forward reads | 35.7 | 36.1 |
| Median quality score (Q25–Q75), reverse reads | 33.9 | 33.2 |
| Cost of sequencing per sample (no. of multiplexed samples) | $6.38 (384) | $3.99 (1,568) |
Significant. Wilcoxon rank sum, W = 70352; P < 2.2 × 10−16.
Significant. Wilcoxon rank sum, W = 76453; P < 2.2 × 10−16.
1-Step PCR primers
| Primer | Sequence (5′→3′) |
|---|---|
| Forward | AATGATACGGCGACCACCGAGATCTACAC + GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT + |
| Reverse | CAAGCAGAAGACGGCATACGAGAT + ACACTCTTTCCCTACACGACGCTCTTCCGATCT + |
Primer sequences are presented as Illumina MiSeq 3′ flow cell linker + Illumina 5′ sequencing primer (CS1/CS2) + index + heterogeneity spacer + 16S rRNA gene V3-V4 primer.
2-Step protocol PCR primers
| Primer | Sequence (5′→3′) |
|---|---|
| Step 1 | |
| Forward | ACACTGACGACATGGTTCTACA + heterogeneity spacer (0–7 bp) + ACTCCTRCGGGAGGCAGCAG |
| Reverse | TACGGTAGCAGAGACTTGGTCT + heterogeneity spacer (0–7 bp) + GGACTACHVGGGTWTCTAAT |
| Step 2 | Illumina 3′ flowcell linker + index + CS1/CS2 |
| Forward | AATGATACGGCGACCACCGAGATCTACAC + index (8 bp) + ACACTGACGACATGGTTCTACA |
| Reverse | CAAGCAGAAGACGGCATACGAGAT + index (8 bp) + TACGGTAGCAGAGACTTGGTCT |
See Data Set S1 for full oligonucleotide sequences. For step 1, primer sequences are presented as Illumina 5′ sequencing primer (CS1/CS2) + heterogeneity spacer + 16S rRNA gene V3-V4 primer.
See Data Set S1 for full forward and reverse oligonucleotides, respectively. For step 2, primer sequences are presented as Illumina 3′ flow cell linker + index + CS1/CS2.