| Literature DB >> 24558975 |
Douglas W Fadrosh, Bing Ma, Pawel Gajer, Naomi Sengamalay, Sandra Ott, Rebecca M Brotman, Jacques Ravel1.
Abstract
BACKGROUND: To take advantage of affordable high-throughput next-generation sequencing technologies to characterize microbial community composition often requires the development of improved methods to overcome technical limitations inherent to the sequencing platforms. Sequencing low sequence diversity libraries such as 16S rRNA amplicons has been problematic on the Illumina MiSeq platform and often generates sequences of suboptimal quality.Entities:
Year: 2014 PMID: 24558975 PMCID: PMC3940169 DOI: 10.1186/2049-2618-2-6
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Figure 1Dual-indexed 16S rRNA gene PCR amplification strategy with heterogeneity spacer primers for sequencing on the MiSeq platform. (A) Dual-indexed PCR amplification primers targeting the V3-V4 hypervariable regions of the 16S rRNA gene contain a heterogeneity spacer region and linker sequence optimized for sequencing on the Illumina MiSeq platform. Using this approach enables sequencing using the standard Illumina HP10 and HP11 sequencing primers allowing for additional sequencing flexibility. (B) Schematic showing the first thirty sequencing cycles of eight mock amplicons prepared using the dual-indexed approach. This diagram illustrates how the index sequence and heterogeneity spacer (colored letters, white background) helps to alleviate the “low sequence diversity” issue associated with the MiSeq platform by creating a more even base composition at each cycle of the run.
Raw sequence reads and post-QA/QC sequence reads statistics from nine 250PE and three 300PE MiSeq runs
| | | | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | |
| 250PE | 199–205 (130325) | 352 | 5.5 | 509 | 16.3 | 87.4 | 1.24 | 9,413,692 | 5,994,627; 6,243,014 | 268; 256 | 266.04 ± 7.12; 246.99 ± 23.98 |
| 250PE | 207–212 (130525) | 387 | 6 | 583 | 14.4 | 86.3 | 1.32 | 10,656,207 | 7,438,345; 7,196,027 | 268; 259 | 265.41}8.02; 248.67}23.14 |
| 250PE | 217–222 (130612) | 337 | 6.2 | 650 | 15.1 | 84.1 | 1.5 | 11,706,459 | 7,806,262; 8,021,515 | 268; 244 | 265.03}8.48; 240.30}20.49 |
| 250PE | 223–228 (130705) | 364 | 5.5 | 839 | 10.7 | 84.6 | 1.24 | 13,398,346 | 9,703,996; 9,226,330 | 268; 261 | 265.00}8.84; 250.24}23.38 |
| 250PE | 229–234 (130708) | 366 | 4 | 625 | 8.4 | 81.1 | 1.57 | 11,071,952 | 8,089,983; 7,167,888 | 268; 232 | 265.85}6.94; 228.97}28.21 |
| 250PE | 241–246 (130805) | 305 | 12 | 527 | 8.9 | 92.5 | 1.34 | 9,615,279 | 7,393,672; 7,180,867 | 268; 264 | 266.21}6.23; 255.01}19.15 |
| 250PE | 247–252 (130801) | 299 | 9 | 429 | 17.3 | 91.2 | 1.07 | 7,787,755 | 5,267,816; 5,190,563 | 268; 266 | 266.39}7.73; 256.12}20.82 |
| 250PE | 259–265 (130813) | 426 | 7 | 445 | 9.3 | 90.7 | 1.46 | 7,919,928 | 5,920,074; 5,425,815 | 268; 265 | 265.10}9.83; 255.54}17.47 |
| 250PE | 235–240 (130815)b | 371 | 11 | 351 | 14.7 | 90.7 | 1.71 | 8,517,215 | 6,268,044; 5,848,845 | 268; 267 | 267.73}2.75; 262.44}13.29 |
| 300PE | 235–240 (130916)b | 371 | 11 | 514 | 13.9 | 90.9 | 1.44 | 12,522,115 | 9,328,369 | 499 | 497.19}6.93 |
| 300PE | 266–270 (131011) | 271 | 11 | 441 | 15.4 | 91.3 | 1.54 | 10,699,820 | 8,111,221 | 485 | 488.03}9.78 |
| 300PE | 271–275 (131015) | 276 | 12 | 592 | 12.3 | 90.4 | 1.52 | 14,241,937 | 10,912,710 | 495 | 491.05}10.56 |
250PE, 250 bp paired-end read; 300PE, 300 bp paired-end read; Q30, base call quality value >30; QA, quality assurance; QC, quality control; S1, sequence read 1; S2, Sequence read 2.
a300PE statistics for assembled paired-end sequences; bThis pool was sequenced with both 250PE and 300PE protocol.
Figure 2Flow diagram outlining the sequence data analysis process. Pre-processing for sequences generated with the 250 bp paired-end read (250PE; left panel) and 300 bp paired-end read (300PE; right panel) MiSeq protocols. R1 and R2 refers to read 1 and read 2.
Figure 3Taxonomic assignments of clinical samples. (A) Ten anal samples sequenced using 250 bp paired-end read (250PE) MiSeq protocol (pool 199–205). Ten vaginal samples sequenced using (B) 250PE and (C) 300 bp paired-end read (300PE) MiSeq protocols (pool 235–240) and analyzed using QIIME (version 1.6.0).