Fiona Fouhy, Adam G Clooney, Catherine Stanton, Marcus J Claesson, Paul D Cotter.
Abstract
BACKGROUND: Next-generation sequencing platforms have revolutionised our ability to investigate the microbiota composition of complex environments, frequently through 16S rRNA gene sequencing of the bacterial component of the community. Numerous factors, including the DNA extraction method, primer sequences and sequencing platform employed, can affect the accuracy of the results achieved. The aim of this study was to determine the impact of these three factors on 16S rRNA gene sequencing results, using mock communities and mock community DNA.
Keywords: 16S rRNA; Bias; DNA extraction; Gut microbiota; Ion PGM; MiSeq; Mock communities; Next-generation sequencing
Year: 2016 PMID: 27342980 PMCID: PMC4921037 DOI: 10.1186/s12866-016-0738-z
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Details on number of sequencing reads, read lengths, percentage of reads retained post quality analysis
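| Primer set / sample | Raw reads | Quality cutoff | Length range (bp) | Reads after quality filtering | % retained after quality filtering | Reads after chimera removal | % chimeras | % retained after chimera removal |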
|---|---|---|---|---|---|---|---|---|
| MiSeq | ||||||||
| V4-V5 | ||||||||
| Mock DNA | 47966 | Q25 | 365–385 | 42701 | 89.023475 | 47966 | 0 | 100 |
| Qiagen PBS^a | ||||||||
| Qiagen glycerol | 14071 | Q25 | 365–385 | 13724 | 97.533935 | 13717 | 0.05100554 | 99.94899446 |
| RBB PBS | 18072 | Q25 | 365–385 | 17253 | 95.4681275 | 18026 | 0.25453741 | 99.74546259 |
| RBB glycerol | 22650 | Q25 | 365–385 | 20534 | 90.6578366 | 22626 | 0.10596026 | 99.89403974 |
| V1-V2 | ||||||||
| Mock DNA | 576244 | Q25 | 305–325 | 310254 | 53.840734 | 308989 | 0.40773044 | 99.59226956 |
| Qiagen PBS | 206140 | Q25 | 305–325 | 165035 | 80.05966 | 164117 | 0.55624564 | 99.44375436 |
| Qiagen glycerol | 274886 | Q25 | 305–325 | 153566 | 55.86534 | 152034 | 0.99761666 | 99.00238334 |
| RBB PBS | 420677 | Q25 | 305–325 | 327953 | 77.958386 | 324319 | 1.10808561 | 98.89191439 |
| RBB glycerol | 342405 | Q25 | 305–325 | 189474 | 55.336224 | 181819 | 4.04013216 | 95.95986784 |
| V1-V2 deg | ||||||||
| Mock DNA | 339219 | Q25 | 305–325 | 164586 | 48.5190983 | 161382 | 1.94670264 | 98.05329736 |
| Qiagen PBS | 432220 | Q25 | 305–325 | 170830 | 39.5238536 | 166923 | 2.28706902 | 97.71293098 |
| Qiagen glycerol | 277087 | Q25 | 305–325 | 100478 | 36.262257 | 100031 | 0.4448735 | 99.5551265 |
| RBB PBS | 407061 | Q25 | 305–325 | 111020 | 27.2735536 | 110057 | 0.86741128 | 99.13258872 |
| RBB glycerol | 373903 | Q25 | 305–325 | 117567 | 31.4431818 | 116400 | 0.99262548 | 99.00737452 |
| Ion PGM | ||||||||
| V4-V5 | ||||||||
| Mock DNA | 123511 | Q25 | 420–440 | 57467 | 46.5278396 | 51923 | 9.64727583 | 90.35272417 |
| Qiagen PBS | 173203 | Q25 | 420–440 | 74942 | 43.2683037 | 60366 | 19.4497078 | 80.55029223 |
| Qiagen glycerol | 194132 | Q25 | 420–440 | 58267 | 30.0141141 | 49474 | 15.0908748 | 84.90912523 |
| RBB PBS | 211696 | Q25 | 420–440 | 77227 | 36.4801413 | 68006 | 11.9401246 | 88.05987543 |
| RBB glycerol | 203949 | Q25 | 420–440 | 84407 | 41.386327 | 71316 | 15.5093772 | 84.49062282 |
| V1-V2 | ||||||||
| Mock DNA | 389410 | Q25 | 360–380 | 191184 | 49.0958116 | 190016 | 0.61092978 | 99.38907022 |
| Qiagen PBS | 21501 | Q25 | 360–380 | 14852 | 69.0758569 | 14804 | 0.3231888 | 99.6768112 |
| Qiagen glycerol | 35900 | Q25 | 360–380 | 26518 | 73.8662953 | 26418 | 0.37710235 | 99.62289765 |
| RBB PBS | 35343 | Q25 | 360–380 | 19157 | 54.2030954 | 19046 | 0.57942267 | 99.42057733 |
| RBB glycerol | 62195 | Q25 | 360–380 | 42150 | 67.7707211 | 42003 | 0.34875445 | 99.65124555 |
| V1-V2 deg | ||||||||
| Mock DNA | 207570 | Q25 | 360–380 | 75999 | 36.6136725 | 71500 | 5.91981473 | 94.08018527 |
| Qiagen PBS | 398459 | Q25 | 360–380 | 236444 | 59.3396058 | 231427 | 2.12185549 | 97.87814451 |
| Qiagen glycerol | 439533 | Q25 | 360–380 | 214562 | 48.8159023 | 208065 | 3.02802919 | 96.97197081 |
| RBB PBS | 376207 | Q25 | 360–380 | 180289 | 47.9228191 | 174723 | 3.08726545 | 96.91273455 |
| RBB glycerol | 389283 | Q25 | 360–380 | 184442 | 47.3799267 | 166537 | 9.70765878 | 90.29234122 |
^a DNA failed to amplify with the V4-V5 MiSeq primers for the Qiagen PBS-extracted cells, so there are no sequencing data for this sample
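The percentage columns in the table above can be recomputed from the read counts. The following is a minimal sketch, assuming that the chimera percentage is taken relative to the reads remaining after quality filtering; this reproduces the Qiagen glycerol V4-V5 MiSeq row, but some other rows appear to use a different denominator, so it should not be read as the authors' exact procedure. The function names are illustrative only.

```python
def percent(part: int, whole: int) -> float:
    """Return `part` as a percentage of `whole`."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return 100.0 * part / whole


def retention_summary(raw: int, after_quality: int, after_chimera: int) -> dict:
    """Summarise read loss at the quality-filtering and chimera-removal steps.

    Assumption: chimeras are counted relative to the quality-filtered reads.
    """
    pct_chimeras = percent(after_quality - after_chimera, after_quality)
    return {
        "pct_retained_quality": percent(after_quality, raw),
        "pct_chimeras": pct_chimeras,
        "pct_retained_chimera": 100.0 - pct_chimeras,
    }


# MiSeq V4-V5, Qiagen glycerol row from the table above.
summary = retention_summary(raw=14071, after_quality=13724, after_chimera=13717)
print(summary)
```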
Fig. 1 Rarefaction curves based on sample ID and number of observed species for mock cells (a) and mock DNA samples (b). The curves approach or reach a plateau (run parallel to the x axis), indicating that additional sequencing would not yield additional novel data
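The idea behind a rarefaction curve can be sketched in a few lines: count the distinct species observed at increasing subsampling depths and check whether the count levels off. The example below is a simplified illustration on a hypothetical toy community (the species labels and abundances are invented for the example). It uses a single shuffled read order rather than averaging over many random subsamples, as dedicated tools typically do.

```python
import random


def rarefaction_curve(reads, depths, seed=0):
    """Return (depth, observed species) pairs for nested subsamples of `reads`.

    `reads` is a sequence of species labels, one per read. A single shuffled
    order is used, so the observed count can never decrease with depth.
    """
    rng = random.Random(seed)
    shuffled = list(reads)
    rng.shuffle(shuffled)
    curve = []
    for depth in depths:
        if depth > len(shuffled):
            raise ValueError("depth exceeds the number of reads")
        curve.append((depth, len(set(shuffled[:depth]))))
    return curve


# Hypothetical toy community: three species with uneven abundances.
toy_reads = ["sp_A"] * 60 + ["sp_B"] * 30 + ["sp_C"] * 10
curve = rarefaction_curve(toy_reads, depths=[10, 25, 50, 100])
for depth, observed in curve:
    print(depth, observed)
```

A curve that stops rising before the maximum depth is reached suggests that deeper sequencing would add few new species.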
Number of expected vs. detected species in mock DNA and cells
| Sample | Expected | Detected | No. of expected species detected | % of expected species detected | % Misidentified/false hit |
|---|---|---|---|---|---|
| MiSeq | |||||
| V4-V5 mock DNA | 20 | 29 | 16 | 80 | 44 |
| V1-V2 mock DNA | 20 | 37 | 15 | 75 | 59 |
| V1-V2 deg mock DNA | 20 | 34 | 16 | 80 | 53 |
| V4-V5 RBB PBS | 22 | 51 | 16 | 73 | 68 |
| V4-V5 Qiagen glycerol | 22 | 24 | 17 | 77 | 29 |
| V4-V5 RBB glycerol | 22 | 30 | 16 | 73 | 47 |
| V1-V2 Qiagen PBS | 22 | 70 | 14 | 64 | 80 |
| V1-V2 Qiagen glycerol | 22 | 36 | 15 | 68 | 58 |
| V1-V2 RBB PBS | 22 | 40 | 16 | 73 | 60 |
| V1-V2 RBB glycerol | 22 | 36 | 17 | 77 | 53 |
| V1-V2deg Qiagen PBS | 22 | 38 | 15 | 68 | 61 |
| V1-V2deg Qiagen glycerol | 22 | 32 | 12 | 55 | 63 |
| V1-V2deg RBB glycerol | 22 | 29 | 14 | 64 | 52 |
| V1-V2deg RBB PBS | 22 | 31 | 12 | 55 | 61 |
| Ion PGM | |||||
| V4-V5 mock DNA | 20 | 33 | 20 | 100 | 40 |
| V1-V2 mock DNA | 20 | 27 | 19 | 95 | 29 |
| V1-V2 deg mock DNA | 20 | 27 | 19 | 95 | 29 |
| V4-V5 Qiagen PBS | 22 | 37 | 20 | 91 | 46 |
| V4-V5 RBB PBS | 22 | 40 | 20 | 91 | 50 |
| V4-V5 Qiagen glycerol | 22 | 31 | 20 | 91 | 35 |
| V4-V5 RBB glycerol | 22 | 38 | 20 | 91 | 47 |
| V1-V2 Qiagen PBS | 22 | 18 | 17 | 77 | 6 |
| V1-V2 Qiagen glycerol | 22 | 30 | 18 | 82 | 40 |
| V1-V2 RBB PBS | 22 | 24 | 18 | 82 | 25 |
| V1-V2 RBB glycerol | 22 | 26 | 18 | 82 | 31 |
| V1-V2 deg Qiagen PBS | 22 | 65 | 20 | 91 | 69 |
| V1-V2 deg Qiagen glycerol | 22 | 28 | 21 | 96 | 25 |
| V1-V2 deg RBB glycerol | 22 | 37 | 22 | 100 | 41 |
| V1-V2deg RBB PBS | 22 | 40 | 20 | 91 | 50 |
RBB: repeat bead beating extraction method
Note: DNA failed to amplify with the V4-V5 MiSeq primers for the Qiagen PBS-extracted cells, so there are no sequencing data for this sample
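The two percentage columns in the table above can be related to the count columns as follows. This is a sketch under an assumption: the false-hit percentage is taken as the share of detected species that are not among the expected ones. The tabulated values agree with this formula to within about one percentage point, which would be consistent with rounding, but the source does not state the formula explicitly.

```python
def detection_summary(expected: int, detected: int, expected_detected: int) -> dict:
    """Relate the species counts to the two percentage columns.

    Assumption: false hits = detected species that are not expected species.
    """
    if expected <= 0 or detected <= 0:
        raise ValueError("expected and detected must be positive")
    if expected_detected > min(expected, detected):
        raise ValueError("expected_detected cannot exceed expected or detected")
    false_hits = detected - expected_detected
    return {
        "pct_expected_detected": 100.0 * expected_detected / expected,
        "pct_false_hits": 100.0 * false_hits / detected,
    }


# MiSeq V1-V2 Qiagen PBS row: 22 expected, 70 detected, 14 expected species detected.
row = detection_summary(expected=22, detected=70, expected_detected=14)
print(row)
```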
Fig. 2 Percentage relative abundance of expected species (n = 20) detected in mock community DNA based on sequencing platform and primer set used
Fig. 3 Heat map of species abundance. Only the 20 expected taxa from mock DNA (HM-782D) were included. Hierarchical clustering was performed using hclust default parameters (complete linkage). The blue colours represent samples sequenced on the MiSeq platform while green represents the Ion PGM
Fig. 4 Percentage relative abundance of expected species based on extraction procedure
Fig. 5 Heat map of species abundance by sequencer and extraction method for mock community cells. Only expected taxa were included and hierarchical clustering was performed using hclust default parameters (complete linkage). The top colour legend depicts the sequencing technology and primer set. The blue colours represent samples sequenced on the MiSeq platform while green represents the Ion PGM. The bottom legend represents the samples and extraction method
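Complete-linkage hierarchical clustering, as named in the heat map captions, merges at each step the two clusters whose most distant members are closest. The sketch below is a plain-Python illustration of that rule, not a reimplementation of R's hclust. It assumes Euclidean distance between abundance profiles (the captions do not state the distance measure), and the sample names and abundance values are hypothetical.

```python
from itertools import combinations
from math import dist


def complete_linkage(profiles):
    """Agglomerative clustering with complete linkage.

    `profiles` maps a sample name to a list of relative abundances.
    Returns the merges in order as (cluster_a, cluster_b, distance) tuples.
    """
    clusters = [(name,) for name in profiles]
    merges = []

    def cluster_distance(a, b):
        # Complete linkage: distance between the two farthest members.
        return max(dist(profiles[x], profiles[y]) for x in a for y in b)

    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=lambda pair: cluster_distance(*pair))
        merges.append((a, b, cluster_distance(a, b)))
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
    return merges


# Hypothetical abundance profiles over three taxa (values are invented).
profiles = {
    "MiSeq_V4V5": [0.10, 0.20, 0.70],
    "MiSeq_V1V2": [0.12, 0.18, 0.70],
    "PGM_V4V5": [0.40, 0.40, 0.20],
    "PGM_V1V2": [0.45, 0.35, 0.20],
}
merges = complete_linkage(profiles)
for a, b, d in merges:
    print(a, b, round(d, 4))
```

In this toy input the two hypothetical MiSeq profiles merge first and the two platforms join only at the final step, which is the kind of platform-level grouping a heat map dendrogram makes visible.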
Sequences of primers used for MiSeq sequencing
| Sample | Primer sequence | Barcode | Ref |
|---|---|---|---|
| V4-V5 primer | | | |
| Forward primer | AATGATACGGCGACCACCGAGATCTACACTATGGTAATTGGGTGCCAGCMGCCGCGGTAA | ||
| Read 1 primer | TATGGTAATTGGGTGCCAGCMGCCGCGGTAA | ||
| Read 2 primer | AGTCAGTCAGTTCCGTCAATTYYTTTRAGTTT | ||
| Index primer | AAACTYAAARRAATTGACGGAACTGACTGACT | ||
| Reverse barcoded primers | |||
| PBS Qiagen | CAAGCAGAAGACGGCATACGAGATTAACGTGTGTGCAGTCAGTCAGTTCCGTCAATTYYTTTRAGTTT | TAACGTGTGTGC | |
| PBS RBB | CAAGCAGAAGACGGCATACGAGATCATTATGGCGTGAGTCAGTCAGTTCCGTCAATTYYTTTRAGTTT | CATTATGGCGTG | |
| Qiagen Glycerol | CAAGCAGAAGACGGCATACGAGATCCAATACGCCTGAGTCAGTCAGTTCCGTCAATTYYTTTRAGTTT | CCAATACGCCTG | |
| RBB Glycerol | CAAGCAGAAGACGGCATACGAGATGATCTGCGATCCAGTCAGTCAGTTCCGTCAATTYYTTTRAGTTT | GATCTGCGATCC | |
| Mock DNA | CAAGCAGAAGACGGCATACGAGATCAGCTCATCAGCAGTCAGTCAGTTCCGTCAATTYYTTTRAGTTT | CAGCTCATCAGC | |
| V1-V2 set 1 | | | |
| Forward primer | AATGATACGGCGACCACCGAGATCTACACTATGGTAATTTCAGAGTTTGATCCTGGCTCAG | ||
| Read 1 primer | TATGGTAATTTCAGAGTTTGATCCTGGCTCAG | ||
| Read 2 primer | AGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | ||
| Index primer | ACTCCTACGGGAGGCAGCATGCTGACTGACT | ||
| Reverse barcoded primers | |||
| PBS Qiagen | CAAGCAGAAGACGGCATACGAGATTCTTGGAGGTCAAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | TCTTGGAGGTCA | |
| PBS RBB | CAAGCAGAAGACGGCATACGAGATTCACCTCCTTGTAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | TCACCTCCTTGT | |
| Qiagen Glycerol | CAAGCAGAAGACGGCATACGAGATGCACACCTGATAAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | GCACACCTGATA | |
| RBB Glycerol | CAAGCAGAAGACGGCATACGAGATGCGACAATTACAAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | GCGACAATTACA | |
| Mock DNA | CAAGCAGAAGACGGCATACGAGATTCATGCTCCATTAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | TCATGCTCCATT | |
| V1-V2 degenerate | | | |
| Forward primer | AATGATACGGCGACCACCGAGATCTACACTATGGTAATTTCAGMGTTYGATYMTGGCTCAG | ||
| Read 1 primer | TATGGTAATTTCAGMGTTYGATYMTGGCTCAG | ||
| Read 2 primer | AGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | ||
| Index primer | ACTCCTACGGGAGGCAGCATGCTGACTGACT | ||
| Reverse barcoded primers | |||
| PBS Qiagen | CAAGCAGAAGACGGCATACGAGATTCTTGGAGGTCAAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | TCTTGGAGGTCA | |
| PBS RBB | CAAGCAGAAGACGGCATACGAGATTCACCTCCTTGTAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | TCACCTCCTTGT | |
| Qiagen Glycerol | CAAGCAGAAGACGGCATACGAGATGCACACCTGATAAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | GCACACCTGATA | |
| RBB Glycerol | CAAGCAGAAGACGGCATACGAGATGCGACAATTACAAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | GCGACAATTACA | |
| Mock DNA | CAAGCAGAAGACGGCATACGAGATTCATGCTCCATTAGTCAGTCAGCATGCTGCCTCCCGTAGGAGT | TCATGCTCCATT | |
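Two relationships in the MiSeq primer table can be checked programmatically: each index primer is the reverse complement of the corresponding read 2 primer, and each reverse barcoded primer is a shared leading sequence followed by the barcode and the read 2 primer. The sketch below verifies both for the V4-V5 set using IUPAC-aware complementation. The constant and function names are illustrative and do not come from the source.

```python
# IUPAC nucleotide complements, including the degenerate codes used in the table.
IUPAC_COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "R": "Y", "Y": "R", "M": "K", "K": "M",
    "S": "S", "W": "W", "B": "V", "V": "B",
    "D": "H", "H": "D", "N": "N",
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a sequence with IUPAC codes."""
    return "".join(IUPAC_COMPLEMENT[base] for base in reversed(seq.upper()))


# Sequences copied from the V4-V5 rows of the table above.
REVERSE_PREFIX = "CAAGCAGAAGACGGCATACGAGAT"  # shared start of every reverse barcoded primer
READ2_V4V5 = "AGTCAGTCAGTTCCGTCAATTYYTTTRAGTTT"
INDEX_V4V5 = "AAACTYAAARRAATTGACGGAACTGACTGACT"


def build_reverse_primer(barcode: str, read2: str = READ2_V4V5) -> str:
    """Concatenate shared prefix, sample barcode and read 2 primer (5' to 3')."""
    return REVERSE_PREFIX + barcode + read2


print(reverse_complement(READ2_V4V5) == INDEX_V4V5)
print(build_reverse_primer("TAACGTGTGTGC"))  # PBS Qiagen barcode
```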
Primers for amplification of DNA for sequencing on the Ion PGM platform
| Sample | Ion Linker | Barcode | Spacer | Primer | Ref |
|---|---|---|---|---|---|
| V4-V5 | | | | | |
| Forward barcoded primers | |||||
| Mock DNA | CCATCTCATCCCTGCGTGTCTCCGACTCAG | TCCCTTGTCTCC | GT | GTGCCAGCMGCCGCGGTAA | |
| PBS Qiagen | CCATCTCATCCCTGCGTGTCTCCGACTCAG | ACGAGACTGATT | GT | GTGCCAGCMGCCGCGGTAA | |
| PBS RBB | CCATCTCATCCCTGCGTGTCTCCGACTCAG | GCTGTACGGATT | GT | GTGCCAGCMGCCGCGGTAA | |
| Glycerol Qiagen | CCATCTCATCCCTGCGTGTCTCCGACTCAG | ATCACCAGGTGT | GT | GTGCCAGCMGCCGCGGTAA | |
| Glycerol RBB | CCATCTCATCCCTGCGTGTCTCCGACTCAG | TGGTCAACGATA | GT | GTGCCAGCMGCCGCGGTAA | |
| Reverse primer | CCTCTCTATGGGCAGTCGGTGAT | | CC | CCGTCAATTYYTTTRAGTTT | |
| V1-V2 set 1 | | | | | |
| Forward barcoded primers | |||||
| Mock DNA | CCATCTCATCCCTGCGTGTCTCCGACTCAG | TGCATACACTGG | GT | AGAGTTTGATCCTGGCTCAG | |
| PBS Qiagen | CCATCTCATCCCTGCGTGTCTCCGACTCAG | AGTCGAACGAGG | GT | AGAGTTTGATCCTGGCTCAG | |
| PBS RBB | CCATCTCATCCCTGCGTGTCTCCGACTCAG | ACCAGTGACTCA | GT | AGAGTTTGATCCTGGCTCAG | |
| Glycerol Qiagen | CCATCTCATCCCTGCGTGTCTCCGACTCAG | GAATACCAAGTC | GT | AGAGTTTGATCCTGGCTCAG | |
| Glycerol RBB | CCATCTCATCCCTGCGTGTCTCCGACTCAG | GTAGATCGTGTA | GT | AGAGTTTGATCCTGGCTCAG | |
| Reverse primer | CCTCTCTATGGGCAGTCGGTGAT | | CC | TGCTGCCTCCCGTAGGAGT | |
| V1-V2 set 2 | | | | | |
| Forward barcoded primers | |||||
| Mock DNA | CCATCTCATCCCTGCGTGTCTCCGACTCAG | GCGATATATCGC | GT | AGMGTTYGATYMTGGCTCAG | |
| PBS Qiagen | CCATCTCATCCCTGCGTGTCTCCGACTCAG | CGAGCAATCCTA | GT | AGMGTTYGATYMTGGCTCAG | |
| PBS RBB | CCATCTCATCCCTGCGTGTCTCCGACTCAG | AGTCGTGCACAT | GT | AGMGTTYGATYMTGGCTCAG | |
| Glycerol Qiagen | CCATCTCATCCCTGCGTGTCTCCGACTCAG | GTATCTGCGCGT | GT | AGMGTTYGATYMTGGCTCAG | |
| Glycerol RBB | CCATCTCATCCCTGCGTGTCTCCGACTCAG | CGAGGGAAAGTC | GT | AGMGTTYGATYMTGGCTCAG | |
| Reverse primer | CCTCTCTATGGGCAGTCGGTGAT | | CC | TGCTGCCTCCCGTAGGAGT | |
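The degenerate bases in the Ion PGM primers (M, Y, R) each stand for more than one nucleotide, so a single degenerate primer represents a pool of concrete sequences. The sketch below expands the degenerate V1-V2 forward primer into its variants and assembles a full forward primer. It assumes the components are joined in the column order of the table (linker, barcode, spacer, primer, 5' to 3'); the function names are illustrative.

```python
from itertools import product

# IUPAC codes mapped to the nucleotides they represent.
IUPAC_BASES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT",
    "S": "CG", "W": "AT", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer: str) -> list:
    """List every concrete sequence encoded by a degenerate primer."""
    choices = (IUPAC_BASES[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]


def build_fusion_primer(linker: str, barcode: str, spacer: str, primer: str) -> str:
    """Join the table columns in order (assumed 5' to 3' layout)."""
    return linker + barcode + spacer + primer


# Sequences copied from the table above.
ION_FORWARD_LINKER = "CCATCTCATCCCTGCGTGTCTCCGACTCAG"
V1V2_DEG_FORWARD = "AGMGTTYGATYMTGGCTCAG"

variants = expand_degenerate(V1V2_DEG_FORWARD)
print(len(variants))

# Mock DNA row of the degenerate V1-V2 set.
mock_forward = build_fusion_primer(ION_FORWARD_LINKER, "GCGATATATCGC", "GT", V1V2_DEG_FORWARD)
print(mock_forward)
```

The non-degenerate V1-V2 forward primer, AGAGTTTGATCCTGGCTCAG, is one of the sequences produced by this expansion, which shows how the degenerate set broadens the original primer.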