| Literature DB >> 32973734 |
Jake C Szamosi1, Jessica D Forbes2,3,4, Julia K Copeland5, Natalie C Knox4,6, Shahrokh Shekarriz7, Laura Rossi1, Morag Graham4,6, Christine Bonner4, David S Guttman5, Gary Van Domselaar4,6, Michael G Surette7, Charles N Bernstein2,3.
Abstract
BACKGROUND: In studies evaluating the microbiome, numerous factors can contribute to technical variability. These factors include DNA extraction methodology, sequencing protocols, and data analysis strategies. We sought to evaluate the impact these factors have on the results obtained when the sequence data are independently generated and analyzed by different laboratories.Entities:
Keywords: 16S rRNA; inflammatory bowel diseases; intestinal biopsies; microbiome; replicability; standards; technical variability
Year: 2020 PMID: 32973734 PMCID: PMC7472644 DOI: 10.3389/fmicb.2020.02028
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Schematic of experimental design. Biopsies taken from 32 patients were sent to 3 centers for DNA extraction. DNA was independently sequenced, extracted, and processed at each center and sent to McMaster (MM) for sequencing. FASTQ files were processed into OTU or ASV tables on site and sent to MM for processing.
Dataset description.
| Data Seta,b | DNA Extraction | Sequencing | Bioinformatic Processing | Sample Count | Min Depth | Max Depth | ASV/OTU | |
| MMESP | MM | MM | MM | 59 | 1,029 | 32,305 | 400 | |
| UTESP | UT | UT | UT | 64 | 1,022 | 167,922 | 757 | |
| UTESMMP | UT | UT | MM | 58 | 1,125 | 114,877 | 396 | |
| UTEMMSP | UT | MM | MM | 61 | 2,745 | 33,415 | 403 | |
| UMESP | UM | UM | UM | 64 | 5,480 | 273,886 | 289 | |
| UMESMMP | UM | UM | MM | 62 | 1,963 | 103,781 | 418 | |
| UMEMMSP | UM | MM | MM | 60 | 1,445 | 24,413 | 411 | |
| PE | All | MM | MM | |||||
| PS | All | All | MM | |||||
| PP | All | All | All |
Extraction and sequencing methods.
| Step | UT | UM | MM | |
| Extraction | Mechanical lysis | MO BIO PowerSoil DNA Isolation kit standard protocol | MO BIO PowerSoil DNA Isolation kit + zirconia/silica beads during homogenization | Bead-beating with ceramic and glass beads in GES + sodium phosphate |
| Enzymatic lysis | MO BIO PowerSoil DNA isolation kit + lysozyme incubation + proteinase K | Lysozyme and RNaseA | ||
| Incubation | SDS, NaCl, Proteinase K | |||
| Extraction | MO BIO PowerSoil DNA isolation kit | -centrifugation -phenol, chloroform, isoamyl | ||
| Purification | Zymo DNA Clean and Concentrator-25 kit | |||
| Amplification | Primers | 515F and 806R primers; V4 region | 515fXT and 806rXT primers; V4 region | Nested PCR: 8f and 926r primers [first reaction (1)] and 341f and 806r primers; V3-V4 region [second reaction (2)] |
| PCR Mix | −12.5 μL KAPA2G Robust HotStart ReadyMix (KAPA Biosystems) | −12.5 μL of KAPA2G Robust HotStart ReadyMix (KAPA Biosystems) | −1 unit of Taq, 1× buffer, 1.5 mM MgCl2 (ThermoFisher) | |
| PCR Cycles | −3 min @ 95°C | −3 min @ 95°C | −5 min @ 98°C | |
| Library Prep and Sequencing | Purification and Normalization | -gel purification on 2% TAE | -purified by AMPure XP -quantitated with PicoGreen and pooled in equimolar amounts | SequalPrep normalization kit (ThermoFisher) |
| Sequencing | -Illumina MiSeq; V2 chemistry | -Illumina MiSeq; V3 chemistry | -Illumina MiSeq; V2 chemistry -spike-in either 5% PhiX alone or 1% PhiX and 2.5% – 19% bacterial genomic DNA |
FIGURE 2Samples independently extracted and sequenced at each study center, bioinformatically processed at McMaster, and visualized together as (A) PCoA plots and (B) UPGMA trees. Samples independently extracted, sequenced, bioinformatically processed at each study center, and visualized as (C) PCoA plots and (D) UPGMA trees based on Bray-Curtis dissimilarity. Tree branches are colored according to patient. Diagnosis is indicated by the inner colored ring. Dataset is indicated by the outer colored ring.
FIGURE 3R2 values associated with center, diagnosis, and patient in all 7 unpooled datasets and 3 pooled datasets. PE, Post-Extraction; PS, Post-Sequencing; PP, Post-Processing.
FIGURE 4Estimated log2 fold-changes (with 95% confidence intervals) of the 16 most-common genera for all three diagnosis pairs in all 5 MMp datasets and the pooled PE and PS datasets.