| Literature DB >> 31666009 |
Jennifer M Bouso1, Paul J Planet2,3,4.
Abstract
BACKGROUND: Nontuberculous mycobacteria (NTM) are a major cause of pulmonary and systemic disease in at-risk populations. Gaps in knowledge about transmission patterns, evolution, and pathogenicity during infection have prompted a recent surge in genomic NTM research. Increased availability and affordability of whole genome sequencing (WGS) techniques provide new opportunities to sequence and construct complete bacterial genomes faster and at a lower cost. However, extracting large quantities of pure genomic DNA is particularly challenging with NTM due to its slow growth and recalcitrant cell wall. Here we report a DNA extraction protocol that is optimized for long-read WGS of NTM, yielding large quantities of highly pure DNA with no additional clean-up steps.Entities:
Keywords: Bronchiectasis; Chronic obstructive pulmonary disease; Cystic fibrosis; Genome assembly; Long-read sequencing; Mycobacteria; Mycobacterium abscessus complex; Mycobacterium avium complex; Nontuberculous mycobacteria; Whole genome sequencing
Mesh:
Substances:
Year: 2019 PMID: 31666009 PMCID: PMC6822416 DOI: 10.1186/s12864-019-6134-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Differentiation of tested methods by variable
| Method 1 | Method 2 | Method 3 | Method 4 | Method 5 | Method 6 | Method 7 | |
|---|---|---|---|---|---|---|---|
| Early vs. Late bead-beating | Early | Late | Late | Early | Early | Early | Early |
| Bead Quantity | 150 mg | 150 mg | 150 mg | 75 mg | 150 mg | 150 mg | 150 mg |
| Phenol vs. No Phenolb | No phenol | No phenol | Phenol | No phenol | Phenol | No phenol | No phenol |
| Precipitation Temp/Reagentc | RT/2-Prop | RT/2-Prop | Cold ETOH | RT/2-Prop | RT/2-Prop | Cold ETOH | RT/2-Prop |
| Precipitation Saltd | NaOAc | NaOAc | NaOAc | NaOAc | NaCl | NaOAc | NaOAc |
| Number of washes | 3 | 3 | 3 | 3 | 3 | 3 | 5 |
a “Early” bead-beating refers to the timing prior to enzymatic digestion; “Late” bead-beating refers to timing after enzymatic digestion
All Early bead-beating was done in high SDS concentration, see Additional file 1
b DNA extractions in “no phenol” were extracted as described in Methods with chloroform:isoamyl alcohol (24:1, Tris-saturated)
Extractions in “phenol” were extracted using phenol:chloroform:isoamyl alcohol (25:24:1, Tris-saturated, pH 8.0)
c Precipitation reagent was either RT 2-Prop (room temperature isopropanol) or Cold ETOH (ethanol). See Additional file 4: Figure S1
d Precipitation salt was either 3 M sodium acetate (pH 5.2) or 5 M NaCl
Fig. 1Quantity and quality of DNA by variable methods. a Total DNA by Qubit® and b 260/280 and c 260/230 by NanoDrop 2000 UV-Vis Spectrophotometer. All methods performed in triplicate with error bars (standard deviation). All methods produced sufficient total DNA except Method 6 (a). For (b) and (c), horizontal bars representing significance values for one-way ANOVA with Tukey’s post-hoc multiple comparisons test against Method 5. Significance in p-values as follows: 0.033 (*), 0.002 (**), < 0.001 (***). DNA molecular weight demonstrated by (d) gel electrophoresis of genomic DNA on a 0.6% ethidium bromide gel of extractions by Methods 1–7 in triplicate (a, b, c) and (e) Bioanalyzer of Method 1 and Method 5 in triplicate
Fig. 2Quantity and quality of DNA extractions of 38 MAC and MABSC clinical isolates using optimized DNA extraction method. a Total DNA by Qubit® and b 260/280 and c 260/230 by NanoDrop 2000 UV-Vis Spectrophotometer. Significance by unpaired t-test with p-values as 0.033 (*), 0.002 (**), < 0.001 (***). d OD600 at the time of extraction by species complex (ns). Scatter plots with lines of best fit for (e) OD600 versus total DNA (ns), (F) OD600 versus 260/280 showing a positive relationship (p = 0.0033, R2 = 0.2051), and OD600 versus 260/230
ONT Sequencing Run Statistics
| Total reads | Total bases | Mean read length (bp) | Median read length (bp) | Longest read | Phred score | %Error Probabilitya | |
|---|---|---|---|---|---|---|---|
| Method 1 | 1,455,977 | 1,634,706,997 | 1122.8 | 751.0 | 34,548 | 13 | 5.011% |
| Method 5 | 1,500,966 | 2,778,573,523 | 1851.2* | 1196.0* | 64,327 | 14 | 3.981% |
Raw values are listed for each MinION ONT runs using DNA extracted by Method 1 (after additional clean-up) and by Method 5. For statistical analyses comparing Method 1 (n = 3) versus Method 5 (n = 3), paired-t test comparisons (total reads, total bases, mean read length, median read length, longest read) were completed on basecalled, demultiplexed, and trimmed reads. Mean read length and median read length were significantly longer from Method 5, p = 0.0168 and p = 0.0101, respectively. Notably, Phred score was higher and error probability lower in the Method 5 run, which may reflect higher quality substrates
* Indicates p-value of p < 0.05
a Error probability percentage is a function of Mean Phred score, where probability P% = 100*10^(−Phred/10)
Assembly Statistics
| Sample | Length (bp) | Contigs | N50 (Mbp) | %GC | Coverage | Circularized, complete chromosome | Plasmids | Completeness/Contaminationb | Consistency, Coarse/Finec | |
|---|---|---|---|---|---|---|---|---|---|---|
| Method 1 | CHOP101034 | 5,440,506 | 8 | 4.46 | 68.9 | 102.3x | No | 2 | 100/2.5 | 98.7/96.5 |
| CHOP101115 | 5,391,592 | 7 | 4.04 | 69.0 | 85.2x | No | 2 | 100/5.7 | 98.7/96.6 | |
| CHOP101174 | 5,171,418 | 3 | 5.11 | 69.1 | 170.3x | Yesa | 2 | 100/1.7 | 98.2/95.5 | |
| Method 5 | CHOP101034 | 5,368,111 | 3 | 5.30 | 69.9 | 83.1x | Yesa | 1 | 100/0.9 | 98.5/97.3* |
| CHOP101115 | 5,393,712 | 5 | 5.25 | 68.9 | 97.3x | Yes | 2 | 100/0.9 | 98.8/97.5* | |
| CHOP101174 | 5,130,681 | 3 | 5.10 | 69.2 | 75.7x | Yesa | 2 | 100/0.9 | 98.7/97.3* |
Comparison of genomes assembled from reads generated from Method 1 versus Method 5 sequencing runs showing generally more complete assemblies from Method 5, with all Method 5 genomes producing complete and circularized bacterial chromosomes, while only 1/3 bacterial chromosomes by Method 1 being circular and complete. All listed parameters were evaluated for statistical significance between the two methods. Method 5 genomes had significantly higher fine consistency scores than Method 1 genomes by unpaired t-test, p = 0.0310
*Indicates significance with p < 0.05
aComplete and circularized chromosome and plasmids without extra unintegrated plasmid contigs
bCompleteness is the percentage of genes with universal roles represented in the genome; Contamination approximates the percentage of the genome that is contaminated and is estimated by universal roles that are represented more than once in the genome [37].
cGenome consistency estimates the percentage of universal roles expected to be present vs. absent (Coarse) in the genome and universal roles that are present in the exact number (Fine) as expected in the genome [37]
Fig. 3Representative complete and circularized assembly. CHOP101034 bacterial 1 chromosome (a), pMARIA (b), and pFLAC0181_CHOP101 (c). Genome graphics by PATRIC [37], from outside in, position label (Mbp, grey), contigs (dark blue), CDS forward (green), CDS reverse (purple), non-CDS features (teal), antimicrobial resistance genes (red), virulence factor genes (orange), transporters (blue), drug targets (black), GC content (lavender), GC skew (peach)