| Literature DB >> 31453259 |
Sanjay S Gautam1, Rajendra Kc1, Kelvin Wc Leong2, Micheál Mac Aogáin3, Ronan F O'Toole1,3,2.
Abstract
Bacterial whole genome sequencing (WGS) is becoming a widely-used technique in research, clinical diagnostic, and public health laboratories. It enables high resolution characterization of bacterial pathogens in terms of properties that include antibiotic resistance, molecular epidemiology, and virulence. The introduction of next-generation sequencing instrumentation has made WGS attainable in terms of costs. However, the lack of a beginner's protocol for WGS still represents a barrier to its adoption in some settings. Here, we present detailed step-by-step methods for obtaining WGS data from a range of different bacteria (Gram-positive, Gram-negative, and acid-fast) using the Illumina platform. Modifications have been performed with respect to DNA extraction and library normalization to maximize the output from the laboratory consumables invested. The protocol represents a simplified and reproducible method for producing high quality sequencing data. The key advantages of this protocol include: simplicity of the protocol for users with no prior genome sequencing experience and reproducibility of the protocol across a wide range of bacteria.Entities:
Keywords: Enterococcus faecium; Haemophilus influenzae; Mycobacterium tuberculosis; whole genome sequencing
Year: 2019 PMID: 31453259 PMCID: PMC6706130 DOI: 10.14440/jbm.2019.276
Source DB: PubMed Journal: J Biol Methods ISSN: 2326-9901
Percentage sequence coverage and mean read depth for each of the sequenced genomes with respect to reference strains.
| Sample | Reference sequence coverage (%) | Mean read depth |
|---|---|---|
| VRE1 | 90.98 | 197.4 |
| VRE2 | 89.4 | 178.4 |
| VRE3 | 90.96 | 168 |
| NtHi1 | 93.8 | 263.5 |
| NtHi2 | 86.5 | 61.3 |
| NtHi3 | 89.1 | 124.9 |
| MTBC1 | 96.4 | 104.4 |
| MTBC2 | 96.8 | 100.4 |
| MTBC3 | 96.8 | 68.4 |
Coverage refers to the percentage of reference genome bases covered by mapped sequence reads. Mean read depth indicates the mean number of times each base is mapped by a sequence read. Reference genomes used were E. faecium ST18 DO (TX16) (accession number NC_017960), Haemophilus influenzae 86-028NP (nontypeable) (accession number NC_007146), and Mycobacterium tuberculosis H37Rv (accession number NC000962). VRE, vancomycin resistant Enterococcus faecium; NtHi, non-typeable Haemophilus influenza; MTBC, Mycobacterium tuberculosis complex.
Troubleshooting table.
| Step # | Problems | Causes | Suggestions |
|---|---|---|---|
| 23 | Low concentration of AMPure XP bead captured purified products | Bead clean-up affects the quality and quantity of amplified libraries that will be included in downstream sequencing process | Make sure AMPure XP beads are held at room temperature for 20 min before starting the clean-up process. Furthermore, ensure that 80% ethanol is freshly prepared |
| 35 | Variation in the concentration of amplified library | The concentration of input DNA used for library preparation affects the final yield of genomic data | Measure the concentration of input DNA using Qubit fluorometer rather than a nanodrop and make the appropriate dilution for a DNA concentration of 0.2 ng/μl |
| 56 | Poor sequencing results | The choice of forward and reverse index primer set affects the sequencing of libraries prepared | Avoid primer combinations S502 with N705 /706, and S503 with N701/702 |
| 56 | Poor sequencing results | Effective denaturation of pooled library not achieved after bead clean-up | Ensure that the NaOH is freshly prepared at the correct concentration |
| 56 | Poor sequencing results | Repeated thawing and freezing of the pooled library reduces the quality of sequence reads generated | Before preparing the pooled library for loading onto MiSeq, ensure that the machine has already been appropriately cleaned after the previous run and has sufficient storage space (at least 25 GB) |