| Literature DB >> 30026559 |
Andrea D Tyler1, Laura Mataseje2, Chantel J Urfano2, Lisa Schmidt2, Kym S Antonation2, Michael R Mulvey2, Cindi R Corbett2,3.
Abstract
The MinION sequencer (Oxford Nanopore Technologies) is a paradigm shifting device allowing rapid, real time long read sequencing of nucleic acids. Yet external benchmarking of this technologies' capabilities has not been extensively reported, nor has thorough evaluation of its utility for field-based analysis with sub-optimal sample types been described. The aim of this study was to evaluate the capability of the MinION sequencer for bacterial genomic and metagenomic applications, with specific emphasis placed on the quality, yield, and accuracy of generated sequence data. Two independent laboratories at the National Microbiology Laboratory (Public Health Agency of Canada), sequenced a set of microbes in replicate, using the currently available flowcells, sequencing chemistries, and software available at the time of the experiment. Overall sequencing yield and quality improved through the course of this set of experiments. Sequencing alignment accuracy was high reaching 97% for all 2D experiments, though was slightly lower for 1D sequencing (94%). 1D sequencing provided much longer sequences than 2D. Both sequencing chemistries performed equally well in constructing genomic assemblies. There was evidence of barcode cross-over using both the native and PCR barcoding methods. Despite the sub-optimal nature of samples sequenced in the field, sequences attributable to B. anthracis the target organism used in this scenario, could none-the-less be detected. Together, this report showcases the rapid advancement in this technology and its utility in the context of genomic sequencing of microbial isolates of importance to public health.Entities:
Mesh:
Year: 2018 PMID: 30026559 PMCID: PMC6053456 DOI: 10.1038/s41598-018-29334-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1DNA yield in number of reads which were analyzed further (pass) and reads which were excluded based on either internal Epi2Me filtering parameters or inability to demultiplex (Fail), on each of the flowcells tested. FAB-R9.4; FAF- R9.4 SpotOn. The first set of samples were assessed using Epi2Me, the second set by Albacore, and the third by MinKNOW direct basecalling. *This sample had a second top up run added 1 hour following the commencement of sequencing.
Figure 2Mean quality per 2000 base pair window across each of the MinION runs included in this analysis calculated using fastqc[32]. Lines extend further as the length of the read increases. Earlier runs were characterized by both shorter reads, and lower quality output data generated. L1-2D-FAB29623 is broken into pre and post, referring to read quality before and after the wash buffer was applied.
Figure 3Boxplot depicting the mean genomic coverage obtained across each of the reference genomes included in this analysis. Sequenced organisms and the corresponding average levels of coverage for each are coloured as described.
Figure 4Error characteristics of data generated in various sequencing runs included in this analysis. (A) Proportion of bases accurately matching mapped segment of the reference. (B) Insertions per 100 aligned bases. (C) Deletions per 100 aligned bases. (D) Substitutions per 100 aligned bases.
Figure 5Assembly characteristics of contigs generated following subsampling of fastq reads generated with 1D sequencing and native barcoding. Dashed line is the reference genome size reported by NCBI.
Sequence quality of data generated using the MinION sequencing device with the 1D rapid kit from ONT (SQK-RAD002).
| Number of Reads | Reads with alignment | Aligned base identity | Identical Bases per 100 aligned | Inserted based per 100 aligned | Deleted bases per 100 aligned | Substitutions per 100 aligned | |
|---|---|---|---|---|---|---|---|
| L1-1D- FAF06136-RAP - Pass | 1277 | 1218 | 89.04% | 79.50% | 2.91% | 7.80% | 9.79% |
| L1-1D-FAF05394-RAP -Pass | 5044 | 4731 | 92.04% | 82.83% | 1.53% | 8.47% | 7.16% |
Data shown were obtained from a pair of isolates run in the laboratory, and used for benchmarking the technologies’ capabilities using NanoOK. F. hispaniensis = Francisella hispaniensis; Y. rohdei = Yersinia rohdei.
Primary sequencing output metrics characterizing data obtained from two MinION runs carried out at a remote Canadian location.
| L1-1D- FAF06136-RAP | L1-1D-FAF05394-RAP | Sample 1 T1 | Sample 1 T-final | Sample 2 T1** | Sample 2 T-final | |
|---|---|---|---|---|---|---|
| Elapsed time | 26 h | 26 h | 49 min | 5 hr | 60 min | 23 hr |
| Total sequences generated | 33467 | 31462 | 3389 | 14855 | 3292 | 54619 |
| Pass (minKNOW) sequences | 1278 | 5044 | 472 | 2611 | 1328 | 29623 |
| Reads skipped by local basecaller | 7532 | 11140 | 219 | 2044 | 249 | 5866 |
| Pass (mean q > 8) sequences | 1043 | 4105 | 365 | 1994 | 1013 | 2971 |
**Second library prepared from the same sample was added at this time.