| Literature DB >> 28794860 |
Miten Jain1, John R Tyson2, Matthew Loose3, Camilla L C Ip4,5, Ewan Birney6, Bonnie L Brown7, Terrance P Snutch2, Hugh E Olsen1, David A Eccles8, Justin O'Grady9, Sunir Malla3, Richard M Leggett10, Ola Wallerman11, Hans J Jansen12, Vadim Zalunin6.
Abstract
BACKGROUND: Long-read sequencing is rapidly evolving and reshaping the suite of opportunities for genomic analysis. For the MinION in particular, as both the platform and chemistry develop, the user community requires reference data to set performance expectations and maximally exploit third-generation sequencing. We performed an analysis of MinION data derived from whole genome sequencing of Escherichiacoli K-12 using the R9.0 chemistry, comparing the results with the older R7.3 chemistry.Entities:
Keywords: CsgG; MinION; NanoOK; R9 chemistry; data release; long reads; marginAlign; minoTour; nanopore sequencing; third-generation sequencing
Year: 2017 PMID: 28794860 PMCID: PMC5538040 DOI: 10.12688/f1000research.11354.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Experimental conditions.
P1 refers to a typical R7.3 run from MARC Phase 1 [1]. P2 refers to the MARC Phase 2 R9.0 data presented in this study. NA: not available.
| P1b-Lab2-R2-2D | P2-Lab6-R1-2D | P2-Lab7-R1-2D | P2-Lab6-R1-1D | P2-Lab7-R1-1D | |
|---|---|---|---|---|---|
| Library & base-call
| 2D | 2D | 2D | 1D | 1D |
| Flow cell version | R7.3 | R9 | R9 | R9 | R9 |
| MinION device | Initial version | Mk1b | Mk1b | Mk1b | Mk1b |
| Experiment start date | 2015-07-25 | 2016-07-27 | 2016-07-27 | 2016-08-02 | 2016-07-29 |
| Active g1 pores
| 87.9 | 94 | NA | 94 | NA |
| Active g2 pores
| 60.7 | 77 | NA | 69 | NA |
| Mean ASIC
| 24.4 | 30.5 | 33.7 | 31.9 | 33.8 |
| Mean heat-sink
| 37.1 | 34.0 | 34.0 | 34.0 | 34.0 |
| Experiment run time (h) | 48.0 | 41.5 | 44.0 | 48.0 | 48.0 |
| Experimental notes | Full run | Hard drive filled
| Terminated early
| Full run; no
| Two restarts between
|
Read counts and base yields.
(“-”) indicates not applicable.
| P1b-Lab2-R1-2D | P2-Lab6-R1-2D | P2-Lab7-R1-2D | P2-Lab6-R1-1D | P2-Lab7-R1-1D | |
|---|---|---|---|---|---|
| Read count (K)
|
|
|
|
|
|
| Base yield (Mb)
|
|
|
|
|
|
Read lengths.
(“-”) indicates not applicable.
| P1b-Lab2-R1-2D | P2-Lab6-R1-2D | P2-Lab7-R1-2D | P2-Lab6-R1-1D | P2-Lab7-R1-1D | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| total | pass | total | pass | total | pass | total | pass | total | pass | |
| Mean length (Kb)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Longest aligned
|
|
|
|
|
|
|
|
|
|
|
| N50 length (Kb)
|
|
|
|
|
|
|
|
|
|
|
** : Longest read here is pre-alignment.
Figure 1. Read length distribution for template and 2D “pass” reads.
The distribution of template (“1D”) read lengths for experiments based on 1D “rapid” libraries (P2-Lab6-R1-1D and P2-Lab7-R1-1D) was skewed toward shorter read lengths due to enzymatic, rather than mechanical, DNA fragmentation. The long tails of the distributions were truncated at 40,000 bases for clarity.
Per-read accuracy metrics for target E. coli sample.
(“-”) indicates the metrics were not applicable for that experiment. NA: not available.
| P1b-Lab2-R1-2D | P2-Lab6-R1-2D | P2-Lab7-R1-2D | P2-Lab6-R1-1D | P2-Lab7-R1-1D | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| pass | fail | pass | fail | pass | fail | pass | fail | pass | fail | |
| Identity %
|
|
|
|
|
|
|
|
|
|
|
| Reads mapped %
|
|
|
|
|
|
|
|
|
|
|
| Longest perfectly
|
|
|
|
|
|
|
|
|
|
|
| Total error %
|
|
|
|
|
|
|
|
|
|
|
| Miscall %
|
|
|
|
|
|
|
|
|
|
|
| Insertion %
|
|
|
|
|
|
|
|
|
|
|
| Deletion %
|
|
|
|
|
|
|
|
|
|
|
Figure 2. Sequencing performance over time.
The mean read length (kb), Q-score, base quality (BQ), and GC%, speed (bases per second), and throughput (count) for each experiment, computed from “pass” reads that mapped to the E. coli reference, were plotted for 15 minute intervals. The values for template reads (“1D”) are plotted for the 1D libraries (P2-Lab6-R1-1D and P2-Lab7-R1-1D) whereas the values for 2D reads were plotted for the 2D libraries (P1b-Lab2-R2-2D, P2-Lab6-R1-2D, and P2-Lab7-R1-2D).