| Literature DB >> 33298935 |
Rowena A Bull1,2, Thiruni N Adikari1,2, James M Ferguson3, Jillian M Hammond3, Igor Stevanovski3, Alicia G Beukers4, Zin Naing2,5, Malinna Yeang2,5, Andrey Verich1, Hasindu Gamaarachchi3,6, Ki Wook Kim5,7, Fabio Luciani1,2, Sacha Stelzer-Braid2,5, John-Sebastian Eden8,9, William D Rawlinson2,5,7,10, Sebastiaan J van Hal4,11, Ira W Deveson12,13.
Abstract
Viral whole-genome sequencing (WGS) provides critical insight into the transmission and evolution of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Long-read sequencing devices from Oxford Nanopore Technologies (ONT) promise significant improvements in turnaround time, portability and cost, compared to established short-read sequencing platforms for viral WGS (e.g., Illumina). However, adoption of ONT sequencing for SARS-CoV-2 surveillance has been limited due to common concerns around sequencing accuracy. To address this, here we perform viral WGS with ONT and Illumina platforms on 157 matched SARS-CoV-2-positive patient specimens and synthetic RNA controls, enabling rigorous evaluation of analytical performance. We report that, despite the elevated error rates observed in ONT sequencing reads, highly accurate consensus-level sequence determination was achieved, with single nucleotide variants (SNVs) detected at >99% sensitivity and >99% precision above a minimum ~60-fold coverage depth, thereby ensuring suitability for SARS-CoV-2 genome analysis. ONT sequencing also identified a surprising diversity of structural variation within SARS-CoV-2 specimens that were supported by evidence from short-read sequencing on matched samples. However, ONT sequencing failed to accurately detect short indels and variants at low read-count frequencies. This systematic evaluation of analytical performance for SARS-CoV-2 WGS will facilitate widespread adoption of ONT sequencing within local, national and international COVID-19 public health initiatives.Entities:
Year: 2020 PMID: 33298935 PMCID: PMC7726558 DOI: 10.1038/s41467-020-20075-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Sequencing accuracy for Illumina and ONT whole-genome sequencing of synthetic SARS-CoV-2 controls.
| Read-level error rate (errors per base per read) | Erroneous variants | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Illumina samples | Reportable (bp) | Total | Mismatch | Deletion | Insertion | Total | SNVs | Indels | Consensus accuracy (%) |
| A | 28,687 | 0.00152 | 0.00083 | 0.00058 | 0.00011 | 0 | 0 | 0 | 100 |
| B | 28,687 | 0.00153 | 0.00082 | 0.00060 | 0.00012 | 0 | 0 | 0 | 100 |
| C | 28,687 | 0.00148 | 0.00079 | 0.00057 | 0.00012 | 0 | 0 | 0 | 100 |
| D | 28,687 | 0.00172 | 0.00098 | 0.00063 | 0.00011 | 0 | 0 | 0 | 100 |
| E | 28,687 | 0.00124 | 0.00089 | 0.00024 | 0.00011 | 0 | 0 | 0 | 100 |
| F | 28,687 | 0.00170 | 0.00137 | 0.00023 | 0.00011 | 0 | 0 | 0 | 100 |
| G | 28,687 | 0.00122 | 0.00088 | 0.00022 | 0.00011 | 0 | 0 | 0 | 100 |
| H | 28,687 | 0.00118 | 0.00084 | 0.00024 | 0.00011 | 0 | 0 | 0 | 100 |
| Mean | 28,687 | 0.00145 | 0.00092 | 0.00041 | 0.00011 | 0 | 0 | 0 | 100 |
Consensus-level accuracy of ONT whole-genome SARS-CoV-2 sequencing on patient specimens.
| Medaka | Medaka minus blacklista | Nanopolish | Nanopolish minus blacklista | |
|---|---|---|---|---|
| Cases analysed | 157 | 157 | 157 | 157 |
| Genome coverage (%) | 99.59 | 98.56 | 99.59 | 98.56 |
| Negative positions | 4,674,554 | 4,627,768 | 4,674,554 | 4,627,768 |
| Illumina variants | 1201 | 1162 | 1201 | 1162 |
| ONT variants | 1190 | 1159 | 1196 | 1164 |
| TPs | 1181 | 1155 | 1191 | 1160 |
| FNs | 20 | 7 | 10 | 2 |
| FPs | 9 | 4 | 5 | 4 |
| Sensitivity (%) | 98.33 | 99.40 | 99.17 | 99.83 |
| Precision (%) | 99.24 | 99.65 | 99.58 | 99.66 |
| Jaccard similarity (%) | 97.60 | 99.06 | 98.76 | 99.49 |
| Perfect concordance | 140/157 cases | 149/157 cases | 147/157 cases | 152/157 cases |
| Illumina SNVs | 1194 | 1162 | 1194 | 1162 |
| ONT SNVs | 1180 | 1155 | 1190 | 1160 |
| TPs | 1180 | 1155 | 1190 | 1160 |
| FNs | 14 | 7 | 4 | 2 |
| FPs | 0 | 0 | 0 | 0 |
| Sensitivity (%) | 98.83 | 99.40 | 99.66 | 99.83 |
| Precision (%) | 100 | 100 | 100 | 100 |
| Jaccard similarity (%) | 98.83 | 99.40 | 99.66 | 99.83 |
| Perfect concordance | 145/157 cases | 152/157 cases | 153/157 cases | 155/157 cases |
aBlacklist sites are error-prone low-complexity sequences (n = 15; 9–42 bp; see text for details).
Fig. 1Variant detection performance for whole-genome ONT sequencing of SARS-CoV-2.
(a; upper) Sensitivity with which Illumina comparison SNVs at consensus-level variant frequencies (80–100%) were detected via ONT sequencing on matched SARS-CoV-2 specimens (n = 157). Bars show mean ± range. (a; lower) Fraction of specimens tested in which SNVs were detected with perfect sensitivity (sn). Data are plotted separately for genome-wide variant detection (gold) and variant detection with error-prone ‘blacklist’ sites excluded (red). b Same as in a but Jaccard similarity (jac) scores for all variant candidates are plotted instead of SNV sn. c Correlation of variant frequencies observed for SNV candidates detected at sub-consensus frequencies (20–80%) with Illumina and ONT sequencing. Candidates detected with ONT but not Illumina were considered to be false-positives (FP; red) and candidates detected with Illumina but not ONT were considered to be false-negatives (FP; pink). d Sensitivity (blue) and precision (green) of SNV detection with ONT sequencing at sub-consensus variant frequencies (20–80%). Data are plotted separately for high (60–80%), intermediate (40–60%) and low (20–40%) frequencies. Error bars show 95% confidence intervals (Clopper-Pearson) calculated over all specimens (n = 157). Source data are provided as Source Data file.
Detection of structural variation in SARS-CoV-2 specimens with ONT sequencing.
| Specimen | SV type | Size | Position | Gene | Supporting ONT reads | Short-read evidence | Breakpoint resolution |
|---|---|---|---|---|---|---|---|
| nCoV_077 | Deletion | 15 | 18019-18034 | 94 | Yes | 0, 0 | |
| nCoV_087 | Deletion | 1132 | 1082-2214 | 48 | No | – | |
| nCoV_088 | Deletion | 34 | 26786-26820 | 75 | Yes | 0,0 | |
| nCoV_106 | Deletion | 548 | 6004-6552 | 20 | No | – | |
| nCoV_125 | Deletion | 27 | 27263-27290 | 20 | Yes | −2, −3 | |
| nCoV_183 | Deletion | 15 | 25533-25548 | 41 | Yes | −2, −2 | |
| nCoV_214 | Deletion | 29 | 23554-23583 | 28 | Yes | +1, +2 | |
| nCoV_200 | Deletion | 328 | 27906-28234 | 385 | Yes | 0, 0 | |
| nCoV_209 | Deletion | 639 | 2771-3410 | 48 | Yes | 0, 0 | |
| nCoV_211 | Deletion | 1840 | 509-2349 | 22 | No | – | |
| nCoV_225 | Deletion | 328 | 27906-28234 | 387 | Yes | 0, 0 | |
| nCoV_235 | Deletion | 37 | 26783-26820 | 21 | Yes | +3, +4 | |
| nCoV_249 | Deletion | 702 | 2664-3366 | 52 | Yes | −1, 0 | |
| nCoV_164 | Deletion | 588 | 22690-23278 | 59 | Yes | +1, +4 | |
| nCoV_083 | Deletion | 28 | 23554-23582 | 38 | Yes | 0, 0 | |
| nCoV_083 | Deletion | 13 | 29478-29491 | 36 | Yes | +1, +1 |