| Literature DB >> 27457073 |
Minh Duc Cao1, Devika Ganesamoorthy1, Alysha G Elliott1, Huihui Zhang1, Matthew A Cooper1, Lachlan J M Coin2,3.
Abstract
The recently introduced Oxford Nanopore MinION platform generates DNA sequence data in real-time. This has great potential to shorten the sample-to-results time and is likely to have benefits such as rapid diagnosis of bacterial infection and identification of drug resistance. However, there are few tools available for streaming analysis of real-time sequencing data. Here, we present a framework for streaming analysis of MinION real-time sequence data, together with probabilistic streaming algorithms for species typing, strain typing and antibiotic resistance profile identification. Using four culture isolate samples, as well as a mixed-species sample, we demonstrate that bacterial species and strain information can be obtained within 30 min of sequencing and using about 500 reads, initial drug-resistance profiles within two hours, and complete resistance profiles within 10 h. While strain identification with multi-locus sequence typing required more than 15x coverage to generate confident assignments, our novel gene-presence typing could detect the presence of a known strain with 0.5x coverage. We also show that our pipeline can process over 100 times more data than the current throughput of the MinION on a desktop computer.Entities:
Keywords: Antibiotic resistance; Nanopore sequencing; Pathogen identification; Real-time analysis
Mesh:
Substances:
Year: 2016 PMID: 27457073 PMCID: PMC4960868 DOI: 10.1186/s13742-016-0137-2
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Fig. 1Schematic of the real-time analysis pipeline. Once the MinION starts sequencing, DNA fragments are sequenced (on the MinION) and base-called (by Metrichor cloud) instantaneously, and are simultaneously streamed through the pipeline where they are aligned by BWA-MEM [11]. Arrows show the data flow
Details of the four samples
| Sample | Species | Strain | Information | Proportion |
|---|---|---|---|---|
| Single sample 1 |
| ATCC BAA-2146 | NDM-1 positive resistant | 100 % |
| Single sample 2 |
| ATCC 700603 | K6, ESBL | 100 % |
| Single sample 3 |
| ATCC 13883 | Type strain | 100 % |
| Mixture sample |
| ATCC 25922 | Seattle 1946 | 75 % |
| (Library mix) |
| ATCC 25923 | Seattle 1945, Methicillin sensitive | 25 % |
Details of the four MinION sequencing runs
| Sample | Chemistry | Basecall | Time | Read | Base | Median | Quality | Quality 2D |
|---|---|---|---|---|---|---|---|---|
| version | (hrs) | count | count (Mb) | length | mean (std) | mean (std) | ||
| Single sample 1 | R7 | 1.4 | 60 | 38165 | 185 | 4580 | 4.70 (0.91) | 8.96 (0.63) |
| Single sample 2 | R7 | 1.4 | 60 | 7293 | 39 | 4936 | 4.95 (1.2) | 9.34 (0.87) |
| Single sample 3 | R7.3 | 1.9 | 36 | 15911 | 86 | 5242 | 4.58 (1.7) | 9.46 (1.48) |
| Mixture sample | R7.3 | 1.10 | 21 | 5631 | 12 | 825 | 5.44 (2.1) | 10.72(2.41) |
Read quality in Phred score
Fig. 2Sequencing yields over time for the four samples. Yields are shown in terms of read count (left) and base count (right)
Fig. 3Real-time identification of bacterial species from MinION sequencing data for four different bacterial samples: a) K. pneumoniae ATCC BAA-2146, b) K. quasipneumoniae ATCC 700603, c) K. pneumoniae ATCC 13883 and d) Mixture of 75 % E. coli ATCC 25922 and 25 % S. aureus ATCC 25923. The bars represent confidence intervals at the 95 % level
MLST results for three K. pneumoniae strains
| ATCC BAA-2146 | ATCC 700603 | ATCC 13883 | ||||
|---|---|---|---|---|---|---|
| ST-11 | ST-489 | ST-3 | ||||
| Rank | Type | Score | Type | Score | Type | Score |
| 1 |
|
|
|
|
|
|
| 2 |
|
|
|
| ST-136 | 1450.21 |
| 3 |
|
| ST-257 | 413.57 | ST-38 | 1444.81 |
| 4 | ST-1080 | 1984.46 | ST-356 | 413.57 | ST-1106 | 1444.19 |
| 5 | ST-1680 | 1982.62 | ST-414 | 413.57 | ST-931 | 1441.44 |
The top five probable sequence types are shown for each sample. The highest score sequence types are highlighted in bold
Fig. 4Real-time strain identification from MinION sequencing data on three different K. pneumoniae strains (a, b and c) and a E. coli strain (d) and a S. aureus strain (e) from the mixture sample. The bars represent confidence intervals at the 95 % level
Time-line of resistance gene detection from the K. pneumoniae samples
| Time | genes | Class | TP/FP | Sensitivity | Specificity | Data |
|---|---|---|---|---|---|---|
| (mins) | (%) | (%) | (no. of reads) | |||
|
| ||||||
| 30 | 1228 | |||||
| mphA | macrolide | TP | ||||
| blaSHV | beta-lactamase | FP ∗ | ||||
| strA | aminoglycoside | TP | ||||
| blaTEM | beta-lactamase | TP | ||||
| strB | aminoglycoside | TP | ||||
| blaCTX | beta-lactamase | TP | 26.67 | 87.50 | ||
| 60 | 2613 | |||||
| blaLEN | beta-lactamase | TP | ||||
| sul2 | sulphonamide | TP | ||||
| blaOXA | beta-lactamase | TP | ||||
| aac3 | aminoglycoside | TP | ||||
| aac6 | aminoglycoside | TP | ||||
| blaCMY | beta-lactamase | TP | ||||
| blaCFE | beta-lactamase | TP | ||||
| blaLAT | beta-lactamase | TP | ||||
| blaBIL | beta-lactamase | TP | 53.33 | 94.12 | ||
| 90 | 3844 | |||||
| QnrB | quinolone | TP | ||||
| aadA | aminoglycoside | TP | ||||
| oqxA | quinolone | TP | ||||
| tetA | tetracycline | TP | ||||
| oqxB | quinolone | TP | 76.67 | 95.83 | ||
| 120 | 5258 | |||||
| dfrA | trimethoprim | TP | 80.00 | 96.00 | ||
| 240 | 10 788 | |||||
| blaOKP | beta-lactamase | TP | 83.33 | 96.15 | ||
| 270 | 11 931 | |||||
| rmtC | aminoglycoside | TP | 86.67 | 96.43 | ||
| 300 | 13 022 | |||||
| sul1 | sulphonamide | TP | ||||
| sul3 | sulphonamide | TP | 93.33 | 96.55 | ||
| 540 | 20 200 | |||||
| fosA | fosfomycin | TP | 96.67 | 96.67 | ||
| 600 | 21 546 | |||||
| blaNDM | beta-lactamase | TP | 100.00 | 96.77 | ||
|
| ||||||
| 30 | 582 | |||||
| oqxA | quinolone | TP | ||||
| blaSHV | beta-lactamase | TP | ||||
| oqxB | quinolone | TP | 27.27 | 100.00 | ||
| 60 | 1090 | |||||
| aadB | aminoglycoside | TP | 36.36 | 100.00 | ||
| 390 | 3704 | |||||
| sul1 | sulphonamide | TP | ||||
| sul3 | sulphonamide | TP | 54.55 | 100.00 | ||
| 420 | 3810 | |||||
| blaOXA | beta-lactamase | TP | 63.64 | 100.00 | ||
| 540 | 4156 | |||||
| blaOKP | beta-lactamase | TP | 72.73 | 100.00 | ||
|
| ||||||
| 30 | 1264 | |||||
| fosA | fosfomycin | TP | 16.67 | 100.00 | ||
| 60 | 2186 | |||||
| blaSHV | beta-lactamase | TP | ||||
| blaOKP | beta-lactamase | TP | 50.00 | 100.00 | ||
| 90 | 2952 | |||||
| blaLEN | beta-lactamase | TP | 66.67 | 100.00 | ||
| 120 | 3584 | |||||
| oqxA | quinolone | TP | 83.33 | 100.00 | ||
| 570 | 8112 | |||||
| oqxB | quinolone | TP | 100.00 | 100.00 | ||
TP/FP: true positives/false positives according to the resistance gene profiles obtained from MiSeq sequencing. ∗Gene blaSHV was detected from MinION sequencing of K. pneumoniae ATCC BAA-2146 but not from MiSeq sequencing due to the inability to resolve a repeat in the gene
Report of Metrichor What’s In My Pot Bacteria k24 for SQK-MAP005 v1.27 (WIMP) from the first 1000 reads of three single samples and the first 3000 reads of the mixture sample
| Sample | Reported by Metrichor | Sequence type | Level | Accuracy |
|---|---|---|---|---|
| species/strain | ||||
|
|
| - | Species |
|
|
|
| - | Sub-species |
|
|
| ST146 | Strain |
| |
|
| ST11 | Strain |
| |
|
| ST86 | Strain |
| |
|
| - | Species | ×/ | |
|
| - | Strain |
| |
|
|
| ST1084 | Strain |
|
|
| ST86 | Strain |
| |
|
| ST67 | Strain |
| |
|
| ST17 | Strain | ×/ × | |
| Mixture sample |
| ST597 | Strain |
|
| 75 % |
| ST48 | Strain |
|
|
| ST22 | Strain |
| |
| 25 % |
| ST36 | Strain |
|
|
| ST239 | Strain |
| |
|
| - | Species | ×/ |
The last column indicates if the detection is correct () or incorrect (×) at species/strain levels. The Metrichor was able to identify the species (with some false positives) but not the strains in our samples
* K. quasipneumoniae ATCC 700603 strain was recently re-classified from K. pneumoniae as K. quasipneumoniae [49] but has not been updated in most major databases
Fig. 5Real-time species typing (a) and strain typing (b) of a clinical isolate directly from the MinION using our pipeline and the Metrichor service. The time includes basecalling timing
Fig. 6Schematic of a three-state probabilistic Finite State Machine