| Literature DB >> 33174830 |
Samuel Lipworth1, Hayleah Pickford1, Nicholas Sanderson2,1, Kevin K Chau1, James Kavanagh1, Leanne Barker1, Alison Vaughan2,1, Jeremy Swann3,1, Monique Andersson4, Katie Jeffery4, Marcus Morgan4, Timothy E A Peto2,1, Derrick W Crook2,1,4, Nicole Stoesser4,1, A Sarah Walker1,2.
Abstract
Hybrid assemblies are highly valuable for studies of Enterobacteriaceae due to their ability to fully resolve the structure of mobile genetic elements, such as plasmids, which are involved in the carriage of clinically important genes (e.g. those involved in antimicrobial resistance/virulence). The widespread application of this technique is currently primarily limited by cost. Recent data have suggested that non-inferior, and even superior, hybrid assemblies can be produced using a fraction of the total output from a multiplexed nanopore [Oxford Nanopore Technologies (ONT)] flowcell run. In this study we sought to determine the optimal minimal running time for flowcells when acquiring reads for hybrid assembly. We then evaluated whether the ONT wash kit might allow users to exploit shorter running times by sequencing multiple libraries per flowcell. After 24 h of sequencing, most chromosomes and plasmids had circularized and there was no benefit associated with longer running times. Quality was similar at 12 h, suggesting that shorter running times are likely to be acceptable for certain applications (e.g. plasmid genomics). The ONT wash kit was highly effective in removing DNA between libraries. Contamination between libraries did not appear to affect subsequent hybrid assemblies, even when the same barcodes were used successively on a single flowcell. Utilizing shorter run times in combination with between-library nuclease washes allows at least 36 Enterobacteriaceae isolates to be sequenced per flowcell, significantly reducing the per-isolate sequencing cost. Ultimately this will facilitate large-scale studies utilizing hybrid assembly, advancing our understanding of the genomics of key human pathogens.Entities:
Keywords: Enterobacteriaceae; Nanopore sequencing; bacterial genomics; hybrid assembly; long-read assembly
Mesh:
Substances:
Year: 2020 PMID: 33174830 PMCID: PMC7725331 DOI: 10.1099/mgen.0.000453
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Schematic representation of experiments performed, flowcells used and libraries sequenced. *The same 12 isolates were sequenced in both libraries 3 and 4, but with different barcodes, as shown in the inset table and Table 1.
Comparison of SNPs and indels detected by DNAdiff between hybrid assemblies of the same isolates sequenced in libraries 3 and 4
|
Isolate name |
Barcode library 4 |
Barcode library 3 |
SNPs |
Indels |
% identity |
Species |
MLST |
|---|---|---|---|---|---|---|---|
|
blc-44 |
1 |
10 |
9 |
64 |
99.99 |
|
420 |
|
blc-45 |
2 |
11 |
27 |
1 |
99.99 |
|
490 |
|
blc-46 |
3 |
12 |
36 |
2 |
99.98 |
|
372 |
|
blc-47 |
4 |
1 |
0 |
0 |
100 |
|
490 |
|
blc-48 |
5 |
2 |
0 |
0 |
100 |
|
490 |
|
blc-49 |
6 |
3 |
0 |
0 |
99.99 |
|
45 |
|
blc-50 |
7 |
4 |
0 |
0 |
100 |
|
127 |
|
blc-51 |
8 |
5 |
0 |
0 |
100 |
|
15 |
|
blc-52 |
9 |
6 |
0 |
1 |
99.99 |
|
428 |
|
blc-53 |
10 |
7 |
0 |
0 |
100 |
|
127 |
|
blc-54 |
11 |
8 |
1 |
5 |
99.99 |
|
88 |
|
blc-55 |
12 |
9 |
3 |
1 |
99.99 |
|
490 |
Fig. 2.Number of contigs generated by Unicycler for the nine included barcoded (bc) samples in library 1 over time (one isolate was excluded, see the Methods section). Complete assemblies (where the chromosome and all plasmids are formed of single circularized contigs) are shown in blue.
Fig. 3.Assembly likelihoods were calculated with the assembly likelihood estimator (ALE) by mapping Illumina reads to hybrid assemblies. Likelihoods were calculated for assemblies of each barcoded isolate (y-axis) created at 6 h intervals up to 48 h (x-axis). A likelihood difference of 0 (blue) implies that the assembly at time t is equally as likely as that at t=48 h. A positive likelihood difference (orange) implies that the assembly at time t was better than at 48 h and a negative likelihood difference (green) implies a worse assembly at time t vs 48 h.
Fig. 4.Output over time for flowcell 3 (libraries 3/4/5, left) and flowcell 1 (library 1, right). The ONT wash kit was used between libraries 3/4/5 on flowcell 3, whereas flowcell 1 was run for 48 h with no washing steps.
DNAdiff comparisons between Flye long read-only assemblies and hybrid assemblies
|
Library |
Isolate name |
Barcode |
SNPs |
Indels |
% reference bases aligned |
% identity |
|---|---|---|---|---|---|---|
|
1 |
blc-22 |
bc01 |
3385 |
14 679 |
99.95 |
99.55 |
|
1 |
blc-23 |
bc02 |
251 |
9953 |
100 |
99.76 |
|
1 |
blc-24 |
bc04 |
3202 |
15 949 |
99.98 |
99.58 |
|
1 |
blc-25 |
bc05 |
3387 |
12 262 |
100 |
99.53 |
|
1 |
blc-26 |
bc06 |
3082 |
11 494 |
99.96 |
99.53 |
|
1 |
blc-27 |
bc07 |
3203 |
11 764 |
99.98 |
99.56 |
|
1 |
blc-28 |
bc08 |
157 |
12 793 |
99.97 |
99.61 |
|
1 |
blc-29 |
bc09 |
3309 |
12 036 |
100 |
99.39 |
|
1 |
blc-31 |
bc11 |
4070 |
13 861 |
99.98 |
99.51 |
|
2 |
blc-33 |
bc02 |
3179 |
15 572 |
99.97 |
99.48 |
|
2 |
blc-34 |
bc03 |
4942 |
16 498 |
100 |
99.34 |
|
2 |
blc-35 |
bc04 |
5488 |
16 499 |
99.9 |
99.36 |
|
2 |
blc-36 |
bc05 |
5458 |
16 842 |
100 |
99.35 |
|
2 |
blc-37 |
bc06 |
5442 |
16 499 |
99.98 |
99.36 |
|
2 |
blc-38 |
bc07 |
5460 |
16 559 |
100 |
99.36 |
|
2 |
blc-39 |
bc08 |
4311 |
20 783 |
97.82 |
98.45 |
|
2 |
blc-40 |
bc09 |
5428 |
16 882 |
99.83 |
99.36 |
|
2 |
blc-41 |
bc10 |
3064 |
15 975 |
99.93 |
99.37 |
|
2 |
blc-42 |
bc11 |
2960 |
15 133 |
100 |
99.44 |
|
2 |
blc-43 |
bc12 |
5434 |
16 748 |
100 |
99.35 |
|
3 |
blc-47 |
bc01 |
4482 |
19 204 |
99.98 |
99.31 |
|
3 |
blc-48 |
bc02 |
4352 |
19 444 |
99.99 |
99.3 |
|
3 |
blc-49 |
bc03 |
3983 |
18 206 |
99.98 |
99.3 |
|
3 |
blc-50 |
bc04 |
2615 |
17 537 |
100 |
99.4 |
|
3 |
blc-51 |
bc05 |
3728 |
19 560 |
100 |
99.36 |
|
3 |
blc-52 |
bc06 |
2645 |
16 326 |
100 |
99.4 |
|
3 |
blc-53 |
bc07 |
2645 |
17 567 |
100 |
99.4 |
|
3 |
blc-48 |
bc08 |
2580 |
17 145 |
99.95 |
99.39 |
|
3 |
blc-49 |
bc09 |
4326 |
19 080 |
99.95 |
99.31 |
|
3 |
blc-50 |
bc10 |
2607 |
15 458 |
100 |
99.4 |
|
3 |
blc-51 |
bc11 |
4406 |
19 273 |
99.99 |
99.31 |
|
3 |
blc-52 |
bc12 |
2555 |
17 336 |
100 |
99.4 |
|
5 |
blc-56 |
bc01 |
4746 |
19 881 |
99.89 |
98.96 |
|
5 |
blc-57 |
bc02 |
4174 |
20 993 |
99.46 |
98.56 |
|
5 |
blc-58 |
bc03 |
4876 |
20 019 |
99.92 |
99.01 |
|
5 |
blc-59 |
bc04 |
4649 |
21 014 |
94.41 |
97.93 |
|
5 |
blc-61 |
bc06 |
3115 |
14 943 |
100 |
99.45 |
|
5 |
blc-62 |
bc07 |
4709 |
15 872 |
100 |
99.33 |
|
5 |
blc-63 |
bc08 |
4680 |
15 201 |
99.98 |
99.34 |
|
5 |
blc-64 |
bc09 |
3051 |
15 973 |
100 |
99.39 |
|
5 |
blc-65 |
bc10 |
4212 |
18 101 |
99.36 |
99.22 |
|
5 |
blc-66 |
bc12 |
4986 |
21 189 |
98 |
98.61 |