Literature DB >> 34431763

Recovery of small plasmid sequences via Oxford Nanopore sequencing.

Ryan R Wick1, Louise M Judd1, Kelly L Wyres1, Kathryn E Holt1,2.   

Abstract

Oxford Nanopore Technologies (ONT) sequencing platforms currently offer two approaches to whole-genome native-DNA library preparation: ligation and rapid. In this study, we compared these two approaches for bacterial whole-genome sequencing, with a specific aim of assessing their ability to recover small plasmid sequences. To do so, we sequenced DNA from seven plasmid-rich bacterial isolates in three different ways: ONT ligation, ONT rapid and Illumina. Using the Illumina read depths to approximate true plasmid abundance, we found that small plasmids (<20 kbp) were underrepresented in ONT ligation read sets (by a mean factor of ~4) but were not underrepresented in ONT rapid read sets. This effect correlated with plasmid size, with the smallest plasmids being the most underrepresented in ONT ligation read sets. We also found lower rates of chimaeric reads in the rapid read sets relative to ligation read sets. These results show that when small plasmid recovery is important, ONT rapid library preparations are preferable to ligation-based protocols.

Entities:  

Keywords:  Oxford Nanopore sequencing; Plasmids; long-read sequencing; whole-genome sequencing

Mesh:

Substances:

Year:  2021        PMID: 34431763      PMCID: PMC8549360          DOI: 10.1099/mgen.0.000631

Source DB:  PubMed          Journal:  Microb Genom        ISSN: 2057-5858


Data Summary

Supplementary figures, tables, data and code can be found at: github.com/rrwick/Small-plasmid-Nanopore and bridges.monash.edu/articles/dataset/Small_plasmid_Nanopore_data/13543754. The authors confirm all supporting data, code and protocols have been provided within the article or through supplementary data files. Researchers who use Oxford Nanopore Technologies (ONT) platforms to sequence bacterial genomes can currently choose from two library preparation methods. The first is a ligation-based approach, which uses ligase to attach sequencing adapters to the ends of DNA molecules. The second is a rapid approach, which uses a transposase enzyme to cleave DNA and attach adapters in a single step. There are advantages to each preparation, for example ligation can produce better yields but rapid is a simpler procedure. Our study reveals another advantage of rapid preparations: they are more effective at sequencing small plasmids. We show that sequencing of ligation-based libraries yields fewer reads derived from small plasmids, making such plasmids harder to detect in bacterial genomes. Since small plasmids can contain clinically relevant genes, including antimicrobial resistance (AMR) or virulence determinants, their exclusion could lead to unreliable conclusions that have serious consequences for AMR surveillance and prediction. We therefore recommend that researchers performing ONT-only sequencing of bacterial genomes should consider using rapid preparations whenever small plasmid recovery is important.

Introduction

Plasmids are extra-chromosomal pieces of DNA present in many bacterial genomes [1, 2]. While smaller than the chromosome, they are important genomic components that can confer key phenotypic traits and contribute to gene flow within/between species. Most plasmids are circular, though linear plasmids also exist [3], and for some bacterial species it is common to find multiple plasmids in a single genome [4]. Plasmids come in a broad range of sizes, from <1 kbp to >300 kbp [5, 6]. For simplicity, here we categorise plasmids as ‘small’ or ‘large’ using a threshold of 20 kbp [7]. Both small and large plasmids can carry clinically relevant genes, such as virulence or antimicrobial resistance (AMR) genes [8-10]. Unlike many large plasmids, small plasmids do not encode their own conjugative transfer systems, but many do appear to be readily mobilizable between host cells, and are therefore similarly important for the transfer of genetic material and the spread of AMR [11]. Short-read sequencing (e.g. Illumina platforms) of bacterial genomes can typically only produce fragmented draft assemblies [12]. It is difficult to reconstruct plasmid sequences from short-read assemblies [13], and this impedes the ability to accurately monitor the spread of key genes in bacterial populations [14, 15]. In contrast, long-read sequencing (e.g. Pacific Biosciences and Oxford Nanopore Technologies platforms) allows for complete bacterial genome assembly, with each chromosome or plasmid assembled into a single contig [16, 17]. Oxford Nanopore Technologies (ONT) long-read platforms are especially well suited to bacterial genomics, as they are cost-effective and allow for easy multiplexing of samples, enabling sufficient data generation for simultaneous completion of 12 or more genomes at a cost of <100 USD each [18-20]. There are many genome assembly tools appropriate for use with ONT reads, some of which also use short reads for what is known as hybrid assembly [12, 21]. Other tools operate on ONT reads alone, an approach we will call long-read-only assembly [22-24]. While ONT sequencing offers many advantages in bacterial genomics, during a number of sequencing projects [18, 25–30], we have anecdotally observed that small plasmids are frequently absent from ONT long-read-only assemblies. This can have serious downstream consequences, e.g. if a small plasmid containing an AMR determinant is missing from the assembly, this can result in an incorrect antimicrobial susceptibility prediction (a failure to detect resistance), known as a ‘very major error’ as it can lead to prescribing an ineffective antimicrobial [31]. We hypothesised that this problem was due in part to our use of the ONT ligation-based library preparation (SQK-LSK109 kit). During extraction, DNA is incidentally fragmented, typically resulting in fragments ~10 kbp in size [32] onto which barcode and adapter sequences are attached via blunt-end ligation during library preparation. Large plasmids are likely to be fragmented into one or more linear pieces, but small plasmids may avoid fragmentation and remain circular. As such, the small plasmids will have no blunt ends, will not have any adapter ligated and will thus be unavailable for sequencing (Fig. 1). Upon sequencing of the library, this leads to underrepresentation of small plasmids in the resulting ONT read set, which may in turn cause the assembler to fail to produce contigs for them [24]. Deliberately increasing fragmentation before DNA preparation could mitigate this effect by creating smaller fragments, however this would result in shorter read lengths, which negates the benefits of long-read sequencing and can negatively impact the assembly contiguity [24, 33].
Fig. 1.

Conceptual illustration of Oxford Nanopore ligation and rapid sample preparation methods. When circular DNA is extracted from a bacterial cell (top-left), incidental fragmentation of the DNA occurs. The ligation preparation (bottom-left) comprises blunt-end ligation of barcodes/adapters onto DNA molecules, so circular pieces of DNA will not receive adapters and thus remain unavailable for sequencing. The rapid preparation (right) uses a transposome enzyme to add barcodes/adapters into the middle of DNA molecules, making both linear and circular DNA available for sequencing.

Conceptual illustration of Oxford Nanopore ligation and rapid sample preparation methods. When circular DNA is extracted from a bacterial cell (top-left), incidental fragmentation of the DNA occurs. The ligation preparation (bottom-left) comprises blunt-end ligation of barcodes/adapters onto DNA molecules, so circular pieces of DNA will not receive adapters and thus remain unavailable for sequencing. The rapid preparation (right) uses a transposome enzyme to add barcodes/adapters into the middle of DNA molecules, making both linear and circular DNA available for sequencing. The ONT rapid preparation kits (e.g. the SQK-RBK004 rapid barcoding kit) offer a potential solution. Unlike the ligation approach, the rapid approach uses a transposase enzyme to simultaneously cleave DNA and attach barcode/adapter sequences (Fig. 1). Since rapid preparations do not rely on blunt-end ligation, they should be active on sequences that are circular such as unfragmented small plasmids. Hence, in this study we compared the performance of ONT ligation and rapid library preparations for whole-genome sequencing of bacterial genomes containing small plasmids. Specifically, we aimed to quantify plasmid read depths and determine whether ONT rapid preparations give a more accurate representation of small plasmid abundance than ONT ligation preparations. To assess this, we used short-read Illumina sequencing, whose libraries are unbiased with respect to the starting length of the molecules, as the gold standard for quantifying plasmid abundance.

Methods

Bacterial isolates, DNA extraction and sequencing

We included seven bacterial isolates in this study (Table 1, Fig. S1a, available in the online version of this article), each containing small plasmids (identified from previous analyses) [34-37] and belonging to different bacterial species: , , , an unnamed species (given the placeholder name Haemophilus sp002998595 in GTDB R202) [38, 39], , and . These isolates were cultured overnight at 37 °C in Luria-Bertani broth and DNA was extracted using GenFind V3 according to the manufacturer’s instructions (Beckman Coulter) (Fig. S1b). The same DNA extract was used to sequence each isolate using three different approaches: ONT ligation, ONT rapid and Illumina (Fig. S1c). For ONT ligation, we followed the protocol for the SQK-LSK109 ligation sequencing kit and EXP-NBD104 native barcoding expansion (Oxford Nanopore Technologies). For ONT rapid, we followed the protocol for the SQK-RBK004 rapid barcoding kit (Oxford Nanopore Technologies). All ONT libraries were sequenced on MinION R9.4.1 flow cells. For Illumina, we followed a modified Illumina DNA Prep protocol (catalogue number 20018705), whereby the reaction volumes were quartered to conserve reagents. Illumina libraries were sequenced on the NovaSeq 6000 using SP reagent kits v1.0 (300 cycles, Illumina Inc.), producing 150 bp paired-end reads with a mean insert size of 331 bp. We repeated this process (from culture to sequencing) to generate a set of technical replicates. For the first technical replicate, a refuel (with the EXP-FLP002 flow cell priming kit) was performed at the 18 h point of the ONT runs to boost yield. However, no refuelling step was required for the second replicate. All ONT read sets were basecalled and demultiplexed using Guppy v3.6.1 (Fig. S1d, e). Basecalled reads (FASTQ format) are available in the supplementary data repository, github.com/rrwick/Small-plasmid-Nanopore.
Table 1.

Bacterial isolates used in this study. Each genome contained at least one large (≥20 kbp) and one small (<20 kbp) plasmid. Seven plasmids (indicated with *) contained one or more antimicrobial resistance determinants, and two plasmids (indicated with †) contained one or more virulence determinants.

Isolate species and name

Large plasmids (bp)

Small plasmids (bp)

Acinetobacter baumannii J9

145059*

6078*

Citrobacter koseri MINF_9D

64962*

9294

Enterobacter kobei MSB1_1B

136 482, 108 411

4665, 3715, 2369

Haemophilus M1C132_1

39 398

10 719, 9975*, 7392, 5675

Klebsiella oxytoca MSB1_2C

118 161†, 58 472

4574

Klebsiella variicola INF345

250 980, 243 620*, 31 780

5783, 3514

Serratia marcescens 17-147-1671

184 477*†, 161 385*

17406, 1934

Bacterial isolates used in this study. Each genome contained at least one large (≥20 kbp) and one small (<20 kbp) plasmid. Seven plasmids (indicated with *) contained one or more antimicrobial resistance determinants, and two plasmids (indicated with †) contained one or more virulence determinants. Isolate species and name Large plasmids (bp) Small plasmids (bp) J9 145059* 6078* MINF_9D 64962* 9294 MSB1_1B 136 482, 108 411 4665, 3715, 2369 M1C132_1 39 398 10 719, 9975*, 7392, 5675 MSB1_2C 118 161†, 58 472 4574 INF345 250 980, 243 620*, 31 780 5783, 3514 17-147-1671 184 477*†, 161 385* 17406, 1934

Reference genome assembly

To assign reads to their replicon of origin, we required a reference assembly for each of the seven genomes in the study. We first produced separate assemblies for each genome and technical replicate (14 assemblies in total) using pooled reads from each sequencing run (ONT ligation, ONT rapid and Illumina) (Fig. S1f). We then merged the two replicate assemblies for each genome to produce a single final assembly (Fig. S1g). Illumina read QC and trimming was performed by fastp v0.20.1 [40] using default parameters. ONT read QC was performed by Filtlong v0.2.0 using no external reference, a minimum read length of 1 kbp, a minimum mean quality of 80 and a minimum window quality of 60 (quality values refer to Filtlong percent identity estimates based on Phred scores). We used Trycycler v0.3.3 [41] to produce a consensus long-read assembly for each isolate from 15 input assemblies (five Flye v2.8 [23] assemblies, five miniasm/Minipolish v0.3/v0.1.3 [42] assemblies and five Raven v1.1.10 [43] assemblies), each made using independent sets of randomly subsampled ONT reads of 50× depth. Trycycler consensus assemblies were then polished with Medaka v1.0.3 [44] and Pilon v1.23 [45]. For each genome, the two independent assemblies (one from each technical replicate) were compared using edlib [46]. Wherever differences were found, we used IGV [47] to visually inspect Illumina read alignments (generated with Bowtie2) [48] and ONT read alignments (generated with minimap2) [49] for the region in question to assess whether the difference indicated an assembly error (indicated by a drop in Illumina read depth), and we manually repaired such errors as appropriate. Only one genuine sequence difference was found between the two technical replicates: a 1 bp indel in the smallest plasmid of MSB1_1B. Two plasmids in M1C132_1 only appeared in assemblies for the second technical replicate, as they were missing from all ONT and Illumina read sets in the first replicate, suggesting that they were lost during culturing. A plasmid in 17-147–1671 occurred in two variants (a 17.4 kbp version with one copy of IS4321 and an 18.7 kbp version with two copies of IS4321) [50], both of which occurred in both technical replicates. After curation, we merged the assemblies of the two technical replicates to make a single reference assembly for each of the seven isolates, including the two plasmids that only appeared in the second technical replicate and all versions of plasmids that contained variation. The resultant genomes contained a total of seven chromosomes and 26 plasmids (Table 1, Fig. S2). Using a size threshold of 20 kbp, 14 of the plasmids were classified as ‘small’ and 12 as ‘large’. We screened plasmids for AMR genes using Kleborate v2.0.1 (uses the CARD database) [51] and for virulence genes using the VFDB [52]. Seven of the plasmids (five large and two small) contained one or more antimicrobial resistance genes and two of the plasmids (both large) contained one or more virulence genes (Table S1). The reference assembly sequences (FASTA format) are available in the supplementary data repository, github.com/rrwick/Small-plasmid-Nanopore.

Comparison of ONT library preparation methods

To aid our read-based analyses, we developed custom Python scripts (available in the supplementary data repository, github.com/rrwick/Small-plasmid-Nanopore). We aligned each of the four ONT read sets (ligation run 1, rapid run 1, ligation run 2 and rapid run 2) to each sequence in the merged reference assemblies using minimap2 v2.17 (using the map-ont preset) and the align_reads.py script, which enabled alignment over the start-end junction of circular replicons. These alignments were processed with the assign_reads.py script, which assigned each read to a reference sequence by labelling each position of the read with the reference to which it best aligned (using the following filters: alignment identity ≥75 %, alignment length ≥100 bp, mean read identity ≥80 %, mean read coverage ≥50 %). This script also gave a demultiplexing status to each read: correct (Guppy demultiplexing agreed with reference alignment), incorrect (Guppy demultiplexing disagreed with reference alignment), unclassified (the read was not demultiplexed by Guppy), unaligned (the read did not align to the reference sequences) or chimaera (the read aligned to multiple different reference sequences). The get_depths.py script was used to calculate per-replicon ONT read depths by taking the mean depth across each replicon, excluding repetitive regions common to multiple replicons (identified by cross-replicon minimap2 alignments) as such repeats could make for unreliable alignments. Finally, we aligned each Illumina read set (after QC and trimming with fastp) to its respective reference genome using Bowtie2 v2.3.4.1 and used the get_depths.py script to calculate per-replicon Illumina read depths, again excluding repetitive regions common to multiple replicons. Plasmid read depths were normalised to their corresponding chromosomal read depth (e.g. a plasmid with a depth twice that of the chromosomal depth had a normalised read depth of two), providing a quantification of plasmid abundance that could be compared between read sets. Read depths were calculated separately for the two technical replicates, as plasmid copy number could differ between DNA extractions. This resulted in three normalised read depths for each plasmid in each technical replicate: ONT ligation, ONT rapid and Illumina (Table S1). The on-bead tagmentation process in the Illumina DNA Prep produces libraries with a fragment size of ~300–400 bp, which is considerably smaller than the smallest replicon in our genomes. Therefore, we assumed that unlike long-read ONT sequencing, Illumina sequencing is not significantly biased by replicon size. Since Illumina sequencing is known to have biases regarding GC content [53], we examined this effect in our data by calculating read depth and GC content for each one kbp sequence window in the chromosomes of the seven study genomes (using the depth_and_gc.py script). By plotting the read depth (normalised to the mean depth for 50 % GC windows) vs GC content, we estimated that fluctuations in GC content resulted in <10 % variation of Illumina read depth (Fig. S3). We therefore assume that our normalised Illumina read depth values are a good approximation of the true copy number of each replicon, without adjusting for GC content.

Results and discussion

Sequencing data characteristics

The four ONT sequencing runs (ligation run 1, rapid run 1, ligation run 2 and rapid run 2) yielded a total of 15.2 Gbp, 4.3 Gbp, 8.0 Gbp and 9.6 Gbp data, respectively. Per-barcode ONT yields ranged from 64 Mbp to 2.8 Gbp, equating to mean read depths of 10× to 1214× (Table S1). Per-barcode ONT N50 read lengths ranged from 1.9 kbp to 25.8 kbp. Per-barcode Illumina yields ranged from 313 Mbp to 1.06 Gbp (40× to 232× depth) (Table S1). Since we used pooled ONT read sets (both ligation and rapid) to perform assemblies, all isolates had sufficient data to produce reliable reference genomes. The variation in ONT read depth across each replicon was low and did not depend on the library preparation: the mean within-replicon read depth coefficient of variation was 0.10 for both ONT rapid and ONT ligation read sets (0.18 for Illumina read sets) (Table S1). The densities of read-start sites across each replicon were largely compatible with Poisson distributions, showing that most read positions are random (Fig. S4). However, ligation read sets contained more regions where read-start densities deviated from Poisson distributions, suggesting that some sequence regions can be more prone to fragmentation than others during the ligation preparation. This was particularly evident for the M1C132_1 and 17-147-1671 genomes. While previous studies have shown that ligation preparations favour read count and rapid preparations favour read length [54], we observed no clear trends (Table S1, Fig. S5). Many factors influence these metrics, including the quality and quantity of extracted DNA, the number of available pores on the flow cell, and the incubation times during library preparation steps. Our results show that for multiplexed bacterial whole-genome sequencing, good yields and read lengths (>100× depth and >15 kbp N50) are possible from either type of library preparation. Library preparation type also did not seem to affect the sequence accuracy of reads. All runs had a maximum read identity of ~98 %, but two of the runs (ligation run 1 and rapid run 2) had a larger proportion of low-identity reads (22 and 23 % of reads at <90 % identity vs 15 and 16 % for ligation run 2 and rapid run 1, respectively) (Fig. S6). These runs suffered from degradation in translocation speed (the rate at which DNA moves through the nanopore) over the course of the run (Fig. S7), which may partly explain their lower accuracy [55]. When this problem occurs, it can be mitigated by refuelling an in-progress run using ONT’s EXP-FLP002 kit, as we did for both ligation run 1 and rapid run 1. The beneficial effect of refuelling on read accuracy was particularly notable for ligation run 1 but was negligible for rapid run 1 (Fig. S8). Demultiplexing accuracy was also inconsistent between preparation types, with both the best (0.39 %) and worst (3.87 %) demultiplexing error rates occurring in rapid runs, compared to error rates of 2.22 and 2.92 % for the ligation runs (Table S1). During ligation preparations, barcode sequences are attached on both ends of the reads, while rapid preparations result in barcode attachment to the start of the reads only. This gives users of ligation preparations the option of running Guppy with the --require_barcodes_both_ends option (not used in this study) to increase demultiplexing accuracy when needed but with a larger proportion of unclassified reads [56].

Chimaeric read rates

The rate of chimaeras (reads originating from two or more discontiguous pieces of DNA) was notably different between the two preparations: 1.41 and 0.88% chimaeric reads for the two ligation runs vs 0.03 and 0.14 % for the two rapid runs (Table S1). There are two broad categories of chimaeric reads: in silico chimaeras and ligated chimaeras [57]. In silico chimaeras occur when two separate pieces of DNA pass through a pore in quick succession such that the sequencing software mistakes them as a single read. Thus in silico chimaeras can potentially happen for either type of preparation. Ligated chimaeras occur when two separate pieces of DNA are physically joined before sequencing. Since only the ligation preparation involves ligase, ligated chimaeras should be comparatively rare in rapid preparations, which may explain our observation that rapid preparations have fewer chimaeric reads overall. Our results suggest that rapid preparations are preferable when chimaeric reads need to be minimised, but we note that this conclusion is derived from small sample sizes (n=2 for each preparation). Additionally, in the context of bacterial whole-genome assembly, we have previously shown that chimaeric read rates of up to ~5 % (i.e. exceeding those of our sequencing runs) do not impact assembly quality [24].

Small plasmid abundance

For each assembled plasmid, we calculated a normalised depth ratio: its normalised ONT read depth (i.e. ONT plasmid depth relative to the chromosome) divided by its normalised Illumina read depth (i.e. Illumina plasmid depth relative to the chromosome). Ratios greater than one indicate the plasmid was overrepresented in ONT reads relative to Illumina reads, and ratios less than one indicate the plasmid was underrepresented. Fig. 2 shows the relationship between normalised depth ratios and plasmid size for ligation and rapid ONT preparations.
Fig. 2.

Plasmid abundance resulting from (a) ligation and (b) rapid ONT library preparation methods. Each point in the plots represents one plasmid, with circles for plasmids in the first technical replicate and triangles for plasmids in the second technical replicate. The read depth ratio is the normalised ONT read depth divided by the normalised Illumina read depth. The dashed lines at ratio=1 indicate perfect agreement of plasmid depths between ONT and Illumina data. Points above the dashed lines indicate plasmids that are overrepresented in ONT reads, while points below the dashed lines indicate plasmids that are underrepresented in ONT reads. For ONT ligation reads (a), small plasmids are systematically underrepresented relative to Illumina reads. For ONT rapid reads (b), plasmid size has no clear effect, and depths for both small (<20 kbp) and large plasmids (≥20 kbp) are in good agreement with Illumina reads.

Plasmid abundance resulting from (a) ligation and (b) rapid ONT library preparation methods. Each point in the plots represents one plasmid, with circles for plasmids in the first technical replicate and triangles for plasmids in the second technical replicate. The read depth ratio is the normalised ONT read depth divided by the normalised Illumina read depth. The dashed lines at ratio=1 indicate perfect agreement of plasmid depths between ONT and Illumina data. Points above the dashed lines indicate plasmids that are overrepresented in ONT reads, while points below the dashed lines indicate plasmids that are underrepresented in ONT reads. For ONT ligation reads (a), small plasmids are systematically underrepresented relative to Illumina reads. For ONT rapid reads (b), plasmid size has no clear effect, and depths for both small (<20 kbp) and large plasmids (≥20 kbp) are in good agreement with Illumina reads. For ONT ligation reads, there was a clear relationship between the normalised depth ratio and plasmid size (P=1.1×10−10, τ=0.64, from a Kendall rank correlation test) (Fig. 2a). ONT rapid read depths showed no such relationship (P=0.98, τ=0.0025) (Fig. 2b). Specifically, ONT ligation reads tended to underrepresent small plasmids (<20 kbp), for which the mean normalised depth ratio was 25 % (i.e. small plasmids produced ~4× fewer ONT ligation reads than one would expect based on Illumina read depths). The most extreme case was for the 2.4 kbp plasmid in the second technical replicate of MSB1_1B, which had a normalised depth ratio of <1 %, i.e. it was underrepresented by a factor of more than 100. Very large plasmids (>100 kbp) did not suffer from underrepresentation in ONT ligation read sets, presumably because they were likely to be linearised via fragmentation. The effect of plasmid size on normalised depth ratio in the ligation reads was stronger for the second technical replicate, i.e. plasmids in the second technical replicate were especially underrepresented in ONT ligation read sets. This is concordant with the fact that the read N50 was shorter in ligation run 1 and longer in ligation run 2 (8.2 kbp vs 20.9 kbp) (Table S1, Fig. S5), as shorter reads imply more DNA fragmentation, thus increasing the chance that a circular plasmid will be linearised and sequenced. These results suggest that both ONT ligation and ONT rapid preparations provide an accurate representation of abundance for very large plasmids (>100 kbp), but only the ONT rapid preparation accurately represents abundance across the full spectrum of plasmid sizes. While these conclusions assume that Illumina reads accurately represent plasmid abundance, this is supported by the fact that Illumina and ONT rapid plasmid read depths are in good agreement. As our main goal was to understand the effect of library preparation on plasmid results, we kept the DNA extraction method constant for all experiments (Beckman Coulter’s GenFind V3). However, it is possible that different DNA extraction methods could affect the capture of small plasmids also, which could be explored in future studies. The bias against small plasmids during ligation-based library preparations could result in the exclusion of small plasmids from genome assemblies, particularly when performing long-read-only assembly (as opposed to hybrid assembly where Illumina reads are also available) and assembling read sets of modest depth. For example, if a genome containing a four-copies-per-cell small plasmid was sequenced with a chromosomal read depth of 25×, the plasmid sequence should be present at ~100× depth. However, if that small plasmid was underrepresented by a factor of 30 (a realistic possibility based on our data, see Fig. 2a), it might only be sequenced at ~3× depth, making it unlikely to appear in an assembly for that genome.

Conclusions

When faced with the choice of ligation or rapid ONT preparations, researchers must weigh their respective advantages. Ligation kits are versatile and can give greater yields. Rapid kits are faster, require fewer additional resources and can provide longer read lengths (if optimised for in DNA preparation) [54]. Our study reveals two more advantages of the rapid kits: lower chimaeric read rates and better recovery of small plasmids. Rapid preparations are therefore more likely to produce long-read-only assemblies which include all replicons, small plasmids included. Click here for additional data file. Click here for additional data file.
  51 in total

1.  Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2016-03-19       Impact factor: 6.937

2.  Beta-lactam resistance in Haemophilus parasuis Is mediated by plasmid pB1000 bearing blaROB-1.

Authors:  Alvaro San Millan; Jose Antonio Escudero; Ana Catalan; Silvia Nieto; Fidel Farelo; Magdalena Gibert; Miguel Angel Moreno; Lucas Dominguez; Bruno Gonzalez-Zorn
Journal:  Antimicrob Agents Chemother       Date:  2007-04-16       Impact factor: 5.191

3.  Integrative genomics viewer.

Authors:  James T Robinson; Helga Thorvaldsdóttir; Wendy Winckler; Mitchell Guttman; Eric S Lander; Gad Getz; Jill P Mesirov
Journal:  Nat Biotechnol       Date:  2011-01       Impact factor: 54.908

4.  ISfinder: the reference centre for bacterial insertion sequences.

Authors:  P Siguier; J Perochon; L Lestrade; J Mahillon; M Chandler
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

5.  Plasmid flux in Escherichia coli ST131 sublineages, analyzed by plasmid constellation network (PLACNET), a new method for plasmid reconstruction from whole genome sequences.

Authors:  Val F Lanza; María de Toro; M Pilar Garcillán-Barcia; Azucena Mora; Jorge Blanco; Teresa M Coque; Fernando de la Cruz
Journal:  PLoS Genet       Date:  2014-12-18       Impact factor: 5.917

6.  Convergence of virulence and MDR in a single plasmid vector in MDR Klebsiella pneumoniae ST15.

Authors:  Margaret M C Lam; Kelly L Wyres; Ryan R Wick; Louise M Judd; Aasmund Fostervold; Kathryn E Holt; Iren Høyland Löhr
Journal:  J Antimicrob Chemother       Date:  2019-05-01       Impact factor: 5.790

7.  Genomic surveillance for hypervirulence and multi-drug resistance in invasive Klebsiella pneumoniae from South and Southeast Asia.

Authors:  Kelly L Wyres; To N T Nguyen; Margaret M C Lam; Louise M Judd; Nguyen van Vinh Chau; David A B Dance; Margaret Ip; Abhilasha Karkey; Clare L Ling; Thyl Miliya; Paul N Newton; Nguyen Phu Huong Lan; Amphone Sengduangphachanh; Paul Turner; Balaji Veeraraghavan; Phat Voong Vinh; Manivanh Vongsouvath; Nicholas R Thomson; Stephen Baker; Kathryn E Holt
Journal:  Genome Med       Date:  2020-01-16       Impact factor: 11.117

8.  Rapid, multiplexed, whole genome and plasmid sequencing of foodborne pathogens using long-read nanopore technology.

Authors:  Tonya L Taylor; Jeremy D Volkening; Eric DeJesus; Mustafa Simmons; Kiril M Dimitrov; Glenn E Tillman; David L Suarez; Claudio L Afonso
Journal:  Sci Rep       Date:  2019-11-08       Impact factor: 4.379

9.  CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database.

Authors:  Brian P Alcock; Amogelang R Raphenya; Tammy T Y Lau; Kara K Tsang; Mégane Bouchard; Arman Edalatmand; William Huynh; Anna-Lisa V Nguyen; Annie A Cheng; Sihan Liu; Sally Y Min; Anatoly Miroshnichenko; Hiu-Ki Tran; Rafik E Werfalli; Jalees A Nasir; Martins Oloni; David J Speicher; Alexandra Florescu; Bhavya Singh; Mateusz Faltyn; Anastasia Hernandez-Koutoucheva; Arjun N Sharma; Emily Bordeleau; Andrew C Pawlowski; Haley L Zubyk; Damion Dooley; Emma Griffiths; Finlay Maguire; Geoff L Winsor; Robert G Beiko; Fiona S L Brinkman; William W L Hsiao; Gary V Domselaar; Andrew G McArthur
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

10.  Oxford Nanopore MinION Sequencing Enables Rapid Whole Genome Assembly of Rickettsia typhi in a Resource-Limited Setting.

Authors:  Ivo Elliott; Elizabeth M Batty; Damien Ming; Matthew T Robinson; Pruksa Nawtaisong; Mariateresa de Cesare; Paul N Newton; Rory Bowden
Journal:  Am J Trop Med Hyg       Date:  2020-02       Impact factor: 2.345

View more
  13 in total

1.  A high-throughput multiplexing and selection strategy to complete bacterial genomes.

Authors:  Sergio Arredondo-Alonso; Anna K Pöntinen; François Cléon; Rebecca A Gladstone; Anita C Schürch; Pål J Johnsen; Ørjan Samuelsen; Jukka Corander
Journal:  Gigascience       Date:  2021-12-09       Impact factor: 6.524

2.  Genome Sequences of 18 Salmonella enterica Serotype Hadar Strains Collected from Patients in the United States.

Authors:  Hattie E Webb; Justin Y Kim; Kaitlin A Tagg; Fernando de la Cruz; Arancha Peñil-Celis; Beth Tolar; Zachary Ellison; Colin Schwensohn; Joshua Brandenburg; Megin Nichols; Jason P Folster
Journal:  Microbiol Resour Announc       Date:  2022-08-29

3.  Prediction of Antibiotic Susceptibility Profiles of Vibrio cholerae Isolates From Whole Genome Illumina and Nanopore Sequencing Data: CholerAegon.

Authors:  Valeria Fuesslin; Sebastian Krautwurst; Akash Srivastava; Doris Winter; Britta Liedigk; Thorsten Thye; Silvia Herrera-León; Shirlee Wohl; Jürgen May; Julius N Fobil; Daniel Eibach; Manja Marz; Kathrin Schuldt
Journal:  Front Microbiol       Date:  2022-06-22       Impact factor: 6.064

Review 4.  Plasmid co-infection: linking biological mechanisms to ecological and evolutionary dynamics.

Authors:  Claudia Igler; Jana S Huisman; Berit Siedentop; Sebastian Bonhoeffer; Sonja Lehtinen
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-11-29       Impact factor: 6.237

5.  Polypolish: Short-read polishing of long-read bacterial genome assemblies.

Authors:  Ryan R Wick; Kathryn E Holt
Journal:  PLoS Comput Biol       Date:  2022-01-24       Impact factor: 4.475

6.  Genome diversity of domesticated Acinetobacter baumannii ATCC 19606T strains.

Authors:  Irene Artuso; Massimiliano Lucidi; Daniela Visaggio; Giulia Capecchi; Gabriele Andrea Lugli; Marco Ventura; Paolo Visca
Journal:  Microb Genom       Date:  2022-01

7.  Comparing Long-Read Assemblers to Explore the Potential of a Sustainable Low-Cost, Low-Infrastructure Approach to Sequence Antimicrobial Resistant Bacteria With Oxford Nanopore Sequencing.

Authors:  Ian Boostrom; Edward A R Portal; Owen B Spiller; Timothy R Walsh; Kirsty Sands
Journal:  Front Microbiol       Date:  2022-03-03       Impact factor: 5.640

8.  Novel canine high-quality metagenome-assembled genomes, prophages and host-associated plasmids provided by long-read metagenomics together with Hi-C proximity ligation.

Authors:  Anna Cuscó; Daniel Pérez; Joaquim Viñes; Norma Fàbregas; Olga Francino
Journal:  Microb Genom       Date:  2022-03

9.  Automated antimicrobial susceptibility testing and antimicrobial resistance genotyping using Illumina and Oxford Nanopore Technologies sequencing data among Enterobacteriaceae.

Authors:  Rick Conzemius; Yehudit Bergman; Peter Májek; Stephan Beisken; Shawna Lewis; Emily B Jacobs; Pranita D Tamma; Patricia J Simner
Journal:  Front Microbiol       Date:  2022-08-08       Impact factor: 6.064

10.  Trycycler: consensus long-read assemblies for bacterial genomes.

Authors:  Ryan R Wick; Louise M Judd; Louise T Cerdeira; Jane Hawkey; Guillaume Méric; Ben Vezina; Kelly L Wyres; Kathryn E Holt
Journal:  Genome Biol       Date:  2021-09-14       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.