| Literature DB >> 29938134 |
Mami Tanaka1, Sayaka Mino1, Yoshitoshi Ogura2, Tetsuya Hayashi2, Tomoo Sawabe1.
Abstract
Whole genome sequence comparisons have become essential for establishing a robust scheme in bacterial taxonomy. To generalize this genome-based taxonomy, fast, reliable, and cost-effective genome sequencing methodologies are required. MinION, the palm-sized sequencer from Oxford Nanopore Technologies, enables rapid sequencing of bacterial genomes using minimal laboratory resources. Here we tested the ability of Nanopore sequences for the genome-based taxonomy of Vibrionaceae and compared Nanopore-only assemblies to complete genomes of five Rumoiensis clade species: Vibrio aphrogenes, V. algivorus, V. casei, V. litoralis, and V. rumoiensis. Comparison of overall genome relatedness indices (OGRI) and multilocus sequence analysis (MLSA) based on Nanopore-only assembly and Illumina or hybrid assemblies revealed that errors in Nanopore-only assembly do not influence average nucleotide identity (ANI), in silico DNA-DNA hybridization (DDH), G+C content, or MLSA tree topology in Vibrionaceae. Our results show that the genome sequences from Nanopore-based approach can be used for rapid species identification based on the OGRI and MLSA.Entities:
Keywords: Genome taxonomy; Nanopore; Vibrionaceae
Year: 2018 PMID: 29938134 PMCID: PMC6011873 DOI: 10.7717/peerj.5018
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Assembly results of the V. aphrogenes genome using reads from different platforms.
Resulting assemblies were evaluated using QUAST v4.5 (Gurevich et al., 2013).
| Assembly | PacBio | PacBio | Nanopore | Nanopore | MP+PE | PE |
|---|---|---|---|---|---|---|
| Assembler | HGAP | HGAP | Unicycler | Canu + | Platanus | Plataus |
| Number of contigs (≥0 bp) | 2 | 2 | 2 | 2 | 40 | 51 |
| Total length (bp) (≥0 bp) | 3,375,422 | 3,375,390 | 3,375,144 | 3,371,144 | 3,371,804 | 3,333,369 |
| Number of contigs (≥1 kb) | 2 | 2 | 2 | 2 | 2 | 23 |
| Total length (bp) (≥1 kb) | 3,375,422 | 3,375,390 | 3,375,144 | 3,371,144 | 3,360,281 | 3,322,746 |
| Genome fraction (%) | 100 | 100 | 99.987 | 100 | 99.512 | 98.433 |
| Number of N per kb | 0 | 0 | 0 | 0 | 0.3458 | 0.0767 |
| Mismatches per kb | 0 | 0.0003 | 0.0033 | 0.0240 | 0.0122 | 0.0069 |
| Indels per kb | 0 | 0.0095 | 0.0124 | 1.3569 | 0.0158 | 0.0160 |
Genome assemblies of five Rumoiensis clade species.
The genomes of five species were reconstructed in two different methods, hybrid assembly and Nanopore-only assembly. Indels and mismatches in Nanopore-only assemblies were determined based on the differences from the hybrid assemblies. MinION read data shown here are those obtained after debarcoding and adaptor trimming with Porechop 0.2.2 (https://github.com/rrwick/Porechop).
| Strain | |||||
|---|---|---|---|---|---|
| Total reads | 172,016 | 42,584 | 332,715 | 163,459 | 192,869 |
| Total bases | 938,094,776 | 467,570,099 | 2,237,368,370 | 772,425,123 | 837,603,769 |
| Average read length (bp) | 5,454 | 10,980 | 6,725 | 4,726 | 4,343 |
| Number of contigs (reference) | 2 | 2 | 5 | 3 | 4 |
| Number of contigs (Nanopore) | 3 | 2 | 3 | 4 | 5 |
| Total length (bp) (reference) | 3,648,612 | 3,375,144 | 4,140,771 | 3,872,238 | 4,207,152 |
| Total length (bp) (Nanopore) | 3,711,100 | 3,371,144 | 4,118,045 | 3,920,009 | 4,326,255 |
| Indels per kb | 0.59 | 1.36 | 0.41 | 0.83 | 0.77 |
| Mismatches per kb | 0.38 | 0.05 | 0.03 | 0.32 | 0.06 |
Figure 1Heatmap representation of ANI values using different assemblies.
ANI values were calculated using Orthologous Average Nucleotide Identity Tool version 1.3 (Lee et al., 2016) and the values represented here are the orthoANI values.
G+C content stability determined from different assemblies.
G+C contents of the hybrid, Nanopore-only, and Illumina-only assemblies were calculated, respectively.
| Strain | Hybrid | Nanopore | Illumina |
|---|---|---|---|
| 40.80 | 40.78 | 40.73 | |
| 42.13 | 42.17 | 42.06 | |
| 40.72 | 40.72 | 40.54 | |
| 42.01 | 41.96 | 41.94 | |
| 42.31 | 42.35 | 42.25 |
Evaluation of protein coding gene sequences for MLSA retrieved from Nanopore-only assembly.
Gene sequences from Nanopore-only assemblies were retrieved and compared with the genes from the hybrid assemblies.
| Error type | MLSA genes | |||||||
|---|---|---|---|---|---|---|---|---|
| Length (bp) | 1,218/435 | 996/636 | 2,439/588 | 1,044/507 | 735/369 | 1,047/486 | 993/378 | 2,628/420 |
| Mismatch | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 2/0 | 2/0 |
| Insertion | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 1/0 | 2/0 |
| Deletion | 0/0 | 0/0 | 0/0 | 2/0 | 1/0 | 0/0 | 1/0 | 1/0 |
| Length (bp) | 1,215/435 | 996/636 | 2,439/588 | 1,044/507 | 735/369 | 1,047/486 | 993/378 | 2,628/420 |
| Mismatch | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Insertion | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Deletion | 2/2 | 0/0 | 2/2 | 1/0 | 1/0 | 0/0 | 0/0 | 2/0 |
| Length (bp) | 1,218/435 | 996/636 | 2,439/588 | 1,044/507 | 732/369 | 1,041/486 | 993/378 | 2,628/420 |
| Mismatch | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Insertion | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Deletion | 2/0 | 0/0 | 2/0 | 1/0 | 1/0 | 0/0 | 0/0 | 2/0 |
| Length (bp) | 1,218/435 | 996/636 | 2,439/588 | 1,044/507 | 735/369 | 1,047/486 | 993/378 | 2,628/420 |
| Mismatch | 0/0 | 0/0 | 0/0 | 0/0 | 1/0 | 0/0 | 0/0 | 0/0 |
| Insertion | 0/0 | 0/0 | 1/0 | 0/0 | 1/0 | 0/0 | 0/0 | 0/0 |
| Deletion | 0/0 | 0/0 | 0/0 | 0/0 | 1/0 | 0/0 | 0/0 | 0/0 |
| Length (bp) | 1,215/435 | 996/636 | 2,439/588 | 1,044/507 | 735/369 | 1,047/486 | 993/378 | 2,628/420 |
| Mismatch | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Insertion | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 1/0 |
| Deletion | 0/0 | 1/1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
Notes.
Full length/MLSA region.