| Literature DB >> 32987804 |
Jia Jin Marc Chang1, Yin Cheong Aden Ip1, Chin Soon Lionel Ng1,2, Danwei Huang1,2.
Abstract
Since the release of the MinION sequencer in 2014, it has been applied to great effect in the remotest and harshest of environments, and even in space. One of the most common applications of MinION is for nanopore-based DNA barcoding in situ for species identification and discovery, yet the existing sample capability is limited (n ≤ 10). Here, we assembled a portable sequencing setup comprising the BentoLab and MinION and developed a workflow capable of processing 32 samples simultaneously. We demonstrated this enhanced capability out at sea, where we collected samples and barcoded them onboard a dive vessel moored off Sisters' Islands Marine Park, Singapore. In under 9 h, we generated 105 MinION barcodes, of which 19 belonged to fresh metazoans processed immediately after collection. Our setup is thus viable and would greatly fortify existing portable DNA barcoding capabilities. We also tested the performance of the newly released R10.3 nanopore flow cell for DNA barcoding, and showed that the barcodes generated were ~99.9% accurate when compared to Illumina references. A total of 80% of the R10.3 nanopore barcodes also had zero base ambiguities, compared to 50-60% for R9.4.1, suggesting an improved homopolymer resolution and making the use of R10.3 highly recommended.Entities:
Keywords: Oxford Nanopore Technologies (ONT); cytochrome c oxidase subunit I (COI); marine biodiversity; metazoa; next-generation sequencing (NGS); portable sequencing
Year: 2020 PMID: 32987804 PMCID: PMC7598690 DOI: 10.3390/genes11101121
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Schematic representation of our in situ field sequencing workflow (right), from sample to sequence, compared with a typical laboratory-based next-generation sequencing barcoding workflow (left).
Figure 2DNA barcoding performed in situ at Sisters’ Islands Marine Park, Singapore on 15 July 2020. Examples of samples collected via SCUBA: (A) HS0171, Phyllidia ocellata; (B) HS0179, Cenometra bella. (C) Samples were processed onboard and (D) barcoded using the BentoLab and MinION.
Figure 3Representatives of sampled phyla in this study. Scale bars represent 1 cm. Phylum Mollusca: (A) HS0097, Pleurobranchus forskalii; (B) HS0074, Erronea ovum; (C) HS0147, Spondylus sp.; (D) HS0112, Batillaria zonalis; (E) HS0009, Chromodoris lineolata; (F) HS0148, Crosslandia daedali; (G) HS0067, Phyllidiella rudmani. Phylum Echinodermata: (H) HS0133, Salmacis sphaeroides; (I) HS0071, Ophiuroidea sp. Phylum Porifera: (J) HS0050, Pseudoceratina sp. Phylum Arthropoda: (K) HS0038, Lophozozymus pictor; (L) HS0042, Gonodactylellus viridis; (M) HS0096, Majoidea sp.; (N) HS0087, Amphipoda sp.; (O) HS0043, Alpheidae sp. Phylum Platyhelminthes: (P) HS0069, Pseudoceros sp 6; (Q) HS0039, Pseudobiceros bedfordi. Phylum Sipuncula: (R) HS0014, Phascolosoma sp. Phylum Annelida: (S) HS0143, Leocrates sp.; (T) HS0076, Polynoidae sp. Phylum Cnidaria: (U) HS0072, Alcyonacea sp.; (V) HS0145, Discosoma sp. Phylum Chordata: (W) HS0031, Cryptocentrus leptocephalus; (X) HS0134, Platycephalidae sp.; (Y) HS0064, Aeoliscus strigatus.
MinION reads and barcodes obtained among datasets for each flow cell and basecalling model. The number of error-corrected barcodes was the same regardless of the namino setting used. Clean consolidated barcodes refer to remaining number of consolidated barcodes post-contamination check.
| R9.4.1_Fast | R9.4.1_HAC | R10.3_Fast_ST | R10.3_HAC_ST | R10.3_Fast_SR | R10.3_HAC_SR | |
|---|---|---|---|---|---|---|
|
| 1,056,403 | 1,056,403 | 512,000 | 512,000 | 1,060,000 | 1,060,000 |
|
| 115,833 (11.0) | 161,376 (15.3) | 50,203 (9.8) | 109,955 (21.5) | 121,579 (11.5) | 264,501 (25.0) |
|
| 11–36,925 | 11–49,990 | 10–2517 | 11–5086 | 10–6037 | 10–12,221 |
|
| 125/101 | 126/111 | 115/92 | 121/114 | 122/101 | 128/117 |
|
| 101 | 111 | 92 | 114 | 101 | 117 |
|
| 97 | 110 | 90 | 113 | 99 | 115 |
|
| 98 | 110 | 91 | 113 | 100 | 115 |
|
| 86 | 104 | 83 | 111 | 92 | 113 |
|
| 79 | 96 | 75 | 102 | 84 | 103 |
Sequencing accuracy (A) and gaps (G) observed when comparing the overlapping number (N) of MinION barcodes with Illumina references.
| R9.4.1_Fast | R9.4.1_HAC | R10.3_Fast_ST | R10.3_HAC_ST | R10.3_Fast_SR | R10.3_HAC_SR | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| G | A (%) |
| G | A (%) |
| G | A (%) |
| G | A (%) |
| G | A (%) |
| G | A (%) |
|
| 65 | 198 | 99.9800 | 74 | 99 | 100.0000 | 62 | 202 | 99.9843 | 76 | 31 | 99.9958 | 69 | 233 | 99.9812 | 77 | 33 | 100.0000 |
|
| 65 | 119 | 99.9801 | 74 | 50 | 100.0000 | 62 | 121 | 99.9480 | 76 | 7 | 100.0000 | 69 | 125 | 99.9673 | 77 | 9 | 99.9834 |
|
| 62 | 0 | 99.9428 | 73 | 1 | 99.9516 | 61 | 0 | 99.9315 | 75 | 0 | 99.9914 | 68 | 4 | 99.9149 | 76 | 0 | 99.9916 |
|
| 62 | 0 | 99.9479 | 73 | 1 | 99.9736 | 61 | 0 | 99.9525 | 75 | 0 | 99.9957 | 68 | 2 | 99.9479 | 76 | 0 | 100.0000 |
|
| 62 | 2 | 99.9791 | 73 | 1 | 99.9780 | 61 | 0 | 99.9524 | 75 | 0 | 99.9957 | 68 | 4 | 99.9668 | 76 | 0 | 100.0000 |
|
| 63 | 1 | 99.9387 | 73 | 0 | 99.9780 | 61 | 4 | 99.8735 | 75 | 0 | 99.9914 | 68 | 0 | 99.9339 | 76 | 0 | 99.9620 |
|
| 63 | 1 | 99.9488 | 73 | 0 | 99.9912 | 61 | 4 | 99.9101 | 75 | 0 | 100.0000 | 68 | 0 | 99.9479 | 76 | 0 | 99.9831 |
|
| 63 | 1 | 99.9589 | 73 | 0 | 99.9912 | 61 | 5 | 99.9204 | 75 | 0 | 100.0000 | 68 | 0 | 99.9525 | 76 | 0 | 99.9831 |
|
| 55 | 0 | 99.9532 | 70 | 0 | 99.9679 | 58 | 0 | 99.9003 | 73 | 0 | 99.9912 | 64 | 0 | 99.9599 | 74 | 0 | 99.9913 |
|
| 55 | 0 | 99.9648 | 70 | 0 | 99.9862 | 58 | 0 | 99.9223 | 73 | 0 | 99.9956 | 64 | 0 | 99.9649 | 74 | 0 | 100.0000 |
|
| 55 | 0 | 99.9707 | 70 | 0 | 99.9862 | 58 | 0 | 99.9333 | 73 | 0 | 99.9956 | 64 | 0 | 99.9648 | 74 | 0 | 100.0000 |
Figure 4Percentage of ambiguous bases (%) for the three types of error-corrected MinION barcodes: (A) MAFFT + AA, (B) RACON + AA, and (C) consolidated. Colors represent the type of flow cell used (R9.4.1 or R10.3), along with the basecalling model applied (Fast or HAC). For the R10.3 datasets, we generated a subset for the same sequencing time (ST) as R9.4.1, and another dataset for the same number of reads (SR) as R9.4.1. Note that the y-axis was scaled using pseudo-log2 transformation for better representation.