| Literature DB >> 33276723 |
Chentao Yang1, Yuxuan Zheng2, Shangjin Tan1, Guanliang Meng1, Wei Rao1, Caiqing Yang2, David G Bourne3,4,5, Paul A O'Brien3,4,5, Junqiang Xu1, Sha Liao1, Ao Chen1, Xiaowei Chen1, Xinrui Jia2, Ai-Bing Zhang6, Shanlin Liu7,8.
Abstract
BACKGROUND: Over the last decade, the rapid development of high-throughput sequencing platforms has accelerated species description and assisted morphological classification through DNA barcoding. However, the current high-throughput DNA barcoding methods cannot obtain full-length barcode sequences due to read length limitations (e.g. a maximum read length of 300 bp for the Illumina's MiSeq system), or are hindered by a relatively high cost or low sequencing output (e.g. a maximum number of eight million reads per cell for the PacBio's SEQUEL II system).Entities:
Keywords: Biodiversity; COI; DNA barcode; High-throughput sequencing; MGISEQ-2000; SE400
Mesh:
Substances:
Year: 2020 PMID: 33276723 PMCID: PMC7716423 DOI: 10.1186/s12864-020-07255-w
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Comparison of different strategies to access COI barcode using HTS platforms. The different experimental designs and adopted sequencing strategies fit for sequencing length capacity (a). For four main methods of previous studies, (i) and (ii) refer to (Meier, Wong, Srivathsan, & Foo, 2016), (Shokralla et al., 2015), respectively, while (iii) and (iv) refers to (Liu, Yang, Zhou, & Zhou, 2017). The HIFI-SE pipeline can easily and directly obtain the standard COI barcode by overlapping single-end 400 bp (b)
Fig. 2Results of Sanger sequencing (left semicircle) and HIFI-SE barcode assemblies (right semicircle) arranged in a 96-well plate in Plate #1. Gray represents failure; light and dark green represent success of Sanger and HIFI-SE respectively. Marine invertebrate samples are arranged in wells from A01 to F04 (framed by the red tetragon). Insects are arranged in wells from A05 to H12
Fig. 3Phylogenetic tree of 660 successful barcodes of moth, with outgroup Drosophila melanogaster. The red circle reveals samples containing fungi COI barcode, and blue for Wolbachia COI barcode. We obtained the taxonomic information of each sample according to that of its best hits on the BOLD database and it may suffer misidentification due to inaccurate records on the database. The phylogeny tree revealed that some specimens could be wrongly identified based on an inadequate database in specific linage. For example, the best hit of #035 in Plate #4 (green arrow) with 100% similarity in BOLD database belongs to Crambidae family, however, the second hit with 99.85% similarity belongs to Erebidae family. This type of incorrect placement is prone to occur among early-release records, which suggests a new record of specimen need to be carefully reviewed when add to a database, also indicating that morphological identification is still important
Fig. 4HIFI-SE barcode assembly pipeline. The colored bars from left to right represent tags, primers (purple for 5′ end and orange for 3′ end) and barcode sequences, respectively