Literature DB >> 24479510

VIP Barcoding: composition vector-based software for rapid species identification based on DNA barcoding.

Long Fan1, Jerome H L Hui, Zu Guo Yu, Ka Hou Chu.   

Abstract

Species identification based on short sequences of DNA markers, that is, DNA barcoding, has emerged as an integral part of modern taxonomy. However, software for the analysis of large and multilocus barcoding data sets is scarce. The Basic Local Alignment Search Tool (BLAST) is currently the fastest tool capable of handling large databases (e.g. >5000 sequences), but its accuracy is a concern and has been criticized for its local optimization. However, current more accurate software requires sequence alignment or complex calculations, which are time-consuming when dealing with large data sets during data preprocessing or during the search stage. Therefore, it is imperative to develop a practical program for both accurate and scalable species identification for DNA barcoding. In this context, we present VIP Barcoding: a user-friendly software in graphical user interface for rapid DNA barcoding. It adopts a hybrid, two-stage algorithm. First, an alignment-free composition vector (CV) method is utilized to reduce searching space by screening a reference database. The alignment-based K2P distance nearest-neighbour method is then employed to analyse the smaller data set generated in the first stage. In comparison with other software, we demonstrate that VIP Barcoding has (i) higher accuracy than Blastn and several alignment-free methods and (ii) higher scalability than alignment-based distance methods and character-based methods. These results suggest that this platform is able to deal with both large-scale and multilocus barcoding data with accuracy and can contribute to DNA barcoding for modern taxonomy. VIP Barcoding is free and available at http://msl.sls.cuhk.edu.hk/vipbarcoding/.
© 2014 John Wiley & Sons Ltd.

Entities:  

Keywords:  DNA barcoding; sequence analysis; software

Mesh:

Year:  2014        PMID: 24479510     DOI: 10.1111/1755-0998.12235

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  4 in total

1.  matK-QR classifier: a patterns based approach for plant species identification.

Authors:  Ravi Prabhakar More; Rupali Chandrashekhar Mane; Hemant J Purohit
Journal:  BioData Min       Date:  2016-12-09       Impact factor: 2.522

2.  Decision Tree Algorithm-Generated Single-Nucleotide Polymorphism Barcodes of rbcL Genes for 38 Brassicaceae Species Tagging.

Authors:  Cheng-Hong Yang; Kuo-Chuan Wu; Li-Yeh Chuang; Hsueh-Wei Chang
Journal:  Evol Bioinform Online       Date:  2018-03-05       Impact factor: 1.625

3.  A Comprehensive Quality Evaluation System for Complex Herbal Medicine Using PacBio Sequencing, PCR-Denaturing Gradient Gel Electrophoresis, and Several Chemical Approaches.

Authors:  Xiasheng Zheng; Peng Zhang; Baosheng Liao; Jing Li; Xingyun Liu; Yuhua Shi; Jinle Cheng; Zhitian Lai; Jiang Xu; Shilin Chen
Journal:  Front Plant Sci       Date:  2017-09-13       Impact factor: 5.753

4.  Diversity of Marine-Derived Fungal Cultures Exposed by DNA Barcodes: The Algorithm Matters.

Authors:  Nikos Andreakis; Lone Høj; Philip Kearns; Michael R Hall; Gavin Ericson; Rose E Cobb; Benjamin R Gordon; Elizabeth Evans-Illidge
Journal:  PLoS One       Date:  2015-08-26       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.