| Literature DB >> 29308007 |
Rui Dong1, Hui Zheng2, Kun Tian1, Shek-Chung Yau3, Weiguang Mao1, Wenping Yu4, Changchuan Yin2, Chenglong Yu5,6, Rong Lucy He7, Jie Yang2, Stephen St Yau1.
Abstract
We construct a virus database called VirusDB (http://yaulab.math.tsinghua.edu.cn/VirusDB/) and an online inquiry system to serve people who are interested in viral classification and prediction. The database stores all viral genomes, their corresponding natural vectors, and the classification information of the single/multiple-segmented viral reference sequences downloaded from National Center for Biotechnology Information. The online inquiry system serves the purpose of computing natural vectors and their distances based on submitted genomes, providing an online interface for accessing and using the database for viral classification and prediction, and back-end processes for automatic and manual updating of database content to synchronize with GenBank. Submitted genomes data in FASTA format will be carried out and the prediction results with 5 closest neighbors and their classifications will be returned by email. Considering the one-to-one correspondence between sequence and natural vector, time efficiency, and high accuracy, natural vector is a significant advance compared with alignment methods, which makes VirusDB a useful database in further research.Entities:
Keywords: classification; database; genome sequences; natural vector; virus
Year: 2017 PMID: 29308007 PMCID: PMC5751915 DOI: 10.1177/1176934317746667
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1.The high-level design of the inquiry system.
Figure 2.The basic workflow of VirusDB.
Figure 3.Screenshots of procedure of submission and prediction of a single-segmented virus.
Figure 4.The Filoviridae viruses in our database through Current Collection.