Literature DB >> 25172357

Global comparison of multiple-segmented viruses in 12-dimensional genome space.

Hsin-Hsiung Huang1, Chenglong Yu2, Hui Zheng3, Troy Hernandez4, Shek-Chung Yau5, Rong Lucy He6, Jie Yang3, Stephen S-T Yau7.   

Abstract

We have recently developed a computational approach in a vector space for genome-based virus classification. This approach, called the "Natural Vector (NV) representation", which is an alignment-free method, allows us to classify single-segmented viruses with high speed and accuracy. For multiple-segmented viruses, typically phylogenetic trees of each segment are reconstructed for discovering viral phylogeny. Consensus tree methods may be used to combine the phylogenetic trees based on different segments. However, consensus tree methods were not developed for instances where the viruses have different numbers of segments or where their segments do not match well. We propose a novel approach for comparing multiple-segmented viruses globally, even in cases where viruses contain different numbers of segments. Using our method, each virus is represented by a set of vectors in R(12). The Hausdorff distance is then used to compare different sets of vectors. Phylogenetic trees can be reconstructed based on this distance. The proposed method is used for predicting classification labels of viruses with n-segments (n ⩾ 1). The correctness rates of our predictions based on cross-validation are as high as 96.5%, 95.4%, 99.7%, and 95.6% for Baltimore class, family, subfamily, and genus, respectively, which are comparable to the rates for single-segmented viruses only. Our method is not affected by the number or order of segments. We also demonstrate that the natural graphical representation based on the Hausdorff distance is more reasonable than the consensus tree for a recent public health threat, the influenza A (H7N9) viruses.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Natural graphical representation; Natural vector; Nucleotide sequence; Phylogeny

Mesh:

Year:  2014        PMID: 25172357     DOI: 10.1016/j.ympev.2014.08.003

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  8 in total

1.  Ebolavirus classification based on natural vectors.

Authors:  Hui Zheng; Changchuan Yin; Tung Hoang; Rong Lucy He; Jie Yang; Stephen S-T Yau
Journal:  DNA Cell Biol       Date:  2015-03-24       Impact factor: 3.311

2.  Whole-genome single nucleotide variant distribution on genomic regions and its relationship to major depression.

Authors:  Chenglong Yu; Bernhard T Baune; Julio Licinio; Ma-Li Wong
Journal:  Psychiatry Res       Date:  2017-02-20       Impact factor: 3.222

3.  Viral Phylogenomics Using an Alignment-Free Method: A Three-Step Approach to Determine Optimal Length of k-mer.

Authors:  Qian Zhang; Se-Ran Jun; Michael Leuze; David Ussery; Intawat Nookaew
Journal:  Sci Rep       Date:  2017-01-19       Impact factor: 4.379

4.  Virus Database and Online Inquiry System Based on Natural Vectors.

Authors:  Rui Dong; Hui Zheng; Kun Tian; Shek-Chung Yau; Weiguang Mao; Wenping Yu; Changchuan Yin; Chenglong Yu; Rong Lucy He; Jie Yang; Stephen St Yau
Journal:  Evol Bioinform Online       Date:  2017-12-17       Impact factor: 1.625

5.  Analysis of the Hosts and Transmission Paths of SARS-CoV-2 in the COVID-19 Outbreak.

Authors:  Rui Dong; Shaojun Pei; Changchuan Yin; Rong Lucy He; Stephen S-T Yau
Journal:  Genes (Basel)       Date:  2020-06-09       Impact factor: 4.096

6.  An open-source k-mer based machine learning tool for fast and accurate subtyping of HIV-1 genomes.

Authors:  Stephen Solis-Reyes; Mariano Avino; Art Poon; Lila Kari
Journal:  PLoS One       Date:  2018-11-14       Impact factor: 3.240

7.  Full Chromosomal Relationships Between Populations and the Origin of Humans.

Authors:  Rui Dong; Shaojun Pei; Mengcen Guan; Shek-Chung Yau; Changchuan Yin; Rong L He; Stephen S-T Yau
Journal:  Front Genet       Date:  2022-02-02       Impact factor: 4.599

8.  Large-Scale Genome Comparison Based on Cumulative Fourier Power and Phase Spectra: Central Moment and Covariance Vector.

Authors:  Shaojun Pei; Rui Dong; Rong Lucy He; Stephen S-T Yau
Journal:  Comput Struct Biotechnol J       Date:  2019-07-11       Impact factor: 7.271

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.