Literature DB >> 16241900

Genomic classification using an information-based similarity index: application to the SARS coronavirus.

Albert C-C Yang1, Ary L Goldberger, C-K Peng.   

Abstract

Measures of genetic distance based on alignment methods are confined to studying sequences that are conserved and identifiable in all organisms under study. A number of alignment-free techniques based on either statistical linguistics or information theory have been developed to overcome the limitations of alignment methods. We present a novel alignment-free approach to measuring the similarity among genetic sequences that incorporates elements from both word rank order-frequency statistics and information theory. We first validate this method on the human influenza A viral genomes as well as on the human mitochondrial DNA database. We then apply the method to study the origin of the SARS coronavirus. We find that the majority of the SARS genome is most closely related to group 1 coronaviruses, with smaller regions of matches to sequences from groups 2 and 3. The information based similarity index provides a new tool to measure the similarity between datasets based on their information content and may have a wide range of applications in the large-scale analysis of genomic databases.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16241900     DOI: 10.1089/cmb.2005.12.1103

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  15 in total

1.  Whole-proteome phylogeny of large dsDNA virus families by an alignment-free method.

Authors:  Guohong Albert Wu; Se-Ran Jun; Gregory E Sims; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2009-06-24       Impact factor: 11.205

2.  Phylogenetic analysis of protein sequences based on distribution of length about common sub-string.

Authors:  Guisong Chang; Tianming Wang
Journal:  Protein J       Date:  2011-03       Impact factor: 2.371

3.  Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model.

Authors:  Zu-Guo Yu; Ka Hou Chu; Chi Pang Li; Vo Anh; Li-Qian Zhou; Roger Wei Wang
Journal:  BMC Evol Biol       Date:  2010-06-22       Impact factor: 3.260

4.  Clustering heart rate dynamics is associated with β-adrenergic receptor polymorphisms: analysis by information-based similarity index.

Authors:  Albert C Yang; Shih-Jen Tsai; Chen-Jee Hong; Cynthia Wang; Tai-Jui Chen; Ying-Jay Liou; Chung-Kang Peng
Journal:  PLoS One       Date:  2011-05-04       Impact factor: 3.240

5.  Genomic signatures of strain selection and enhancement in Bacillus atrophaeus var. globigii, a historical biowarfare simulant.

Authors:  Henry S Gibbons; Stacey M Broomall; Lauren A McNew; Hajnalka Daligault; Carol Chapman; David Bruce; Mark Karavis; Michael Krepps; Paul A McGregor; Charles Hong; Kyong H Park; Arya Akmal; Andrew Feldman; Jeffrey S Lin; Wenling E Chang; Brandon W Higgs; Plamen Demirev; John Lindquist; Alvin Liem; Ed Fochler; Timothy D Read; Roxanne Tapia; Shannon Johnson; Kimberly A Bishop-Lilly; Chris Detter; Cliff Han; Shanmuga Sozhamannan; C Nicole Rosenzweig; Evan W Skowronski
Journal:  PLoS One       Date:  2011-03-25       Impact factor: 3.240

6.  Finding and identifying the viral needle in the metagenomic haystack: trends and challenges.

Authors:  Hayssam Soueidan; Louise-Amélie Schmitt; Thierry Candresse; Macha Nikolski
Journal:  Front Microbiol       Date:  2015-01-07       Impact factor: 5.640

7.  Genome analysis with the conditional multinomial distribution profile.

Authors:  Guisong Chang; Tianming Wang
Journal:  J Theor Biol       Date:  2010-12-01       Impact factor: 2.691

8.  Is multiple-sequence alignment required for accurate inference of phylogeny?

Authors:  Michael Höhl; Mark A Ragan
Journal:  Syst Biol       Date:  2007-04       Impact factor: 15.683

Review 9.  ACE2 enhance viral infection or viral infection aggravate the underlying diseases.

Authors:  Shaolei Teng; Qiyi Tang
Journal:  Comput Struct Biotechnol J       Date:  2020-08-06       Impact factor: 7.271

10.  Weighted multifractal cross-correlation analysis based on Shannon entropy.

Authors:  Hui Xiong; Pengjian Shang
Journal:  Commun Nonlinear Sci Numer Simul       Date:  2015-07-03       Impact factor: 4.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.