Literature DB >> 11238070

An information-based sequence distance and its application to whole mitochondrial genome phylogeny.

M Li1, J H Badger, X Chen, S Kwong, P Kearney, H Zhang.   

Abstract

MOTIVATION: Traditional sequence distances require an alignment and therefore are not directly applicable to the problem of whole genome phylogeny where events such as rearrangements make full length alignments impossible. We present a sequence distance that works on unaligned sequences using the information theoretical concept of Kolmogorov complexity and a program to estimate this distance.
RESULTS: We establish the mathematical foundations of our distance and illustrate its use by constructing a phylogeny of the Eutherian orders using complete unaligned mitochondrial genomes. This phylogeny is consistent with the commonly accepted one for the Eutherians. A second, larger mammalian dataset is also analyzed, yielding a phylogeny generally consistent with the commonly accepted one for the mammals. AVAILABILITY: The program to estimate our sequence distance, is available at http://www.cs.cityu.edu.hk/~cssamk/gencomp/GenCompress1.htm. The distance matrices used to generate our phylogenies are available at http://www.math.uwaterloo.ca/~mli/distance.html.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11238070     DOI: 10.1093/bioinformatics/17.2.149

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  58 in total

1.  Non-Euclidean properties of spike train metric spaces.

Authors:  Dmitriy Aronov; Jonathan D Victor
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-06-02

2.  Metagenomic Classification Using an Abstraction Augmented Markov Model.

Authors:  Xiujun Sylvia Zhu; Monnie McGee
Journal:  J Comput Biol       Date:  2015-11-30       Impact factor: 1.479

3.  Phylogeny of prokaryotes and chloroplasts revealed by a simple composition approach on all protein sequences from complete genomes without sequence alignment.

Authors:  Z G Yu; L Q Zhou; V V Anh; K H Chu; S C Long; J Q Deng
Journal:  J Mol Evol       Date:  2005-04       Impact factor: 2.395

4.  Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions.

Authors:  Gregory E Sims; Se-Ran Jun; Guohong A Wu; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-02       Impact factor: 11.205

5.  Compressing genomic sequence fragments using SlimGene.

Authors:  Christos Kozanitis; Chris Saunders; Semyon Kruglyak; Vineet Bafna; George Varghese
Journal:  J Comput Biol       Date:  2011-03       Impact factor: 1.479

6.  Large local analysis of the unaligned genome and its application.

Authors:  Lianping Yang; Xiangde Zhang; Tianming Wang; Hegui Zhu
Journal:  J Comput Biol       Date:  2013-01       Impact factor: 1.479

7.  Phylogeny Based on Whole Genome as inferred from Complete Information Set Analysis.

Authors:  W Li; W Fang; L Ling; J Wang; Z Xuan; R Chen
Journal:  J Biol Phys       Date:  2002-09       Impact factor: 1.365

8.  mspecLINE: bridging knowledge of human disease with the proteome.

Authors:  Jeremy Handcock; Eric W Deutsch; John Boyle
Journal:  BMC Med Genomics       Date:  2010-03-10       Impact factor: 3.063

9.  Proper distance metrics for phylogenetic analysis using complete genomes without sequence alignment.

Authors:  Zu-Guo Yu; Xiao-Wen Zhan; Guo-Sheng Han; Roger W Wang; Vo Anh; Ka Hou Chu
Journal:  Int J Mol Sci       Date:  2010-03-18       Impact factor: 5.923

10.  Comparing biological networks via graph compression.

Authors:  Morihiro Hayashida; Tatsuya Akutsu
Journal:  BMC Syst Biol       Date:  2010-09-13
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.