Literature DB >> 15272430

Prokaryote phylogeny without sequence alignment: from avoidance signature to composition distance.

Bailin Hao1, Ji Qi.   

Abstract

This is a review of a new and essentially simple method of inferring phylogenetic relationships from complete genome data without using sequence alignment. The method is based on counting the appearance frequency of oligopeptides of a fixed length (up to K = 6) in the collection of protein sequences of a species. It is a method without fine adjustment and choice of genes. Applied to prokaryotic genomes it has led to results comparable with the bacteriologists' systematics as reflected in the latest 2002 outline of the Bergey's Manual of Systematic Bacteriology. The method has also been used to compare chloroplast genomes and to the phylogeny of Coronaviruses including human SARS-CoV. A key point in our approach is subtraction of a random background from the original counts by using a Markov model of order K-2 in order to highlight the shaping role of natural selection. The implications of the subtraction procedure is specially analyzed and further development of the new approach is indicated. Copyright Imperial College Press

Entities:  

Mesh:

Year:  2004        PMID: 15272430     DOI: 10.1142/s0219720004000442

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  26 in total

Review 1.  Phylogenetic framework and molecular signatures for the main clades of the phylum Actinobacteria.

Authors:  Beile Gao; Radhey S Gupta
Journal:  Microbiol Mol Biol Rev       Date:  2012-03       Impact factor: 11.056

Review 2.  Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis.

Authors:  Oliver Bonham-Carter; Joe Steele; Dhundy Bastola
Journal:  Brief Bioinform       Date:  2013-07-31       Impact factor: 11.622

3.  Cyanobacterial phylogenetic analysis based on phylogenomics approaches render evolutionary diversification and adaptation: an overview of representative orders.

Authors:  Ratna Prabha; Dhananjaya P Singh
Journal:  3 Biotech       Date:  2019-02-15       Impact factor: 2.406

4.  A fungal phylogeny based on 82 complete genomes using the composition vector method.

Authors:  Hao Wang; Zhao Xu; Lei Gao; Bailin Hao
Journal:  BMC Evol Biol       Date:  2009-08-10       Impact factor: 3.260

5.  MS4--Multi-Scale Selector of Sequence Signatures: an alignment-free method for classification of biological sequences.

Authors:  Eduardo Corel; Florian Pitschi; Ivan Laprevotte; Gilles Grasseau; Gilles Didier; Claudine Devauchelle
Journal:  BMC Bioinformatics       Date:  2010-07-30       Impact factor: 3.169

6.  Integrating overlapping structures and background information of words significantly improves biological sequence comparison.

Authors:  Qi Dai; Lihua Li; Xiaoqing Liu; Yuhua Yao; Fukun Zhao; Michael Zhang
Journal:  PLoS One       Date:  2011-11-10       Impact factor: 3.240

7.  Fast algorithms for computing sequence distances by exhaustive substring composition.

Authors:  Alberto Apostolico; Olgert Denas
Journal:  Algorithms Mol Biol       Date:  2008-10-28       Impact factor: 1.405

8.  CVTree update: a newly designed phylogenetic study platform using composition vectors and whole genomes.

Authors:  Zhao Xu; Bailin Hao
Journal:  Nucleic Acids Res       Date:  2009-04-26       Impact factor: 16.971

9.  A dictionary based informational genome analysis.

Authors:  Alberto Castellini; Giuditta Franco; Vincenzo Manca
Journal:  BMC Genomics       Date:  2012-09-17       Impact factor: 3.969

10.  Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers.

Authors:  Zongzhi Liu; Todd Z DeSantis; Gary L Andersen; Rob Knight
Journal:  Nucleic Acids Res       Date:  2008-08-22       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.