
Whole genome/proteome based phylogeny reconstruction for prokaryotes using higher order Markov model and chaos game representation.

Wei-Feng Yang, Zu-Guo Yu, Vo Anh.

Abstract

Traditional methods for sequence comparison and phylogeny reconstruction rely on pairwise and multiple sequence alignments. However, alignment cannot be applied directly to whole genome/proteome comparison and phylogenomic studies because of its high computational complexity. Hence alignment-free methods have become popular in recent years. Here we propose a fast alignment-free method for whole genome/proteome comparison and phylogeny reconstruction using a higher order Markov model and chaos game representation. In the present method, we use the transition matrices of higher order Markov models to characterize amino acid or DNA sequences for comparison. The order of the Markov model is uniquely identified by maximizing the average Shannon entropy of the conditional probability distributions. By using a one-dimensional chaos game representation and a linked list, the method reduces the large memory and time consumption caused by the large-scale conditional probability distributions. To illustrate the effectiveness of our method, we employ it for fast phylogeny reconstruction based on the genome/proteome sequences of two species data sets used in previously published papers. Our results demonstrate that the present method is useful and efficient.
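The ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the DNA alphabet, the base-4 indexing used as a stand-in for a one-dimensional chaos game representation, the dictionary used in place of a linked list, and the choice to average the entropy over observed contexts only are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from math import log2

ALPHABET = "ACGT"  # assumed DNA alphabet; a proteome would use the 20 amino acids


def cgr_index(word, alphabet=ALPHABET):
    """Read a word as a base-len(alphabet) integer (assumed 1D CGR-style index)."""
    idx = 0
    for ch in word:
        idx = idx * len(alphabet) + alphabet.index(ch)
    return idx


def cgr_point(word, alphabet=ALPHABET):
    """Scale the integer index of a word into the unit interval [0, 1)."""
    return cgr_index(word, alphabet) / len(alphabet) ** len(word)


def transition_probs(seq, k, alphabet=ALPHABET):
    """Estimate order-k conditional distributions P(next symbol | k-symbol context).

    Only contexts that occur in seq are stored (a sparse dict keyed by the
    context index), so memory grows with the data, not with len(alphabet)**k.
    """
    valid = set(alphabet)
    counts = defaultdict(Counter)
    for i in range(len(seq) - k):
        window = seq[i:i + k + 1]
        if set(window) <= valid:  # skip windows containing ambiguous symbols
            counts[cgr_index(window[:k], alphabet)][window[k]] += 1
    probs = {}
    for ctx, counter in counts.items():
        total = sum(counter.values())
        probs[ctx] = {sym: n / total for sym, n in counter.items()}
    return probs


def average_entropy(probs):
    """Mean Shannon entropy (bits) over the observed conditional distributions."""
    if not probs:
        return 0.0
    total = 0.0
    for dist in probs.values():
        total -= sum(p * log2(p) for p in dist.values())
    return total / len(probs)


def best_order(seq, max_k, alphabet=ALPHABET):
    """Return the order in 1..max_k with the largest average entropy (first on ties)."""
    return max(range(1, max_k + 1),
               key=lambda k: average_entropy(transition_probs(seq, k, alphabet)))
```

Under these assumptions, `transition_probs(genome, best_order(genome, 6))` would give a sparse transition matrix for one sequence; how two such matrices are turned into a distance is not specified in the abstract and is left out here.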
AVAILABILITY AND IMPLEMENTATION: The source code of our algorithm for computing the distance matrix, together with the genome/proteome sequences, can be downloaded from ftp://121.199.20.25/. The Phylip and EvolView software that we used to construct the phylogenetic trees is available from the respective websites.
Copyright © 2015 Elsevier Inc. All rights reserved.

Keywords:  Alignment-free whole proteome comparison; Chaos game representation; Higher order Markov model; Phylogenetic tree; Shannon entropy

Year:  2015        PMID: 26724405     DOI: 10.1016/j.ympev.2015.12.011

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  1 in total

1.  Phylogenetic Analysis of HIV-1 Genomes Based on the Position-Weighted K-mers Method.

Authors:  Yuanlin Ma; Zuguo Yu; Runbin Tang; Xianhua Xie; Guosheng Han; Vo V Anh
Journal:  Entropy (Basel)       Date:  2020-02-23       Impact factor: 2.524
