Abstract
Numerous efficient methods based on word counts have been proposed to characterize DNA sequences for comparison, database retrieval, and reconstruction of evolutionary relationships. However, most of them seem unrelated to any intrinsic characteristics of DNA. In this paper, we propose a novel statistical measure for sequence comparison based on k-word counts. The new measure removes the influence of sequence length and uncovers a bulk property of DNA sequences. The proposed measure was tested by similarity search and phylogenetic analysis. The experimental assessment demonstrated that our similarity measure was efficient.
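The abstract does not state the formula of the proposed measure, so the following is only a generic illustration of what comparing sequences by k-word counts can look like. It counts overlapping k-words, converts the counts to relative frequencies (one common way to reduce the effect of sequence length), and compares two sequences with a Euclidean distance. The function names `kword_frequencies` and `euclidean_distance`, the restriction to the alphabet ACGT, and the choice of Euclidean distance are all assumptions made for this sketch and are not taken from the paper.

```python
from collections import Counter
from itertools import product
import math


def kword_frequencies(seq, k):
    """Return relative frequencies of all 4**k DNA k-words in seq.

    Hypothetical helper: overlapping k-words are counted, and words
    containing characters other than A, C, G, T are ignored.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Fixed word order so vectors from different sequences are comparable.
    words = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[w] for w in words)
    if total == 0:
        return [0.0] * len(words)
    # Dividing by the total count reduces the dependence on sequence length.
    return [counts[w] / total for w in words]


def euclidean_distance(seq_a, seq_b, k=2):
    """Euclidean distance between k-word frequency vectors.

    This is a generic stand-in, not the measure proposed in the paper.
    """
    fa = kword_frequencies(seq_a, k)
    fb = kword_frequencies(seq_b, k)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))
```

With this sketch, two sequences of different lengths but identical single-nucleotide composition, such as `"ACGT"` and `"ACGTACGT"` at `k=1`, have distance zero, which illustrates the kind of length independence the abstract refers to.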
Year: 2012 PMID: 23147229 DOI: 10.1016/j.jtbi.2012.10.035
Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691