Numerical characteristics of word frequencies and their application to dissimilarity measure for sequence comparison.

Qi Dai, Xiaoqing Liu, Yuhua Yao, Fukun Zhao.

Abstract

Sequence comparison is one of the major tasks in bioinformatics; it can be used to study structural and functional conservation, as well as evolutionary relations among sequences. Numerous dissimilarity measures achieve promising results in sequence comparison, but challenges remain. This paper studied numerical characteristics of word frequencies and proposed a novel dissimilarity measure for sequence comparison. Instead of using the word frequencies directly, the proposed measure considers both the word frequencies and the overlapping structures of words. To verify the effectiveness of the proposed measure, we tested it in two experiments and compared it with alignment-based and alignment-free measures. The results demonstrate that the proposed measure, by extracting more information on the overlapping structures of words, improves the efficiency of sequence comparison.

Crown Copyright © 2011. Published by Elsevier Ltd. All rights reserved.
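
The abstract does not give the formula of the proposed measure, so the following is only a minimal sketch of the baseline it improves on: counting overlapping words of length k and comparing the raw frequency vectors. The function names, the default word length `k = 2`, and the choice of Euclidean distance are assumptions made for illustration; the sketch uses word frequencies directly and does not model the overlapping structures of words that the paper adds.

```python
from collections import Counter
from math import sqrt


def word_frequencies(seq: str, k: int) -> dict[str, float]:
    """Relative frequencies of the overlapping k-words in seq.

    A sliding window of width k moves one position at a time, so
    consecutive words share k - 1 characters.
    """
    total = len(seq) - k + 1
    if total <= 0:
        raise ValueError("sequence is shorter than the word length k")
    counts = Counter(seq[i:i + k] for i in range(total))
    return {word: n / total for word, n in counts.items()}


def euclidean_dissimilarity(a: str, b: str, k: int = 2) -> float:
    """Baseline alignment-free dissimilarity (illustrative assumption):
    Euclidean distance between the two k-word frequency vectors."""
    fa, fb = word_frequencies(a, k), word_frequencies(b, k)
    words = set(fa) | set(fb)  # words missing from one sequence count as 0
    return sqrt(sum((fa.get(w, 0.0) - fb.get(w, 0.0)) ** 2 for w in words))


if __name__ == "__main__":
    print(word_frequencies("ACGTAC", 2))
    print(euclidean_dissimilarity("ACGTACGT", "ACGTTTGT"))
```

Identical sequences score 0 under this baseline, and sequences with no shared words score highest; a measure of the kind the abstract describes would additionally take into account how words overlap.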

Year: 2011    PMID: 21334347    DOI: 10.1016/j.jtbi.2011.02.005

Source DB: PubMed    Journal: J Theor Biol    ISSN: 0022-5193    Impact factor: 2.691


  4 in total

1.  Function-based classification of carbohydrate-active enzymes by recognition of short, conserved peptide motifs.

Authors:  Peter Kamp Busk; Lene Lange
Journal:  Appl Environ Microbiol       Date:  2013-03-22       Impact factor: 4.792

2.  A novel hierarchical clustering algorithm for gene sequences.

Authors:  Dan Wei; Qingshan Jiang; Yanjie Wei; Shengrui Wang
Journal:  BMC Bioinformatics       Date:  2012-07-23       Impact factor: 3.169

3.  An improved alignment-free model for DNA sequence similarity metric.

Authors:  Junpeng Bao; Ruiyu Yuan; Zhe Bao
Journal:  BMC Bioinformatics       Date:  2014-09-28       Impact factor: 3.169

4.  A relative Lempel-Ziv complexity: Application to comparing biological sequences.

Authors:  Liwei Liu; Dongbo Li; Fenglan Bai
Journal:  Chem Phys Lett       Date:  2012-02-01       Impact factor: 2.328
