Linear regression model of short k-word: a similarity distance suitable for biological sequences with various lengths.

Xiwu Yang, Tianming Wang.

Abstract

Because biological sequences differ in length, k-word based methods and graphical representation approaches have each uncovered biological information in their own distinct ways. However, it is unlikely that the mechanisms of information storage vary with sequence length. A similarity distance suitable for sequences of various lengths should therefore be much closer to those mechanisms. In this paper, new sub-sequences of k-words were extracted from biological sequences under a one-to-one mapping. The new sub-sequences were evaluated by a linear regression model, and a new distance was defined on the invariants of that model. Compared with other alignment-free distances, the results of four experiments demonstrated that our similarity distance was more efficient.
© 2013 Elsevier Ltd. All rights reserved.
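
For readers unfamiliar with k-word (k-mer) comparison, the following is a minimal sketch of a generic alignment-free baseline: relative k-word frequency profiles compared by Euclidean distance. It is not the regression-based distance defined in the paper, whose construction the abstract does not spell out. The DNA alphabet, the function names `kword_frequencies` and `kword_distance`, and the default `k=2` are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
import math

ALPHABET = "ACGT"  # assumption: DNA sequences over a four-letter alphabet


def kword_frequencies(seq, k):
    """Relative frequency of every k-word over ALPHABET, in a fixed order."""
    # Count overlapping k-words by sliding a window of width k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Number of windows; guarded so sequences shorter than k give all zeros.
    total = max(len(seq) - k + 1, 1)
    words = ("".join(p) for p in product(ALPHABET, repeat=k))
    return [counts[w] / total for w in words]


def kword_distance(a, b, k=2):
    """Euclidean distance between the k-word frequency profiles of a and b."""
    fa = kword_frequencies(a, k)
    fb = kword_frequencies(b, k)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))
```

Using relative frequencies rather than raw counts is what makes sequences of different lengths comparable in this baseline: `kword_distance("ACGT", "ACGTACGT", k=1)` is zero because both sequences have the same base composition.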

Keywords:  Alignment-free; Bootstrap; Phylogenetic tree; Sequence comparison

Year:  2013        PMID: 23933105     DOI: 10.1016/j.jtbi.2013.07.028

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

1.  Circular Helix-Like Curve: An Effective Tool of Biological Sequence Analysis and Comparison.

Authors:  Yushuang Li; Wenli Xiao
Journal:  Comput Math Methods Med       Date:  2016-06-14       Impact factor: 2.238

2.  A Statistical Similarity/Dissimilarity Analysis of Protein Sequences Based on a Novel Group Representative Vector.

Authors:  Marwa A Abd Elwahaab; Mervat M Abo-Elkhier; Moheb I Abo El Maaty
Journal:  Biomed Res Int       Date:  2019-05-08       Impact factor: 3.411

3.  One novel representation of DNA sequence based on the global and local position information.

Authors:  Zhiyi Mo; Wen Zhu; Yi Sun; Qilin Xiang; Ming Zheng; Min Chen; Zejun Li
Journal:  Sci Rep       Date:  2018-05-15       Impact factor: 4.379
