Literature DB >> 20509856

Loose and strict repeats in weighted sequences of proteins.

Hui Zhang1, Qing Guo, Jing Fan, Costas S Iliopoulos.   

Abstract

A weighted sequence is a string in which a set of characters may appear at each position with respective probabilities of occurrence. Weighted sequences are able to summarize poorly defined short sequences, as well as the profiles of protein families and complete chromosome sequences. Thus it is of biological and theoretical significance to design powerful algorithms on weighted sequences. A common task is to identify repetitive motifs in weighted sequences, with presence probability not less than a given threshold. We define two types of repeats in weighted sequences, called the loose repeats and the strict repeats, respectively, and then attempt to locate these repeats. Using an iterative partitioning technique, we present algorithms for computing all the loose repeats and strict repeats of every length, respectively. Each solution costs O(n(2)) time.

Mesh:

Substances:

Year:  2010        PMID: 20509856     DOI: 10.2174/092986610791760324

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  2 in total

1.  Locating tandem repeats in weighted sequences in proteins.

Authors:  Hui Zhang; Qing Guo; Costas S Iliopoulos
Journal:  BMC Bioinformatics       Date:  2013-05-09       Impact factor: 3.169

2.  Optimal computation of all tandem repeats in a weighted sequence.

Authors:  Carl Barton; Costas S Iliopoulos; Solon P Pissis
Journal:  Algorithms Mol Biol       Date:  2014-08-16       Impact factor: 1.405

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.