Literature DB >> 17945779

Using suffix tree to discover complex repetitive patterns in DNA sequences.

Dan He1.   

Abstract

The discovery of repetitive patterns is a fundamental problem in bioinformatics. It remains a challenging open problem because most of the existing methods, such as using annotated repeat database and extracting pairs of maximum repeated regions, can not give a correct definition incorporating both the length and frequency factors of the repetitive patterns. There is an algorithm considering both the pattern length and frequency. However, it could only find the simple "elementary" repeats and is not able to reveal the complex structure of the repetitive patterns. Furthermore, its time complexity O(n2 f), where n is the length of the sequence, f is the minimum frequency requirement, could be still too high for long DNA sequences. In this paper, we propose a novel algorithm using suffix tree to reveal the complex structure of the repetitive patterns in DNA sequences. We show that our algorithm achieves an O(n2/f2) time complexity.

Mesh:

Substances:

Year:  2006        PMID: 17945779     DOI: 10.1109/IEMBS.2006.260445

Source DB:  PubMed          Journal:  Conf Proc IEEE Eng Med Biol Soc        ISSN: 1557-170X


  2 in total

1.  Differential sequential patterns supporting insulin therapy of new-onset type 1 diabetes.

Authors:  Rafał Deja; Wojciech Froelich; Grażyna Deja
Journal:  Biomed Eng Online       Date:  2015-02-21       Impact factor: 2.819

2.  phRAIDER: Pattern-Hunter based Rapid Ab Initio Detection of Elementary Repeats.

Authors:  Carly E Schaeffer; Nathaniel D Figueroa; Xiaolin Liu; John E Karro
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.