Literature DB >> 17237101

Tandem repeats over the edit distance.

Dina Sokol1, Gary Benson, Justin Tojeira.   

Abstract

MOTIVATION: A tandem repeat in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats occur in the genomes of both eukaryotic and prokaryotic organisms. They are important in numerous fields including disease diagnosis, mapping studies, human identity testing (DNA fingerprinting), sequence homology and population studies. Although tandem repeats have been used by biologists for many years, there are few tools available for performing an exhaustive search for all tandem repeats in a given sequence.
RESULTS: In this paper we describe an efficient algorithm for finding all tandem repeats within a sequence, under the edit distance measure. The contributions of this paper are two-fold: theoretical and practical. We present a precise definition for tandem repeats over the edit distance and an efficient, deterministic algorithm for finding these repeats. AVAILABILITY: The algorithm has been implemented in C++, and the software is available upon request and can be used at http://www.sci.brooklyn.cuny.edu/~sokol/trepeats. The use of this tool will assist biologists in discovering new ways that tandem repeats affect both the structure and function of DNA and protein molecules.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17237101     DOI: 10.1093/bioinformatics/btl309

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  A new way to visualize DNA's base succession: the Caenorhabditis elegans chromosome landscapes.

Authors:  Afef Elloumi Oueslati; Imen Messaoudi; Zied Lachiri; Noureddine Ellouze
Journal:  Med Biol Eng Comput       Date:  2015-05-24       Impact factor: 2.602

2.  Searching microsatellites in DNA sequences: approaches used and tools developed.

Authors:  Atul Grover; Veenu Aishwarya; P C Sharma
Journal:  Physiol Mol Biol Plants       Date:  2011-12-23

3.  A method for discovering common patterns from two RNA secondary structures and its application to structural repeat detection.

Authors:  Lei Hua; Jason T L Wang; Xiang Ji; Ankur Malhotra; Mugdha Khaladkar; Bruce A Shapiro; Kaizhong Zhang
Journal:  J Bioinform Comput Biol       Date:  2012-06-22       Impact factor: 1.122

Review 4.  Development and application of MLVA methods as a tool for inter-laboratory surveillance.

Authors:  C A Nadon; E Trees; L K Ng; E Møller Nielsen; A Reimer; N Maxwell; K A Kubota; P Gerner-Smidt
Journal:  Euro Surveill       Date:  2013-08-29

5.  MACFP: Maximal Approximate Consecutive Frequent Pattern Mining under Edit Distance.

Authors:  Jingbo Shang; Jian Peng; Jiawei Han
Journal:  Proc SIAM Int Conf Data Min       Date:  2016-05

6.  TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

Authors:  Marco Pellegrini; M Elena Renda; Alessio Vecchio
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

7.  TRedD--a database for tandem repeats over the edit distance.

Authors:  Dina Sokol; Firat Atagun
Journal:  Database (Oxford)       Date:  2010-07-06       Impact factor: 3.451

8.  Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.

Authors:  Marco Pellegrini; Maria Elena Renda; Alessio Vecchio
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

9.  Repeat or not repeat?--Statistical validation of tandem repeat prediction in genomic sequences.

Authors:  Elke Schaper; Andrey V Kajava; Alain Hauser; Maria Anisimova
Journal:  Nucleic Acids Res       Date:  2012-08-25       Impact factor: 16.971

10.  MsDetector: toward a standard computational tool for DNA microsatellites detection.

Authors:  Hani Z Girgis; Sergey L Sheetlin
Journal:  Nucleic Acids Res       Date:  2012-10-02       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.