Literature DB >> 15180940

STAR: an algorithm to Search for Tandem Approximate Repeats.

Olivier Delgrange1, Eric Rivals.   

Abstract

MOTIVATION: Tandem repeats consist in approximate and adjacent repetitions of a DNA motif. Such repeats account for large portions of eukaryotic genomes and have also been found in other life kingdoms. Owing to their polymorphism, tandem repeats have proven useful in genome cartography, forensic and population studies, etc. Nevertheless, they are not systematically detected nor annotated in genome projects. Partially because of this lack of data, their evolution is still poorly understood.
RESULTS: In this work, we design an exact algorithm to locate approximate tandem repeats (ATR) of a motif in a DNA sequence. Given a motif and a DNA sequence, our method named STAR, identifies all segments of the sequence that correspond to significant approximate tandem repetitions of the motif. In our model, an Exact Tandem Repeat (ETR) comes from the tandem duplication of the motif and an ATR derives from an ETR by a series of point mutations. An ATR can then be encoded as a number of duplications of the motif together with a list of mutations. Consequently, any sequence that is not an ATR cannot be encoded efficiently by this description, while a true ATR can. Our method uses the minimum description length criterion to identify which sequence segments are ATR. Our optimization procedure guarantees that STAR finds a combination of ATR that minimizes this criterion. AVAILABILITY: for use at http://atgc.lirmm.fr/star

Entities:  

Mesh:

Year:  2004        PMID: 15180940     DOI: 10.1093/bioinformatics/bth335

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  18 in total

Review 1.  Comparative genomics and molecular dynamics of DNA repeats in eukaryotes.

Authors:  Guy-Franck Richard; Alix Kerrest; Bernard Dujon
Journal:  Microbiol Mol Biol Rev       Date:  2008-12       Impact factor: 11.056

2.  HighSSR: high-throughput SSR characterization and locus development from next-gen sequencing data.

Authors:  Alexander Churbanov; Rachael Ryan; Nabeeh Hasan; Donovan Bailey; Haofeng Chen; Brook Milligan; Peter Houde
Journal:  Bioinformatics       Date:  2012-09-06       Impact factor: 6.937

3.  Searching microsatellites in DNA sequences: approaches used and tools developed.

Authors:  Atul Grover; Veenu Aishwarya; P C Sharma
Journal:  Physiol Mol Biol Plants       Date:  2011-12-23

4.  ImtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation.

Authors:  Viktor A Shamanskiy; Valeria N Timonina; Konstantin Yu Popadin; Konstantin V Gunbin
Journal:  BMC Genomics       Date:  2019-05-08       Impact factor: 3.969

5.  Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm.

Authors:  Matko Glunčić; Vladimir Paar
Journal:  Nucleic Acids Res       Date:  2012-09-12       Impact factor: 16.971

6.  A monte carlo method for assessing the quality of duplication-aware alignment algorithms.

Authors:  Valerio Freschi; Alessandro Bogliolo
Journal:  Evol Bioinform Online       Date:  2011-05-10       Impact factor: 1.625

7.  NTRFinder: a software tool to find nested tandem repeats.

Authors:  Atheer A Matroud; M D Hendy; C P Tuffley
Journal:  Nucleic Acids Res       Date:  2011-11-25       Impact factor: 16.971

8.  A lossy compression technique enabling duplication-aware sequence alignment.

Authors:  Valerio Freschi; Alessandro Bogliolo
Journal:  Evol Bioinform Online       Date:  2012-04-02       Impact factor: 1.625

9.  Repeat or not repeat?--Statistical validation of tandem repeat prediction in genomic sequences.

Authors:  Elke Schaper; Andrey V Kajava; Alain Hauser; Maria Anisimova
Journal:  Nucleic Acids Res       Date:  2012-08-25       Impact factor: 16.971

10.  MsDetector: toward a standard computational tool for DNA microsatellites detection.

Authors:  Hani Z Girgis; Sergey L Sheetlin
Journal:  Nucleic Acids Res       Date:  2012-10-02       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.