Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Repseek, a tool to retrieve approximate repeats from large DNA sequences.

Literature DB >> 17038345

Repseek, a tool to retrieve approximate repeats from large DNA sequences.

Guillaume Achaz¹, Frédéric Boyer, Eduardo P C Rocha, Alain Viari, Eric Coissac.

Abstract

UNLABELLED: Chromosomes or other long DNA sequences contain many highly similar repeated sub-sequences. While there are efficient methods for detecting strict repeats or detecting already characterized repeats, there is no software available for detecting approximate repeats in large DNA sequences allowing for weighted substitutions and indels in a coherent statistical framework. Here, we present an implementation of a two-steps method (seed detection followed by their extension) that detects those approximate repeats. Our method is computationally efficient enough to handle large sequences and is flexible enough to account for influencing factors, such as sequence-composition biases both at the seed detection and alignment levels. AVAILABILITY: http://wwwabi.snv.jussieu.fr/public/RepSeek/

Mesh：

Substances：
DNA

Year: 2006 PMID： 17038345 DOI： 10.1093/bioinformatics/btl519

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

30 in total

1. Searching for repeats, as an example of using the generalised Ruzzo-Tompa algorithm to find optimal subsequences with gaps.

Authors: John L Spouge; Leonardo Mariño-Ramírez; Sergey L Sheetlin
Journal: Int J Bioinform Res Appl Date: 2014

2. Predicting phenotypic variation in yeast from individual genome sequences.

Authors: Rob Jelier; Jennifer I Semple; Rosa Garcia-Verdugo; Ben Lehner
Journal: Nat Genet Date: 2011-11-13 Impact factor: 38.330

3. REMiner-II: a tool for rapid identification and configuration of repetitive element arrays from large mammalian chromosomes as a single query.

Authors: Woo-Chan Kim; Kang-Hoon Lee; Kyung-Seop Shin; Ri-Na You; Young-Kwan Lee; Kiho Cho; Dong-Ho Cho
Journal: Genomics Date: 2012-06-28 Impact factor: 5.736

4. Finding and Characterizing Repeats in Plant Genomes.

Authors: Jacques Nicolas; Sébastien Tempel; Anna-Sophie Fiston-Lavier; Emira Cherif
Journal: Methods Mol Biol Date: 2022

5. Localized effect of treated wastewater effluent on the resistome of an urban watershed.

Authors: Christopher N Thornton; Windy D Tanner; James A VanDerslice; William J Brazelton
Journal: Gigascience Date: 2020-11-19 Impact factor: 6.524

6. Understanding the evolution of holoparasitic plants: the complete plastid genome of the holoparasite Cytinus hypocistis (Cytinaceae).

Authors: Cristina Roquet; Éric Coissac; Corinne Cruaud; Martí Boleda; Frédéric Boyer; Adriana Alberti; Ludovic Gielly; Pierre Taberlet; Wilfried Thuiller; Jérémie Van Es; Sébastien Lavergne
Journal: Ann Bot Date: 2016-10-01 Impact factor: 4.357

Repseek, a tool to retrieve approximate repeats from large DNA sequences.

1. Searching for repeats, as an example of using the generalised Ruzzo-Tompa algorithm to find optimal subsequences with gaps.

2. Predicting phenotypic variation in yeast from individual genome sequences.

3. REMiner-II: a tool for rapid identification and configuration of repetitive element arrays from large mammalian chromosomes as a single query.

4. Finding and Characterizing Repeats in Plant Genomes.

5. Localized effect of treated wastewater effluent on the resistome of an urban watershed.

6. Understanding the evolution of holoparasitic plants: the complete plastid genome of the holoparasite Cytinus hypocistis (Cytinaceae).

7. Identification of repeat structure in large genomes using repeat probability clouds.

8. Reductive genome evolution in chemoautotrophic intracellular symbionts of deep-sea Calyptogena clams.

9. The DAWGPAWS pipeline for the annotation of genes and transposable elements in plant genomes.

10. Swelfe: a detector of internal repeats in sequences and structures.