Literature DB >> 15374857

RAP: a new computer program for de novo identification of repeated sequences in whole genomes.

Davide Campagna1, Chiara Romualdi, Nicola Vitulo, Micky Del Favero, Matej Lexa, Nicola Cannata, Giorgio Valle.   

Abstract

MOTIVATION: DNA repeats are a common feature of most genomic sequences. Their de novo identification is still difficult despite being a crucial step in genomic analysis and oligonucleotides design. Several efficient algorithms based on word counting are available, but too short words decrease specificity while long words decrease sensitivity, particularly in degenerated repeats.
RESULTS: The Repeat Analysis Program (RAP) is based on a new word-counting algorithm optimized for high resolution repeat identification using gapped words. Many different overlapping gapped words can be counted at the same genomic position, thus producing a better signal than the single ungapped word. This results in better specificity both in terms of low-frequency detection, being able to identify sequences repeated only once, and highly divergent detection, producing a generally high score in most intron sequences. AVAILABILITY: The program is freely available for non-profit organizations, upon request to the authors. CONTACT: giorgio.valle@unipd.it SUPPLEMENTARY INFORMATION: The program has been tested on the Caenorhabditis elegans genome using word lengths of 12, 14 and 16 bases. The full analysis has been implemented in the UCSC Genome Browser and is accessible at http://genome.cribi.unipd.it.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15374857     DOI: 10.1093/bioinformatics/bti039

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

1.  A fast, lock-free approach for efficient parallel counting of occurrences of k-mers.

Authors:  Guillaume Marçais; Carl Kingsford
Journal:  Bioinformatics       Date:  2011-01-07       Impact factor: 6.937

2.  Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes.

Authors:  Mateusz Janicki; Rebecca Rooke; Guojun Yang
Journal:  Chromosome Res       Date:  2011-08       Impact factor: 4.620

3.  Identification of repeat structure in large genomes using repeat probability clouds.

Authors:  Wanjun Gu; Todd A Castoe; Dale J Hedges; Mark A Batzer; David D Pollock
Journal:  Anal Biochem       Date:  2008-05-20       Impact factor: 3.365

4.  Novel porcine repetitive elements.

Authors:  Ralph T Wiedmann; Dan J Nonneman; John W Keele
Journal:  BMC Genomics       Date:  2006-12-01       Impact factor: 3.969

5.  LTRtype, an Efficient Tool to Characterize Structurally Complex LTR Retrotransposons and Nested Insertions on Genomes.

Authors:  Fan-Chun Zeng; You-Jie Zhao; Que-Jie Zhang; Li-Zhi Gao
Journal:  Front Plant Sci       Date:  2017-04-04       Impact factor: 5.753

6.  Rapid quantification of sequence repeats to resolve the size, structure and contents of bacterial genomes.

Authors:  David Williams; William L Trimble; Meghan Shilts; Folker Meyer; Howard Ochman
Journal:  BMC Genomics       Date:  2013-08-08       Impact factor: 3.969

7.  A new method to compute K-mer frequencies and its application to annotate large repetitive plant genomes.

Authors:  Stefan Kurtz; Apurva Narechania; Joshua C Stein; Doreen Ware
Journal:  BMC Genomics       Date:  2008-10-31       Impact factor: 3.969

8.  LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons.

Authors:  David Ellinghaus; Stefan Kurtz; Ute Willhoeft
Journal:  BMC Bioinformatics       Date:  2008-01-14       Impact factor: 3.169

9.  Structural and functional genomics of tomato.

Authors:  Amalia Barone; Maria Luisa Chiusano; Maria Raffaella Ercolano; Giovanni Giuliano; Silvana Grandillo; Luigi Frusciante
Journal:  Int J Plant Genomics       Date:  2008

10.  A benchmark study of k-mer counting methods for high-throughput sequencing.

Authors:  Swati C Manekar; Shailesh R Sathe
Journal:  Gigascience       Date:  2018-12-01       Impact factor: 6.524

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.