Literature DB >> 15145809

Exhaustive whole-genome tandem repeats search.

Arun Krishnan1, Francis Tang.   

Abstract

MOTIVATION: Approximate tandem repeats (ATR) occur frequently in the genomes of organisms, and are a source of polymorphisms observed in individuals, and thus are of interest to those studying genetic disorders. Though extensive work has been done in order to identify ATRs, there are inherent limitations with the current approaches in terms of the number of pattern sizes that can be searched or the size of the input length.
RESULTS: This paper describes (1) a new algorithm which exhaustively finds all variable-length ATRs in a genomic sequence and (2) a precise description of, and an algorithm to significantly reduce, redundancy in the output. Our ATR definition is parameterized by a mismatch ratio p which allows for more mismatches in longer tandem repeats (and fewer in shorter). Furthermore, our algorithm is embarrassingly parallel and thus can attain near-linear speed-up on Beowulf clusters. We present results of our algorithm applied to sequences of widely differing lengths (from genes to chromosomes). AVAILABILITY: Source and binaries are available on request.

Entities:  

Mesh:

Year:  2004        PMID: 15145809     DOI: 10.1093/bioinformatics/bth311

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  A new way to visualize DNA's base succession: the Caenorhabditis elegans chromosome landscapes.

Authors:  Afef Elloumi Oueslati; Imen Messaoudi; Zied Lachiri; Noureddine Ellouze
Journal:  Med Biol Eng Comput       Date:  2015-05-24       Impact factor: 2.602

2.  Searching microsatellites in DNA sequences: approaches used and tools developed.

Authors:  Atul Grover; Veenu Aishwarya; P C Sharma
Journal:  Physiol Mol Biol Plants       Date:  2011-12-23

3.  TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

Authors:  Marco Pellegrini; M Elena Renda; Alessio Vecchio
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

4.  Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm.

Authors:  Matko Glunčić; Vladimir Paar
Journal:  Nucleic Acids Res       Date:  2012-09-12       Impact factor: 16.971

5.  Dot2dot: accurate whole-genome tandem repeats discovery.

Authors:  Loredana M Genovese; Marco M Mosca; Marco Pellegrini; Filippo Geraci
Journal:  Bioinformatics       Date:  2019-03-15       Impact factor: 6.937

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.