Literature DB >> 18089620

A novel genome-scale repeat finder geared towards transposons.

Xuehui Li1, Tamer Kahveci, A Mark Settles.   

Abstract

MOTIVATION: Repeats are ubiquitous in genomes and play important roles in evolution. Transposable elements are a common kind of repeat. Transposon insertions can be nested and make the task of identifying repeats difficult.
RESULTS: We develop a novel iterative algorithm, called Greedier, to find repeats in a target genome given a repeat library. Greedier distinguishes itself from existing methods by taking into account the fragmentation of repeats. Each iteration consists of two passes. In the first pass, it identifies the local similarities between the repeat library and the target genome. Greedier then builds graphs from this comparison output. In each graph, a vertex denotes a similar subsequence pair. Edges denote pairs of subsequences that can be connected to form higher similarities. In the second pass, Greedier traverses these graphs greedily to find matches to individual repeat units in the repeat library. It computes a fitness value for each such match denoting the similarity of that match. Matches with fitness values greater than a cutoff are removed, and the rest of the genome is stitched together. The similarity cutoff is then gradually reduced, and the iteration is repeated until no hits are returned from the comparison. Our experiments on the Arabidopsis and rice genomes show that Greedier identifies approximately twice as many transposon bases as those found by cross_match and WindowMasker. Moreover, Greedier masks far fewer false positive bases than either cross_match or WindowMasker. In addition to masking repeats, Greedier also reports potential nested transposon structures.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18089620     DOI: 10.1093/bioinformatics/btm613

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Personal genome sequencing: current approaches and challenges.

Authors:  Michael Snyder; Jiang Du; Mark Gerstein
Journal:  Genes Dev       Date:  2010-03-01       Impact factor: 11.361

Review 2.  Diversity and evolution of transposable elements in Arabidopsis.

Authors:  Zoé Joly-Lopez; Thomas E Bureau
Journal:  Chromosome Res       Date:  2014-06       Impact factor: 5.239

3.  REMiner-II: a tool for rapid identification and configuration of repetitive element arrays from large mammalian chromosomes as a single query.

Authors:  Woo-Chan Kim; Kang-Hoon Lee; Kyung-Seop Shin; Ri-Na You; Young-Kwan Lee; Kiho Cho; Dong-Ho Cho
Journal:  Genomics       Date:  2012-06-28       Impact factor: 5.736

4.  Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes.

Authors:  Mateusz Janicki; Rebecca Rooke; Guojun Yang
Journal:  Chromosome Res       Date:  2011-08       Impact factor: 4.620

5.  TE-greedy-nester: structure-based detection of LTR retrotransposons and their nesting.

Authors:  Matej Lexa; Pavel Jedlicka; Ivan Vanat; Michal Cervenansky; Eduard Kejnovsky
Journal:  Bioinformatics       Date:  2020-12-22       Impact factor: 6.937

Review 6.  Burgeoning evidence indicates that microRNAs were initially formed from transposable element sequences.

Authors:  Justin T Roberts; Sara E Cardin; Glen M Borchert
Journal:  Mob Genet Elements       Date:  2014-05-22
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.