Literature DB >> 7584377

The Repeat Pattern Toolkit (RPT): analyzing the structure and evolution of the C. elegans genome.

P Agarwal1, D J States.   

Abstract

Over 3.6 million bases of DNA sequence from chromosome III of the C. elegans have been determined. The availability of this extended region of contiguous sequence has allowed us to analyze the nature and prevalence of repetitive sequences in the genome of a eukaryotic organism with a high gene density. We have assembled a Repeat Pattern Toolkit (RPT) to analyze the patterns of repeats occurring in DNA. The tools include identifying significant local alignments (utilizing both two-way and three-way alignments), dividing the set of alignments into connected components (signifying repeat families), computing evolutionary distance between repeat family members, constructing minimum spanning trees from the connected components, and visualizing the evolution of the repeat families. Over 7000 families of repetitive sequences were identified. The size of the families ranged from isolated pairs to over 1600 segments of similar sequence. Approximately 12.3% of the analyzed sequence participates in a repeat element.

Entities:  

Mesh:

Year:  1994        PMID: 7584377

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  8 in total

1.  Automated de novo identification of repeat sequence families in sequenced genomes.

Authors:  Zhirong Bao; Sean R Eddy
Journal:  Genome Res       Date:  2002-08       Impact factor: 9.043

2.  De novo repeat classification and fragment assembly.

Authors:  Pavel A Pevzner; Paul A Pevzner; Haixu Tang; Glenn Tesler
Journal:  Genome Res       Date:  2004-09       Impact factor: 9.043

3.  Identification of transposable elements using multiple alignments of related genomes.

Authors:  Anat Caspi; Lior Pachter
Journal:  Genome Res       Date:  2005-12-14       Impact factor: 9.043

4.  Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes.

Authors:  Mateusz Janicki; Rebecca Rooke; Guojun Yang
Journal:  Chromosome Res       Date:  2011-08       Impact factor: 4.620

5.  HAMSTER: visualizing microarray experiments as a set of minimum spanning trees.

Authors:  Raymond Wan; Larisa Kiseleva; Hajime Harada; Hiroshi Mamitsuka; Paul Horton
Journal:  Source Code Biol Med       Date:  2009-11-20

6.  A clustering method for repeat analysis in DNA sequences.

Authors:  N Volfovsky; B J Haas; S L Salzberg
Journal:  Genome Biol       Date:  2001-08-01       Impact factor: 13.583

Review 7.  Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning.

Authors:  Simon Orozco-Arias; Gustavo Isaza; Romain Guyot
Journal:  Int J Mol Sci       Date:  2019-08-06       Impact factor: 5.923

8.  A sensitive repeat identification framework based on short and long reads.

Authors:  Xingyu Liao; Min Li; Kang Hu; Fang-Xiang Wu; Xin Gao; Jianxin Wang
Journal:  Nucleic Acids Res       Date:  2021-09-27       Impact factor: 16.971

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.