Literature DB >> 12176934

Automated de novo identification of repeat sequence families in sequenced genomes.

Zhirong Bao1, Sean R Eddy.   

Abstract

Repetitive sequences make up a major part of eukaryotic genomes. We have developed an approach for the de novo identification and classification of repeat sequence families that is based on extensions to the usual approach of single linkage clustering of local pairwise alignments between genomic sequences. Our extensions use multiple alignment information to define the boundaries of individual copies of the repeats and to distinguish homologous but distinct repeat element families. When tested on the human genome, our approach was able to properly identify and group known transposable elements. The program, should be useful for first-pass automatic classification of repeats in newly sequenced genomes.

Entities:  

Mesh:

Year:  2002        PMID: 12176934      PMCID: PMC186642          DOI: 10.1101/gr.88502

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  15 in total

1.  The Pfam protein families database.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; K L Howe; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

3.  DNA sequencing. Genome teams adjust to shotgun marriage.

Authors:  E Marshall
Journal:  Science       Date:  2001-06-15       Impact factor: 47.728

4.  Human transaldolase-associated repetitive elements are transcribed by RNA polymerase III.

Authors:  A Perl; E Colombo; E Samoilova; M C Butler; K Banki
Journal:  J Biol Chem       Date:  2000-03-10       Impact factor: 5.157

5.  The significance of responses of the genome to challenge.

Authors:  B McClintock
Journal:  Science       Date:  1984-11-16       Impact factor: 47.728

6.  Free left arms as precursor molecules in the evolution of Alu sequences.

Authors:  J Jurka; E Zuckerkandl
Journal:  J Mol Evol       Date:  1991-07       Impact factor: 2.395

7.  The Repeat Pattern Toolkit (RPT): analyzing the structure and evolution of the C. elegans genome.

Authors:  P Agarwal; D J States
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1994

8.  Modular arrangement of proteins as inferred from analysis of homology.

Authors:  E L Sonnhammer; D Kahn
Journal:  Protein Sci       Date:  1994-03       Impact factor: 6.725

9.  Selfish genes, the phenotype paradigm and genome evolution.

Authors:  W F Doolittle; C Sapienza
Journal:  Nature       Date:  1980-04-17       Impact factor: 49.962

10.  Selfish DNA: the ultimate parasite.

Authors:  L E Orgel; F H Crick
Journal:  Nature       Date:  1980-04-17       Impact factor: 49.962

View more
  327 in total

1.  MAX, a novel retrotransposon of the BEL-Pao family, is nested within the Bari1 cluster at the heterochromatic h39 region of chromosome 2 in Drosophila melanogaster.

Authors:  R M Marsano; S Marconi; R Moschetti; P Barsanti; C Caggese; R Caizzi
Journal:  Mol Genet Genomics       Date:  2003-11-21       Impact factor: 3.291

2.  Identification of long intergenic repeat sequences associated with DNA methylation sites in Caulobacter crescentus and other alpha-proteobacteria.

Authors:  Swaine L Chen; Lucy Shapiro
Journal:  J Bacteriol       Date:  2003-08       Impact factor: 3.490

3.  De novo repeat classification and fragment assembly.

Authors:  Pavel A Pevzner; Paul A Pevzner; Haixu Tang; Glenn Tesler
Journal:  Genome Res       Date:  2004-09       Impact factor: 9.043

4.  In search of lost trajectories: Recovering the diversification of transposable elements.

Authors:  Timothée Flutre; Emmanuelle Permal; Hadi Quesneville
Journal:  Mob Genet Elements       Date:  2011-07-01

Review 5.  A beginner's guide to eukaryotic genome annotation.

Authors:  Mark Yandell; Daniel Ence
Journal:  Nat Rev Genet       Date:  2012-04-18       Impact factor: 53.242

6.  Genome-wide identification of cis-regulatory motifs and modules underlying gene coregulation using statistics and phylogeny.

Authors:  Hervé Rouault; Khalil Mazouni; Lydie Couturier; Vincent Hakim; François Schweisguth
Journal:  Proc Natl Acad Sci U S A       Date:  2010-07-29       Impact factor: 11.205

7.  Conservation and divergence of methylation patterning in plants and animals.

Authors:  Suhua Feng; Shawn J Cokus; Xiaoyu Zhang; Pao-Yang Chen; Magnolia Bostick; Mary G Goll; Jonathan Hetzel; Jayati Jain; Steven H Strauss; Marnie E Halpern; Chinweike Ukomadu; Kirsten C Sadler; Sriharsa Pradhan; Matteo Pellegrini; Steven E Jacobsen
Journal:  Proc Natl Acad Sci U S A       Date:  2010-04-15       Impact factor: 11.205

8.  Imogene: identification of motifs and cis-regulatory modules underlying gene co-regulation.

Authors:  Hervé Rouault; Marc Santolini; François Schweisguth; Vincent Hakim
Journal:  Nucleic Acids Res       Date:  2014-03-25       Impact factor: 16.971

9.  Degradation of the Repetitive Genomic Landscape in a Close Relative of Caenorhabditis elegans.

Authors:  Gavin C Woodruff; Anastasia A Teterina
Journal:  Mol Biol Evol       Date:  2020-09-01       Impact factor: 16.240

10.  The nuclear genome of Brachypodium distachyon: analysis of BAC end sequences.

Authors:  Naxin Huo; Gerard R Lazo; John P Vogel; Frank M You; Yaqin Ma; Daniel M Hayden; Devin Coleman-Derr; Theresa A Hill; Jan Dvorak; Olin D Anderson; Ming-Cheng Luo; Yong Q Gu
Journal:  Funct Integr Genomics       Date:  2007-11-06       Impact factor: 3.410

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.