
Computational detection and location of transcription start sites in mammalian genomic DNA.

Thomas A Down, Tim J P Hubbard.

Abstract

Transcription, the process whereby RNA copies are made from sections of the DNA genome, is directed by promoter regions. These define the transcription start site, and also the set of cellular conditions under which the promoter is active. At least in more complex species, it appears to be common for genes to have several different transcription start sites, which may be active under different conditions. Eukaryotic promoters are complex and fairly diffuse structures, which have proven hard to detect in silico. We show that a novel hybrid machine-learning method is able to build useful models of promoters for >50% of human transcription start sites. We estimate specificity to be >70%, and demonstrate good positional accuracy. Based on the structure of our learned models, we conclude that a signal resembling the well-known TATA box and flanking regions of C-G enrichment are the most important sequence-based signals marking sites of transcriptional initiation at a large class of typical promoters.

Year: 2002   PMID: 11875034   PMCID: PMC155284   DOI: 10.1101/gr.216102

Source DB: PubMed   Journal: Genome Res   ISSN: 1088-9051   Impact factor: 9.043


