Literature DB >> 28130343

Finding the start site: redefining the human initiator element.

Jennifer F Kugel1, James A Goodrich1.   

Abstract

Transcription by RNA polymerase II (Pol II) is dictated in part by core promoter elements, which are DNA sequences flanking the transcription start site (TSS) that help direct the proper initiation of transcription. Taking advantage of recent advances in genome-wide sequencing approaches, Vo ngoc and colleagues (pp. 6-11) identified transcripts with focused sites of initiation and found that many were transcribed from promoters containing a new consensus sequence for the human initiator (Inr) core promoter element.
© 2017 Kugel and Goodrich; Published by Cold Spring Harbor Laboratory Press.

Entities:  

Keywords:  RNA polymerase II; core promoter; focused transcription; initiator; transcription start site

Mesh:

Substances:

Year:  2017        PMID: 28130343      PMCID: PMC5287107          DOI: 10.1101/gad.295980.117

Source DB:  PubMed          Journal:  Genes Dev        ISSN: 0890-9369            Impact factor:   11.361


Defining the proper initiation of RNA polymerase II (Pol II) transcription requires a complex interplay of proteins, DNA elements, and RNA that work together to dictate where on the genome transcription begins. This entails the regulated assembly of large multisubunit nucleoprotein complexes containing Pol II and many accessory factors; the platform for forming these large complexes is the core promoter. The core promoter in human genes is the region from −40 to +40 and flanks the transcription start site (TSS) at +1. Although no single core promoter element is contained in all human promoters, many contain one or more of the following core elements (Fig. 1): the TATA box, initiator (Inr), TFIIB recognition elements (BREu and BREd), polypyrimidine initiator (TCT), motif ten element (MTE), and downstream core promoter element (DPE) (for review, see Danino et al. 2015). Of these, the Inr element encompasses the TSS and is thought to be the most common core promoter element, with previous studies estimating that ∼50% of human core promoters contain an Inr (Gershenzon and Ioshikhes 2005; Yang et al. 2007). The commonly used consensus sequence for the human Inr, which was derived from mutational analyses, is YYANWYY from −2 to +5 (where, Y = C/T, W = A/T, N = A/C/G/T, and +1 is underlined) (Javahery et al. 1994; Lo and Smale 1996). More recently, analysis of genome-wide CAGE (cap analysis gene expression) data led to the considerably shorter Inr consensus of YR from −1 to +1 (where, R = A/G, and +1 is underlined) (Carninci et al. 2006; Frith et al. 2008). Other studies have also defined somewhat different consensus sequences for the Inr; however, all have an A at +1 in common (for review, see Kadonaga 2012).
Figure 1.

Relative locations of select human core promoter elements and the Inr consensus sequence found in promoters with focused TSSs. The promoter elements depicted include BREu (the upstream TFIIB recognition element), TATA (the TATA box), BREd (the downstream TFIIB recognition element), Inr (new consensus sequence shown), MTE, and DPE.

Relative locations of select human core promoter elements and the Inr consensus sequence found in promoters with focused TSSs. The promoter elements depicted include BREu (the upstream TFIIB recognition element), TATA (the TATA box), BREd (the downstream TFIIB recognition element), Inr (new consensus sequence shown), MTE, and DPE. Kadonaga and colleagues (Vo ngoc et al. 2017) devised and implemented a novel multistep approach that combines experimental and computational methods to reinvestigate the human Inr consensus sequence. First, they generated two 5′-GRO-seq (5′ end-selected global run-on followed by sequencing) libraries with human MCF-7 cells to identify the 5′ ends of nascent capped transcripts. Second, they developed a peak-calling algorithm named FocusTSS to find transcripts in the 5′-GRO-seq data sets that were initiated at a focused position on the genome, hence identifying clear TSSs to enable analysis of Inr sequences. FocusTSS identified 7678 TSSs that were in both data sets. Third, to identify sequence motifs enriched among the focused TSSs, they used the HOMER motif discovery tool (Heinz et al. 2010), which yielded an Inr-like consensus sequence of BBCABW from −3 to +3 (where, B = C/G/T, W = A/T, and +1 is underlined). Forty percent of the focused TSSs contained a perfect match to the BBCABW consensus Inr. Similar computational analyses performed with data sets from three other human cell lines yielded the same Inr consensus sequence. Interestingly, their analyses also revealed that Inr-containing promoters are less likely to have a TATA box than promoters lacking an Inr and that there is no correlation between the presence of BBCABW Inr elements and CpG islands. The importance of the sequence at individual positions in the BBCABW consensus Inr sequence was tested using in vitro transcription assays (Vo ngoc et al. 2017). Two native core promoters that each contained a consensus Inr were used, and single-point mutations were made at each position from −3 to +3 that took the sequence away from consensus. The sequences at positions −1 to +3 were the most important for setting levels of basal transcription, with mutations at +1 and +3 showing the largest reductions in transcription levels. In addition, 12 natural core promoters were chosen that each differed from consensus at one position; these positions were mutated to create the Inr consensus. Mutating positions −1 to +3 toward consensus increased transcriptional activity, and, again, the mutations at positions +1 and +3 had the greatest effect. This work provides a substantial step forward in understanding core promoter sequences, establishes a new approach to defining TSSs, and raises many interesting questions that will guide future research. For example, although the Inr is enriched at promoters with focused transcriptional start sites, it is also found randomly distributed throughout the genome. Hence, a consensus Inr alone does not constitute a promoter. The data also showed that promoters with consensus Inr sequences are relatively deficient in TATA boxes. It will be interesting to determine the interplay between other core promoter elements and the Inr at promoters with focused TSSs. Although this work defines a clear correlation between the presence of consensus Inr sequences and focused TSSs, the extent to which the Inr itself causes start sites to be focused remains to be determined. In addition, the role of specific Inr positions in controlling cellular transcription warrants further investigation. For example, C−1 and A+1 were found most frequently in Inr sequences identified in cells, but mutating C−1 away from consensus did not have a strong effect on transcription in vitro. The investigators suggest there is an additional constraint for the use of C−1 in cells. Many of the questions raised by this study could be answered by changing the sequences of core promoters in the human genome to determine the effects on the position of the TSS, level of transcription, and occupancy of factors at the core promoter. Finally, this work was limited to the analysis of core promoters with focused TSSs. Although much more complicated, it will be important to extend this new approach to promoters with nonfocused start sites to investigate whether such promoters contain Inr elements. This study illustrates that, despite years of research, much remains to be learned about core promoters and how they set start site positions and levels of transcription at human genes.
  10 in total

1.  Synergy of human Pol II core promoter elements revealed by statistical sequence analysis.

Authors:  Naum I Gershenzon; Ilya P Ioshikhes
Journal:  Bioinformatics       Date:  2004-11-30       Impact factor: 6.937

2.  A code for transcription initiation in mammalian genomes.

Authors:  Martin C Frith; Eivind Valen; Anders Krogh; Yoshihide Hayashizaki; Piero Carninci; Albin Sandelin
Journal:  Genome Res       Date:  2007-11-21       Impact factor: 9.043

3.  Generality of a functional initiator consensus sequence.

Authors:  K Lo; S T Smale
Journal:  Gene       Date:  1996-12-05       Impact factor: 3.688

Review 4.  The core promoter: At the heart of gene expression.

Authors:  Yehuda M Danino; Dan Even; Diana Ideses; Tamar Juven-Gershon
Journal:  Biochim Biophys Acta       Date:  2015-04-28

Review 5.  Perspectives on the RNA polymerase II core promoter.

Authors:  James T Kadonaga
Journal:  Wiley Interdiscip Rev Dev Biol       Date:  2011-12-06       Impact factor: 5.814

6.  Genome-wide analysis of mammalian promoter architecture and evolution.

Authors:  Piero Carninci; Albin Sandelin; Boris Lenhard; Shintaro Katayama; Kazuro Shimokawa; Jasmina Ponjavic; Colin A M Semple; Martin S Taylor; Pär G Engström; Martin C Frith; Alistair R R Forrest; Wynand B Alkema; Sin Lam Tan; Charles Plessy; Rimantas Kodzius; Timothy Ravasi; Takeya Kasukawa; Shiro Fukuda; Mutsumi Kanamori-Katayama; Yayoi Kitazume; Hideya Kawaji; Chikatoshi Kai; Mari Nakamura; Hideaki Konno; Kenji Nakano; Salim Mottagui-Tabar; Peter Arner; Alessandra Chesi; Stefano Gustincich; Francesca Persichetti; Harukazu Suzuki; Sean M Grimmond; Christine A Wells; Valerio Orlando; Claes Wahlestedt; Edison T Liu; Matthias Harbers; Jun Kawai; Vladimir B Bajic; David A Hume; Yoshihide Hayashizaki
Journal:  Nat Genet       Date:  2006-04-28       Impact factor: 38.330

7.  DNA sequence requirements for transcriptional initiator activity in mammalian cells.

Authors:  R Javahery; A Khachi; K Lo; B Zenzie-Gregory; S T Smale
Journal:  Mol Cell Biol       Date:  1994-01       Impact factor: 4.272

8.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities.

Authors:  Sven Heinz; Christopher Benner; Nathanael Spann; Eric Bertolino; Yin C Lin; Peter Laslo; Jason X Cheng; Cornelis Murre; Harinder Singh; Christopher K Glass
Journal:  Mol Cell       Date:  2010-05-28       Impact factor: 17.970

9.  Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters.

Authors:  Chuhu Yang; Eugene Bolotin; Tao Jiang; Frances M Sladek; Ernest Martinez
Journal:  Gene       Date:  2006-10-10       Impact factor: 3.688

10.  The human initiator is a distinct and abundant element that is precisely positioned in focused core promoters.

Authors:  Long Vo Ngoc; California Jack Cassidy; Cassidy Yunjing Huang; Sascha H C Duttke; James T Kadonaga
Journal:  Genes Dev       Date:  2017-01-20       Impact factor: 11.361

  10 in total
  6 in total

Review 1.  HIV-1 Natural Antisense Transcription and Its Role in Viral Persistence.

Authors:  Rui Li; Rachel Sklutuis; Jennifer L Groebner; Fabio Romerio
Journal:  Viruses       Date:  2021-04-29       Impact factor: 5.048

2.  Genome annotation across species using deep convolutional neural networks.

Authors:  Ghazaleh Khodabandelou; Etienne Routhier; Julien Mozziconacci
Journal:  PeerJ Comput Sci       Date:  2020-06-15

3.  High-throughput functional analysis of lncRNA core promoters elucidates rules governing tissue specificity.

Authors:  Marta Melé; John L Rinn; Kaia Mattioli; Pieter-Jan Volders; Chiara Gerhardinger; James C Lee; Philipp G Maass
Journal:  Genome Res       Date:  2019-01-25       Impact factor: 9.043

4.  Core Element Cloning, Cis-Element Mapping and Serum Regulation of the Human EphB4 Promoter: A Novel TATA-Less Inr/MTE/DPE-Like Regulated Gene.

Authors:  Pierluigi Scalia; Stephen J Williams; Antonio Giordano
Journal:  Genes (Basel)       Date:  2019-12-02       Impact factor: 4.096

5.  A unified view of the sequence and functional organization of the human RNA polymerase II promoter.

Authors:  Donal S Luse; Mrutyunjaya Parida; Benjamin M Spector; Kyle A Nilson; David H Price
Journal:  Nucleic Acids Res       Date:  2020-08-20       Impact factor: 16.971

6.  Initiator-Directed Transcription: Fission Yeast Nmtl Initiator Directs Preinitiation Complex Formation and Transcriptional Initiation.

Authors:  Diego A Rojas; Fabiola Urbina; Lucía Valenzuela-Pérez; Lorenzo Leiva; Vicente J Miralles; Edio Maldonado
Journal:  Genes (Basel)       Date:  2022-01-28       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.