| Literature DB >> 21745829 |
Yoshiharu Y Yamamoto1, Yohei Yoshioka, Mitsuro Hyakumachi, Junichi Obokata.
Abstract
It is now well known that vertebrates use multiple types of core promoter to accomplish differentiated tasks in Pol II-dependent transcription. Several transcriptional characteristics are known to be associated with core types, including distribution patterns of transcription start sites (TSSs) and selection between tissue-specific and constitutive expression profiles. However, their relationship to gene structure is poorly understood. In this report, we carried a comparative analysis of three Arabidopsis core types, TATA, GA, and Coreless, with regard to gene structure. Our genome-wide investigation was based on the peak TSS positions in promoters that had been identified in a large-scale experimental analysis. This analysis revealed that the types of core promoter are related with the room for promoters that is measured as the distance from the TSS to the end of the upstream gene, the distance from the TSS to the start position of the coding sequence (CDS), and the number and species of the cis-regulatory elements. Of these, it was found that the distance from the TSS to the CDS has a tight, inverse correlation to the expression level, and thus the observed relationship to the core type appears to be indirect. However, promoter length and preference of cis-elements are thought to be a direct reflection of core type-specific transcriptional initiation mechanisms.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21745829 PMCID: PMC3190954 DOI: 10.1093/dnares/dsr020
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 5.Regulated expression and core promoter type. (A and B) The ratio of the core promoter types (TATA, GA, and Coreless) in relation to the response to ABA treatment or to wounding is shown. All the promoters were aligned according to their response, and the mean core ratio was calculated with a bin of 51 (grey) and 201 (black) promoters each. 1.0 on the horizontal axis indicates no response to the treatments. (C and D) The promoter ratio of the TATA and Coreless type is shown in relation to the type of gene response. Significantly different value from All, judged by Fisher's exact test, is shown with an asterisk.
Core promoter type and octamer preference
| Octamer | Core: TATA | Core: GA | Coreless | Total | |||
|---|---|---|---|---|---|---|---|
| Over-represented | Under-represented | Over-represented | Under-represented | Over-represented | Under-represented | ||
| CAa | 2 | 0 | 0 | 0 | 0 | 3 | 5 |
| GAa | 0 | 15 | 19 | 0 | 0 | 0 | 20 |
| TATAa | 86 | 0 | 0 | 3 | 0 | 90 | 137 |
| REGa | 9 | 118 | 20 | 5 | 14 | 9 | 614 |
| High RARfb | 196 | 184 | 73 | 105 | 177 | 102 | 7037 |
The number of octamer sequences with significantly over- or under-represented appearance ratios (whatever the values are) is shown.
aLDSS element. REG is a group of position-dependent sequences that are suggested to be transcriptional regulatory sequences.
bOther types of putative transcriptional regulatory sequences predicted from microarray data of ABA, auxin, BL, CK, ethylene, JA, SA, H2O2, drought, or DREB1Aox (RARf > 3.0). Core-related elements judged by LDSS analysis, including weak TATA elements and unidentified elements that show localized distribution (P < 0.05 under assumption of random distribution) with a peak position between −50 and +50 relative to the peak TSS, were removed from High RARf. The remaining sequences include some of the REGs, weak REG-like sequences (P < 0.05, peak position between −200 and −40) that had not been identified in our previous report,[16] and position-independent putative regulatory elements.
Preferential distribution of high RARf octamers among three core-promoter types
| Core type | Over-represented | Under-represented |
|---|---|---|
| TATA-specific | 193 | 183 |
| GA-specific | 70 | 104 |
| Coreless-specific | 177 | 102 |
| TATA and GA | 3 | 1 |
| TATA and Coreless | 0 | 0 |
| GA and Coreless | 0 | 0 |
| TATA, GA, and Coreless | 0 | 0 |
The numbers of high RARf-octamers (putative transcriptional regulatory sequences) that are biased in each promoter type are shown. LDSS-positive core-related octamers are removed.
Figure 1.Promoter length and core promoter type. Promoters were divided into two groups according to the orientation of the upstream gene, and for each group, distribution of promoter length from the major TSS to the end of the upstream gene, defined by the gene model, is shown with respect to the core promoter types. The diagram in the graph indicates the direction of the genes, and the black arrow indicates the gene whose promoter length is analysed. Coreless promoters are identified as TATA-, GA-, Y-, and also CA-negative. (A) Distribution of promoter length is shown. The vertical axis indicates the ratio among promoters with a core type, and sum of gene ratios for Coreless, All, and TATA are 1.0, respectively. (B) The median promoter length is shown for each core promoter type. Statistically distinguished groups are labelled with different alphabetical letters over the bars.
Figure 2.Promoter length and core promoter type in rice. The median promoter length in the rice genome is shown for each core promoter type. The diagram in the graph indicates the direction of genes, and the black arrow is the gene whose promoter length is analysed. Statistically distinguished groups are labelled with different alphabetical letters over the bars.
Figure 3.Promoter length and expression level. (A) Promoter ratio according to expression ratio is shown. Tpm: tag per million in a TSS tag library.[12] (B) For each combination of gene orientation, promoters were divided into three groups each according to their expression level. The promoter length of each category is shown. Statistically distinguished groups are labelled with different alphabetical letters over the bars.
Figure 4.Distance from TSS to CDS is tightly associated with expression level. (A) The distance from the major TSS to the downstream translation start site is shown for the core types. (B) The distance from the TSS to the CDS is shown in relation to the shape of the TSS cluster. (C) The distance from the TSS to the CDS is shown in relation to the expression level (tpm). (D) For each core type, the expression level (tpm) is shown. Statistically distinguished groups are labelled with different alphabetical letters over the bars.
Figure 6.REG density and core promoter type. The distribution of REG according to the promoter position is shown for each core promoter type.
Preferential distribution of REG octamers among three core-promoter types
| Core type | Over-represented | Under-represented |
|---|---|---|
| TATA-specific | 8 | 115 |
| GA-specific | 19 | 3 |
| Coreless-specific | 14 | 8 |
| TATA and GA | 1 | 2 |
| TATA and Coreless | 0 | 1 |
| GA and Coreless | 0 | 0 |
| TATA, GA, and Coreless | 0 | 0 |
The number of REGs (putative transcriptional regulatory sequences) that are biased in each promoter type is shown.
Characteristics of core promoter types
| TATA | GA | Coreless | |
|---|---|---|---|
| Promoter length | Long | — | Short |
| Distance from TSS to CDS | Short | Short | Long |
| REG density | Low | High | High |
| Expression level | Higha | Higha | Lowa |
| Expression profile | Regulateda | — | Constitutivea |
A summary of this work is shown.
aAlso supported by Yamamoto et al.[12]