Naum I Gershenzon, Edward N Trifonov, Ilya P Ioshikhes.
Abstract
BACKGROUND: Experimental investigation of transcription is still a very labor- and time-consuming process. Only a few transcription initiation scenarios have been studied in detail. The mechanism of interaction between basal machinery and promoter, in particular core promoter elements, is not known for the majority of identified promoters. In this study, we reveal various transcription initiation mechanisms by statistical analysis of 3393 nonredundant Drosophila promoters.
Year: 2006 PMID: 16790048 PMCID: PMC1538597 DOI: 10.1186/1471-2164-7-161
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The parameters of core promoter elements. List of the core promoter elements (col. 1); motif consensus in NC-IUB nomenclature [56] (col. 2); the length of the motif (at left) and the distance between its center and 5' end (at right) (col. 3); applied windows for the center of motifs (col. 4); the maximal number of mismatches allowed for the motif consensus to remain functional (col. 5); cutoff value for the PWM (col. 6); the absolute number (col. 7) and percentage (col. 8) of promoters with the respective core element; statistical significance (SS) of the occurrence frequency of an element in the respective window (col. 9). All respective P-values are less than 0.0001, which is considered extremely statistically significant. The P-values were obtained using the P-Value Calculator [57] from the respective chi-square (χ²) values used for the SS calculation [51] for a system with 1 degree of freedom (DF = 1).
| Element | Consensus | Length / center to 5' end (bp) | Window for motif center | Max. mismatches | PWM cutoff | N | % | SS |
| TATA | TATAWAAR | 12/3 | -33 - -23 | 1 | 0.79 | 549 | 16.2 | 46.9 |
| Inr | TCAKTY | 12/3 | -1 - +9 | 1 | 0.70 | 2257 | 66.5 | 32.0 |
| DPE | RGWYV | 8/0 | +27 - +36 | 0 | 0.895 | 749 | 22.1 | 8.4 |
| MTE | CSARCSSAAC | 10/0 | +17 - +26 | 2 | 0.79 | 344 | 10.1 | 20.7 |
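As an illustration of how a consensus with a limited number of allowed mismatches (cols. 2 and 5) could be scanned against a promoter sequence, here is a minimal Python sketch. It is not the authors' implementation. The function names and the toy sequence are hypothetical, the search window is given as 0-based start indices of the motif's 5' end (a simplification of the TSS-relative windows for the motif center in col. 4), and the PWM cutoff of col. 6 is not modeled.

```python
# NC-IUB / IUPAC nucleotide codes mapped to the bases each code allows.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def count_mismatches(consensus, site):
    """Count positions where the base in `site` is not allowed by `consensus`."""
    return sum(base not in IUPAC[code] for code, base in zip(consensus, site))


def find_motif(sequence, consensus, max_mismatches, start_window):
    """Return (start, mismatches) for every hit whose 5' end lies in start_window.

    start_window is an inclusive pair of 0-based indices (an assumption of
    this sketch, not the paper's TSS-relative coordinates).
    """
    first, last = start_window
    k = len(consensus)
    hits = []
    for start in range(max(first, 0), min(last, len(sequence) - k) + 1):
        site = sequence[start:start + k].upper()
        mismatches = count_mismatches(consensus, site)
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits


# Toy sequence (hypothetical), scanned for the TATA consensus with 1 mismatch allowed.
toy = "GGGTATAAAAGGCC"
print(find_motif(toy, "TATAWAAR", 1, (0, len(toy))))  # → [(3, 0)]
```

Counting mismatches per position keeps the rule in col. 5 explicit; a site containing a base outside A, C, G, T is simply counted as a mismatch at that position.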
The pictograms of core promoter elements (TATA, Inr, DPE, MTE). [Pictogram images not available.]
The statistical parameters of combinations of core elements. Combination name (col. 1); position of the center of the first element of the combination in bp (col. 2); distance between the centers of the elements in bp, with the suggested synergetic distances marked in bold (col. 3); the percentage (%) (col. 4); the absolute number (N) (col. 5); statistical significance of over-representation of promoters having this combination at the respective positions with the distance given in col. 3 (col. 6); and respective P-values (col. 7). The P-values were calculated as for Table 1. P-values < 0.001 are commonly considered extremely statistically significant, and those < 0.01 very statistically significant.
| Combination | Position of first element center (bp) | Distance between centers (bp) | % | N | SS | P-value |
| Inr_DPE | -1 - +9 | 25 | 0.77 | 26 | -4.8 | |
| | | 26 | 1.33 | 45 | -3.7 | |
| | | | | | | <0.0001 |
| | | 28 | 1.03 | 35 | -4.8 | |
| | | 29 | 0.77 | 26 | -4.0 | |
| Inr_MTE | -1 - +9 | 15 | 0.41 | 14 | -3.0 | |
| | | 16 | 1.6 | 55 | 3.3 | |
| | | 17 | 3.0 | 101 | 10.0 | |
| | | 18 | 0.35 | 12 | -3.8 | |
| | | 19 | 0.64 | 22 | -1.3 | |
| | | | | | | <0.0001 |
| MTE_DPE | +17 - +26 | 9 | 0.09 | 3 | -2.1 | |
| | | | | | | <0.0001 |
| | | 11 | 0.32 | 11 | 0.7 | |
| | | 12 | 0.12 | 4 | -1.0 | |
| | | 13 | 0.06 | 2 | -0.3 | |
| TATA_Inr | -33 - -23 | 29 | 1.4 | 46 | -2.8 | |
| | | 30 | 2.4 | 83 | 1.8 | |
| | | 31 | 4.0 | 135 | 8.6 | |
| | | 32 | 4.8 | 163 | 13.9 | |
| | | 33 | 2.0 | 68 | 2.2 | |
| | | 34 | 1.7 | 58 | 1.7 | |
| | | 35 | 1.1 | 36 | -1.0 | |
| | | | | | | <0.0001 |
| TATA_DPE | -33 - -23 | 57 | 0.44 | 15 | -1.1 | |
| | | 58 | 0.80 | 27 | 2.1 | |
| | | 59 | 0.83 | 28 | 3.2 | |
| | | 60 | 0.56 | 19 | 1.7 | |
| | | 61 | 0.32 | 11 | 0.1 | |
| | | | | | | 0.0014 |
| TATA_MTE | -33 - -23 | 46 | 0.12 | 4 | -1.4 | |
| | | 47 | 0.44 | 15 | 2.3 | |
| | | 48 | 0.35 | 12 | 1.2 | |
| | | 49 | 0.41 | 14 | 2.0 | |
| | | 50 | 0.12 | 4 | -1.4 | |
| | | | | | | 0.0093 |
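For a system with 1 degree of freedom, the conversion from a chi-square value to a P-value has a closed form, since a chi-square variable with DF = 1 is the square of a standard normal variable. The Python sketch below shows that conversion and the verbal labels quoted in the caption. It is an illustration, not the P-Value Calculator [57] used in the study; the function names are hypothetical, and the chi-square inputs are made-up values, because the chi-square values behind the table are not given here.

```python
import math


def chi2_p_value_df1(chi2):
    """Upper-tail P-value of a chi-square statistic with 1 degree of freedom.

    Uses P(X > x) = erfc(sqrt(x / 2)), which holds for DF = 1 only.
    """
    if chi2 < 0:
        raise ValueError("chi-square statistic must be non-negative")
    return math.erfc(math.sqrt(chi2 / 2.0))


def significance_label(p):
    """Map a P-value to the verbal labels quoted in the table caption."""
    if p < 0.001:
        return "extremely statistically significant"
    if p < 0.01:
        return "very statistically significant"
    return "no label given in the caption"


# Hypothetical chi-square inputs, for illustration only.
for chi2 in (3.841, 8.0, 20.0):
    p = chi2_p_value_df1(chi2)
    print(f"chi2 = {chi2:6.3f}  P = {p:.2e}  {significance_label(p)}")
```

With this formula a chi-square value of about 15.1 corresponds to P ≈ 0.0001 at DF = 1.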
Figure 1. The number of promoters N having one of the combinations of two elements at the respective synergetic/cooperative distances L (where statistical significance is meaningful).
The pictograms and consensuses of overrepresented motifs. The number in parentheses in the first column is the number of the corresponding overrepresented motif in [22].
| Motif | Pictogram | Consensus |
| 1(1) | | YGGYCACACT |
| 2(7) | | MCAKCHCTRR |
| 3(2) | | HATCGATA |
| 4(5) | | CAGCTGHT |
| 5(6) | | TYRGTATTTY |
| 6 | | TTKTKTTT |
| 7 | | MAAARYRAAA |
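The consensuses above use degenerate nucleotide codes, so each one stands for a set of concrete sequences. As a rough illustration (not part of the original analysis), the Python sketch below expands a consensus into a regular expression and counts how many exact sequences it covers; the function names are hypothetical.

```python
import re
from math import prod

# NC-IUB / IUPAC nucleotide codes mapped to the bases each code allows.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def consensus_to_regex(consensus):
    """Turn a degenerate consensus into a regex pattern string."""
    parts = []
    for code in consensus:
        bases = IUPAC[code]
        # Single bases stay literal; degenerate codes become character classes.
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def degeneracy(consensus):
    """Number of exact sequences that match the consensus."""
    return prod(len(IUPAC[code]) for code in consensus)


for motif in ("HATCGATA", "TTKTKTTT", "MAAARYRAAA"):
    print(motif, consensus_to_regex(motif), degeneracy(motif))

# An exact site is tested with re.fullmatch against the expanded pattern.
print(bool(re.fullmatch(consensus_to_regex("CAGCTGHT"), "CAGCTGAT")))  # → True
```

A regex covers only exact matches to the consensus; it does not allow mismatches.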