| Literature DB >> 19128496 |
Abstract
BACKGROUND: The promoter is a critical necessary transcriptional cis-regulatory element. In addition to its role as an assembly site for the basal transcriptional apparatus, the promoter plays a key part in mediating temporal and spatial aspects of gene expression through differential binding of transcription factors and selective interaction with distal enhancers. Although many genes have multiple promoters, little attention has been focused on how these relate to one another; nor has much study been directed at relationships between promoters of adjacent genes.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19128496 PMCID: PMC2631479 DOI: 10.1186/1471-2164-10-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Promoter sets used for this study. (A) Schematic view of the three types of promoters: unique promoters (UPs), first alternative promoters (FAP), and downstream alternative promoters (DAP). (B) Number of promoters in each class when using the "all promoters" data set. (C) Number of promoters in each class when using the "high quality" data set. (D) Number of promoters in each class when using the "cap supported" data set.
Figure 2The mononucleotide distribution of the three different classes of fly promoters. Shown are unique promoters (black), first alternative promoters (red), the downstream alternative promoters (green). The mean frequency of each nucleotide was calculated in a 10 bp window sliding across the promoter region in 1 bp steps.
Presence of the 15 fly promoter motifs in the three promoter classes
| Occurrence percentagea | p valueb | |||||
| Motif | UPs | FAPs | DAPs | UPs vs. FAPs | UPs vs. DAPs | FAPs vs. DAPs |
| TATA/DMp1 | 5.77 | 1.13 | 0.95 | 9.73e-24 | 5.21e-36 | ns |
| INR/DMp2 | 10.22 | 11.97 | 12.58 | ns | 3.36e-04 | ns |
| INR1/DMp3 | 0.52 | 1.43 | 0.74 | 4.54e-05 | ns | ns |
| DPE/DMp4 | 0.69 | 0.92 | 1.02 | ns | ns | ns |
| DPE1/DMp5 | 0.39 | 1.07 | 0.81 | 3.32e-04 | ns | ns |
| DMv1 | 1.56 | 2.46 | 0.70 | ns | 2.31e-04 | 6.23e-07 |
| DMv2 | 0.85 | 0.87 | 0 | ns | 7.26e-10 | 2.17e-07 |
| DMv3 | 3.17 | 3.48 | 1.65 | ns | 4.46e-06 | 6.86e-05 |
| DMv4 | 3.39 | 5.58 | 1.23 | 7.10e-06 | 2.69e-11 | 5.58e-18 |
| DMv5 | 1.41 | 0.97 | 0.88 | ns | ns | ns |
| GAGA/NDM1 | 1.23 | 2.86 | 4.87 | 4.14e-07 | 2.68e-29 | 4.55e-04 |
| NDM2 | 2.92 | 4.76 | 6.03 | 4.87e-05 | 3.87e-14 | ns |
| NDM3 | 1.11 | 0.97 | 0 | ns | 8.40e-13 | 3.55e-08 |
| DRE/NDM4 | 8.34 | 14.22 | 4.59 | 3.16e-15 | 1.04e-12 | 2.35e-31 |
| E-box/NDM5 | 5.44 | 4.45 | 0 | ns | 2.73e-62 | 3.07e-35 |
| Nonec | 61.81 | 54.78 | 69.90 | 5.27e-09 | 4.73e-16 | 1.61e-26 |
apercentage of the promoters in a class having the specific motif.
bcalculated using Fisher's exact test. Holm's method was used for multiple hypothesis testing. ns, not significant.
cpercentage of promoters without any known motifs.
Presence of significant motif interactions in the three fly promoter classes
| Occurrence percentagea | p valueb | |||||
| Motif interaction | UPs | FAPs | DAPs | UPs vs. FAPs | UPs vs. DAPs | FAPs vs. DAPs |
| TATA_INR/DMp1_DMp2 | 1.03 | 0.36 | 0.07 | ns | 1.57e-09 | ns |
| INR_GAGA/DMp2_NDM1 | 0.28 | 0.82 | 0.91 | 1.43e-03 | 1.92e-05 | ns |
| INR_NDM2/DMp2_NDM2 | 0.56 | 1.13 | 1.65 | ns | 5.84e-08 | ns |
| DMv1_DRE/DMv1_NDM4 | 0.25 | 0.87 | 0.18 | 1.20e-04 | ns | 6.83e-04 |
| DMv3_DRE/DMv3_NDM4 | 1.30 | 2.10 | 0.56 | ns | 5.66e-04 | 1.98e-06 |
| DRE_E-box/NDM4_NDM5 | 0.97 | 0.77 | 0 | ns | 2.95e-11 | 1.33e-06 |
apercentage of the promoters in a class having the specific motif interaction.
bcalculated using Fisher's exact test. Holm's method was used for multiple hypothesis testing. ns, not significant.
Presence of the eight human promoter motifs in the three promoter classes
| Occurrence percentagea | p valueb | |||||
| Motif | UPs | FAPs | DAPs | UPs vs. FAPs | UPs vs. DAPs | FAPs vs. DAPs |
| CCAAT | 12.25 | 10.16 | 9.78 | ns | ns | ns |
| SP1 | 37.91 | 34.35 | 30.69 | ns | 1.33e-04 | ns |
| CLUS1 | 1.82 | 0.81 | 0.50 | ns | ns | ns |
| USF | 2.76 | 1.94 | 1.61 | ns | ns | ns |
| CREB | 3.02 | 2.42 | 3.47 | ns | ns | ns |
| TATA | 6.60 | 1.94 | 3.22 | 6.97e-07 | 1.47e-04 | ns |
| NRF-1 | 10.23 | 3.71 | 3.34 | 2.65e-08 | 1.89e-11 | ns |
| ETS | 17.28 | 5.97 | 6.44 | 1.07e-14 | 1.26e-16 | ns |
| Nonec | 33.59 | 51.77 | 53.59 | 3.75e-17 | 9.32e-25 | ns |
apercentage of the promoters in a class having the specific motif.
bcalculated using Fisher's exact test. Holm's method was used for multiple hypothesis testing. ns, not significant.
cpercentage of promoters without any known motifs.
Motif similarity between the first and second promoters of the same genea
| Similarb | Intermediateb | Differentb | # of used pairs | # of omitted pairs | |||||||||||
| allc | high qualityd | cape | all | high quality | cap | all | high quality | cap | all | high quality | cap | all | high quality | cap | |
| Observed value (shared motifs allowed) | 34.07 | 31.52 | 36.04 | 17.70 | 16.30 | 23.42 | 48.23 | 52.17 | 40.54 | 226 | 92 | 111 | 1182 | 445 | 278 |
| Observed value (shared motifs not allowed) | 12.65 | 11.59 | 15.19 | 15.66 | 15.94 | 21.52 | 71.69 | 72.46 | 63.29 | 166 | 69 | 79 | 1242 | 468 | 310 |
| Random meanf | 9.50 | 8.34 | 8.67 | 10.83 | 12.51 | 14.31 | 79.66 | 79.15 | 77.03 | 226 | 92 | 111 | 1182 | 445 | 278 |
| p-valueg (random >= observed value, shared motifs allowed) | 0 | 0 | 0 | 2e-04 | 0.1399 | 0.0021 | 1 | 1 | 1 | -- | -- | -- | -- | -- | -- |
| p-valueg (random <= observed value, shared motifs allowed) | 1 | 1 | 1 | 1 | 0.9227 | 0.9991 | 0 | 0 | 0 | -- | -- | -- | -- | -- | -- |
| p-valueg (random >= observed value, shared motifs not allowed) | 0.0438 | 0.1286 | 0.0078 | 0.0041 | 0.1399 | 0.0108 | 0.9991 | 0.9622 | 1 | -- | -- | -- | -- | -- | -- |
| p-valueg (random <= observed value, shared motifs not allowed) | 0.9562 | 0.8714 | 0.9922 | 0.9959 | 0.8601 | 0.9892 | 9e-04 | 0.0378 | 0 | -- | -- | -- | -- | -- | -- |
agenes with exactly two alternative promoters. Similar results were obtained for genes with multiple alternative promoters (data not shown).
bpercentage of the alternative promoter pairs defined as similar, intermediate, and different (see Methods).
cThe promoter set contains all promoters from Drosophila genome annotation release 5.5.
dThe promoter set contains only high quality promoters in fly genome; see Methods.
eThe promoter set contains only cap-supported promoters in fly genome; see Methods.
fmean of 10,000 randomizations.
gempirical p-value calculated as the proportion of sampled randomizations where the percentage of the promoter pairs in each of the three similarity levels was no less than/no greater than the observed values
Motif similarity between neighboring unique promoters
| Similara | Intermediatea | Differenta | # of used pairs | # of omitted pairs | |||||||||||
| allb | high qualityc | capd | all | high quality | cap | all | high quality | cap | all | high quality | cap | all | high quality | cap | |
| Observed value (shared motifs allowed) | 17.81 | 16.44 | 15.47 | 16.36 | 16.90 | 21.13 | 65.83 | 66.67 | 63.40 | 1516 | 1083 | 530 | 8224 | 4200 | 983 |
| Observed value (shared motifs not allowed) | 16.64 | 15.06 | 14.34 | 15.50 | 15.91 | 19.96 | 67.86 | 69.03 | 65.70 | 1484 | 1056 | 516 | 8256 | 4227 | 997 |
| Random meane | 8.48 | 8.25 | 7.60 | 10.45 | 10.96 | 11.96 | 81.07 | 80.79 | 80.44 | 1516 | 1083 | 530 | 8224 | 4200 | 983 |
| p-valuef (random >= observed value, shared motifs allowed) | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | -- | -- | -- | -- | -- | -- |
| p-valuef (random <= observed value, shared motifs allowed) | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | -- | -- | -- | -- | -- | -- |
| p-valuef (random >= observed value, shared motifs not allowed) | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | -- | -- | -- | -- | -- | -- |
| p-valuef (random <= observed value, shared motifs not allowed) | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | -- | -- | -- | -- | -- | -- |
apercentage of promoter pairs defined as similar, intermediate, and different (see Methods).
bThe promoter set contains all promoters from Drosophila genome annotation release 5.5.
cThe promoter set contains only high quality promoters in fly genome; see Methods.
dThe promoter set contains only cap-supported promoters in fly genome; see Methods.
emean of 10,000 randomizations.
fempirical p-value calculated as the proportion of sampled randomizations where the percentage of the promoter pairs in each of the three similarity levels was no less than/no greater than the observed values.
Figure 3Sequence similarity of neighboring genes and their promoters. (A) The gene sequence similarity of neighboring unique gene pairs whose motifs are similar (black bars) or different (white bars). (B) The promoter sequence (-130 to +50 bp) similarity of neighboring unique gene pairs whose motifs are similar (black bars) or different (white bars).