| Literature DB >> 21461879 |
Fumiaki Uchiumi1, Satoru Miyazaki, Sei-ichi Tanuma.
Abstract
Transcription is one of the most fundamental nuclear functions and is an enzyme complex-mediated reaction that converts DNA sequences into mRNA. Analyzing DNA sequences of 5'-flanking regions of several human genes that respond to 12-O-tetradecanoyl-phorbol-13-acetate (TPA) in HL-60 cells, we have identified that the ets (GGAA) motifs are duplicated, overlapped, or clustered within a 500-bp distance from the most 5'-upstream region of the cDNA. Multiple protein factors including Ets family proteins are known to recognize and bind to the GGAA containing sequences. In addition, it has been reported that the ets motifs play important roles in regulation of various promoters. Here, we propose a molecular mechanism, defined by the presence of duplication and multiplication of the GGAA motifs, that is responsible for the initiation of transcription of several genes and for the recruitment of binding proteins to the transcription start site (TSS) of TATA-less promoters.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21461879 PMCID: PMC3101357 DOI: 10.1007/s00018-011-0674-x
Source DB: PubMed Journal: Cell Mol Life Sci ISSN: 1420-682X Impact factor: 9.261
Duplication of the GGAA motifs in various human promoters
| Genes | Sequence |
|---|---|
|
| 5′-GAAGC |
|
| 5′-CCTCA |
|
| 5′-TGTTC |
|
| 5′-TCACA |
|
| 5′-CTCAT |
|
| 5′-AGCAA |
|
| 5′-GCTCC |
|
| 5′-GTGGC |
|
| 5′-CGGGC |
|
| 5′-AAGAC |
|
| 5′-ACAGT |
|
| 5′-CCTCG |
|
| 5′-TTGTG |
The 5′-upstream regions (300–500 bp upstream from TSS) of the human IL-1β- and TPA-inducible late responding genes [4] were retrieved from the GRCh37 reference primary assembly sequence database and analyzed with the TF-SEARCH program (http://www.cbrc.jp/research/db/TFSEARCH.html). Eight of the 20 “late responding” classified promoters contain duplicated or triplicated GGAA (TTCC) motifs [4]. GGAA (TTCC) motifs in the IFN-inducible CD41, PDCD1, ISG15, and CD40 promoters are also shown
Fig. 1Most of the 5′-upstream regions having duplicated GGAA motifs are TATA-less. a The number of genes that have duplicated GGAA consensus 14-bp sequence [14] in the 2,000 bp upstream from TSS was 469. Among them, 372 genes have no TATA-box within 500 bp upstream. b The number of genes that have duplicated GGAA consensus 14-bp sequencein the 2,000 bp upstream and GGAA(TTCC) overlapping within 500 bp upstream of TSS was 345. Among them, 279 genes have no TATA-box within 500 bp upstream. Numbers of genes from the Ensemble database are shown
Human promoter regions containing duplication of the GGAA motifs near the TSS
| Group A | Group B | |||||||
|---|---|---|---|---|---|---|---|---|
| AAGAB | CPSF7 | FAM72D | HSPA5 | OBP2A | RBBP5 | SPATA2 | USP32 | ANAPC2 |
| ACAA1 | CRTC1 | FAM76B | HSPA8 | OTUB1 | RBM33 | SPTAN1 | VPS53 | ARMC10 |
| ACOX3 | CXCL17 | FAM89A | IFITM5 | OXT | RBM34 | SRP19 | WDFY3 | CCNK |
| AEN | CYB561 | FAM126B | IRF2BP1 | PAFAH1B3 | RDBP | SSNA1 | WDR27 | CIA01 |
| AFAP1L2 | DAPK3 | FAM153A | ITGB3BP | PDGFRB | R3HDM2 | SUMO1 | WDR87 | CNST |
| AGBL5 | DCAF5 | FASN | KBTBD4 | PEX6 | RPAP1 | SYNGAP1 | YRDC | CRYAB |
| AKAP10 | DDA1 | FBXL6 | KCNG4 | PEX14 | RSL1D1 | TFB2 M | ZBTB17 | EFCAB7 |
| AKT3 | DEAF1 | FBXL13 | KCNJ2 | PGAP1 | RSL1D1 | TFDP2 | ZBTB32 | FANCD2 |
| ALDH3B1 | DEC1 | FBXW2 | KTI12 | PHOX2B | RXFP4 | TMEM127 | ZBTB45 | IQCH |
| APITD1 | DEFA1 | FCHSD2 | MAPKAP1 | PIGM | SDHB | TMEM161A | ZC3H13 | LAMB2 |
| ARHGAP1 | DEFA1B | FIBP | MCM3AP | PITX3 | SDK2 | TOMM20 | ZNF252 | MRPL32 |
| ARMC7 | DEFA3 | FIZ1 | MED8 | PLLP | SENP1 | TNFSF12 | ZNF343 | NDUFB3 |
| BTRC | DFFA | GDPD2 | MEX3A | POLD4 | SETD3 | TRAF1 | ZNF443 | NDUFS3 |
| CCDC79 | DIS3 | GOLGA3 | MGAT4B | POLR2L | SF3A1 | TRPM3 | ZNF555 | NUDT22 |
| CCT6A | DPAGT1 | G6PD | MUT | PRELP | SF3B2 | TRPT1 | ZNF558 | ODF3B |
| CELA3B | DUX4 | GPR155 | MYBBP1A | P2RY14 | SFRS4 | TTC16 | ZNF628 | PDCD1 |
| CFL1 | EDC3 | GRB2 | NARS | PSMA2 | SLC2A13 | TTLL8 | ZNF669 | PSPH |
| CHMP1A | ELAC2 | GYS1 | NBPF11 | PSMB1 | SLC3A2 | TYMP | ZNF782 | SDHAF2 |
| CIDECP | ERO1L | HAS2 | NCAPG2 | PSME1 | SLC10A4 | UBE2MP1 | ZNF828 | TSPAN4 |
| C1QTNF6 | ETFA | HCFC1 | NOC2L | PURB | SLC35F2 | UBXN1 | ZSCAN29 | VPS52 |
| CORT | FAM49B | HDHD2 | NT5C | RAB27A | SLMO2 | USP19 | ZNF408 | |
| COX17 | FAM70B | HERC4 | NUP155 | RABL5 | SMUG1 | USP28 | ||
| 174 genes | 21 genes | |||||||
Group A The tentatively defined TPA-activation consensus 14-bp sequence, 5′-(A/G/C)N(A/G/C)(C/G)(C/G)GGAA(A/G)(C/T)(G/C/T)(A/G/C)(A/G/C)-3′, was searched in the Ensembl data base. Duplicated or overlapped GGAA motifs were identified near the most 5′-upstream of those 174 genes. Group B Twenty-one genes were found to be head–head linked with genes listed in Group A
Fig. 2Hypothetical transcription controlling system in which duplicated GGAA (TTCC) motifs and Ets family proteins are involved. Closed circles represent GGAA (TTCC) motifs that are located near the TSS. The transcription activity is indicated by the size of the arrows. Multiple Ets family proteins, which are indicated by open squares, closed squares, triangles, and diamonds, could be induced by differentiation-inducing signals, apoptosis-inducing signals or specific cytokines. Their redundant occupancy around the TSS can regulate variable gene expression with subtle changes, depending on the combinations of the binding proteins. The over-transcribed mRNAs may be further regulated or fine-tuned by micro RNAs