| Literature DB >> 17288573 |
Brandon Smith1, Hung Fang, Youlian Pan, P Roy Walker, A Fazel Famili, Marianna Sikorska.
Abstract
BACKGROUND: Transcription factors regulate gene expression by interacting with their specific DNA binding sites. Some transcription factors, particularly those involved in transcription initiation, always bind close to transcription start sites (TSS). Others have no such preference and are functional on sites even tens of thousands of base pairs (bp) away from the TSS. The Cyclic-AMP response element (CRE) binding protein (CREB) binds preferentially to a palindromic sequence (TGACGTCA), known as the canonical CRE, and also to other CRE variants. CREB can activate transcription at CREs thousands of bp away from the TSS, but in mammals CREs are found far more frequently within 1 to 150 bp upstream of the TSS than in any other region. This property is termed positional bias. The strength of CREB binding to DNA is dependent on the sequence of the CRE motif. The central CpG dinucleotide in the canonical CRE (TGACGTCA) is critical for strong binding of CREB dimers. Methylation of the cytosine in the CpG can inhibit binding of CREB. Deamination of the methylated cytosines causes a C to T transition, resulting in a functional, but lower affinity CRE variant, TGATGTCA.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17288573 PMCID: PMC1796609 DOI: 10.1186/1471-2148-7-S1-S15
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Variation of representation index (RI) of CRE motifs over human TSS-SR regions. The representation index of each 50 bp window along the human TSS-SRs are plotted for the 5 CRE motifs showing the strongest positional bias in the 3 windows from -1 to -150 bp away from the TSS.
Figure 2Variation of representation index (RI) over the TSS-SR of 9 species. The normalised representation index of each 50 bp window along the TSS-SRs are plotted for the canonical (TGACGTCA) and TG-variant (TGATGTCA) CREs in mouse, rat and dog (A and D); chicken, frog and fish (B and E); and sea squirt, fly and worm (C and F), respectively. In order to be comparable on the same scale, we normalised the RI values in each 50 bp by dividing them by the sum of all RI values in the 40 windows over the TSS-SRs. The values presented are percentages.
Conservation of human CREs in mouse, rat, and dog
| TGACGTCA | 56.9 | 57.2 | 56.7 | 57.0 | 0.3 |
| TGATGTCA | 34.6 | 32.9 | 33.0 | 33.5 | 1.0 |
| TGACGCAC | 26.3 | 27.0 | 33.0 | 28.8 | 3.7 |
| TGACGTCG | 27.2 | 27.9 | 31.2 | 28.8 | 2.2 |
| TGACGTGG | 29.4 | 29.9 | 24.7 | 28.0 | 2.9 |
Frequency of orthologous sequences
| TGACGTCA | 373 | 364 | 380 | 372.3 | 56.9 | 0.3 | |
| TGA | 26 | 27 | 31 | 28.0 | 4.3 | 0.3 | |
| TGACGTCA | TGAC | 29 | 22 | 19 | 23.3 | 3.6 | 0.8 |
| TGACG | 13 | 14 | 8 | 11.7 | 1.8 | 0.5 | |
| TGACGTC | 7 | 10 | 12 | 9.7 | 1.5 | 0.4 | |
| TGATGTCA | 475 | 438 | 540 | 484.3 | 33.5 | 1.0 | |
| TGA | 44 | 47 | 57 | 49.3 | 3.4 | 0.2 | |
| TGATGTCA | TGA | 24 | 23 | 25 | 24.0 | 1.7 | 0.1 |
| TGAT | 17 | 20 | 29 | 22.0 | 1.5 | 0.3 | |
| T | 18 | 20 | 20 | 19.3 | 1.3 | 0.1 | |
Sites differing from the human sequence are highlighted in bold.
Canonical and TG-variant CREs in human CpG islands
| TGA | TGA | CG:CG | 298 | 373 | 80 |
| TGA | CG:TG | 33 | 55 | 60 | |
| TGA | TGA | TG:CG | 14 | 44 | 32 |
| TGA | TG:TG | 82 | 475 | 17 | |
Summary of human (hg17) TSS-SRs
| 21,164 | 18,016 | 23,903 | 46,485 |
RefSeq: Sequences from NCBIs RefSeq collection, MGC: Sequences from the Mammalian Genome Collection, hINV: Sequences from the human invitational clone database
Summary of TSS-SRs from 9 species
| hg17 | RefSeq Genes | 24,819 | |
| mm5 | RefSeq Genes | 20,199 | |
| rn3 | RefSeq Genes | 8,555 | |
| galGal2 | RefSeq Genes | 1,692 | |
| xenTro1 | JGI genes | 33,749 | |
| danRer2 | RefSeq Genes | 9,944 | |
| ci1 | JGI genes | 15,569 | |
| dm1 | RefSeq Genes | 18,960 | |
| ce2 | RefSeq Genes | 23,461 |