| Literature DB >> 21421563 |
Daniel Calva1, Fadi S Dahdaleh, George Woodfield, Ronald J Weigel, Jennifer C Carr, Sathivel Chinnathambi, James R Howe.
Abstract
Inactivation of SMAD4 has been linked to several cancers and germline mutations cause juvenile polyposis (JP). We set out to identify the promoter(s) of SMAD4, evaluate their activity in cell lines and define possible transcription factor binding sites (TFBS). 5'-rapid amplification of cDNA ends (5'-RACE) and computational analyses were used to identify candidate promoters and corresponding TFBS and the activity of each was assessed by luciferase vectors in different cell lines. TFBS were disrupted by site-directed mutagenesis (SDM) to evaluate the effect on promoter activity. Four promoters were identified, two of which had significant activity in several cell lines, while two others had minimal activity. In silico analysis revealed multiple potentially important TFBS for each promoter. One promoter was deleted in the germline of two JP patients and SDM of several sites led to significant reduction in promoter activity. No mutations were found by sequencing this promoter in 65 JP probands. The predicted TFBS profiles for each of the four promoters shared few transcription factors in common, but were conserved across several species. The elucidation of these promoters and identification of TFBS has important implications for future studies in sporadic tumors from multiple sites, and in JP patients.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21421563 PMCID: PMC3141234 DOI: 10.1093/nar/gkr091
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Map of chromosome 18 near SMAD4, including NC and coding exons (top line) and five different splice variants found by 5′-RACE below. The number of 5′-RACE clones found in various tissues and ESTs described in the UCSC genome browser (www.genome.ucsc.edu) are shown at the right of each isoform. From the 149 ESTs described, only 43 of them contained the NC exons and 5′-UTR, while the rest consisted only of coding exons. The map positions of NC exons are: NC-exon 1:46 748 387–46 748 482; NC-exon 2:46 754 765–46 754 933; NC-exon 3:46 808 762–46 809 033; NC-exon 4:46 810 582–46 810 991 and 5′-UTR 46 827 288–46 827 414. Regions A, B, C, D are predicted to be the potential promoters for these isoforms.
Figure 2.Luciferase expression from the various deletion constructs of promoter A, B, C and D from the 4 cell lines, HEK-293, CRL-1459, MCF-7 and CCL-247. Deletion construct numbering is relative to the TSS; the last constructs of promoter A end at +19 relative to the TSS, for B at +473, C at +156 from the TSS and for D at +449 into the first coding exon. Note that there is no activity seen for promoter A in CRL-1459; that promoter B was only evaluated in HEK-293 and CRL-1459 cells and that Promoter D was not evaluated in CCL-247 cells.
Figure 3.Sequences of each promoter region, including phylogenetically conserved TFBS as predicted by comparative genomics, where highlighted nucleotides exhibit the highest conservation within the TFBS consensus sequence. The center line is the Chromosome 18 map, with the red vertical lines representing the four promoters relative to SMAD4 coding exons. Red arrows designate TSS in each sequence, the numbers on the right of each sequence are relative to the TSS; CG dinucleotides are shown in red. (A) Gene2promoter ID: GXP_57502, which overlaps with promoter A. There were eight TFBS predicted to be phylogenetically conserved (with spacing demonstrated by the line with colored beads at top), represented by colored highlighted regions at their sequences and names given to each color above: (HOMF, XCPE, ETSF, HNFP, GLIF, NRF1 and CTCF. (B) GXP_1261977, which overlaps with promoter B. TFBS were predicted to be: YY1F, EBOX, NR2F, MYBL, NF1F, DICE, MITF, KLFS and HESF. (C) GXP_57777, which overlaps with promoter C; six TFBS were predicted: ETSF, DMTF, SRFF, NRF1, BPTF and P53F. The sequences with stars are predicted to be phylogenetically conserved between species, while the colored sequences without stars were predicted as TFBS by the Genomatix software (but were not conserved) and were also evaluated by SDM. (D) GXP_1484970, which overlaps with promoter D. In promoter D, 14 TFBS were predicted: RUSH, NEUR, HOMF, BRNF, OVOL, GREF, MYBL, ABDB, NKX6, SORY, HOXF, FKHD, P53F and PDX1. The sequence within promoter D that has a single underline is the 5′-UTR and the doubly underlined represents coding exon 1. The green arrow is the translation start site.
Figure 4.Map of chromosome 18 in the vicinity of the SMAD4 gene. The top line shows the map positions and exons of SMAD4; the second line the location of MLPA probes and the smaller line at left beneath, the location of CGH probes. The bar graph below shows MLPA results of two patients, showing a 50% decrease in amplification of the three probes within NC exon 4 and the probe located just 5′ upstream of NC exon 4 (Patient A) and additionally up through coding exon 8 in Patient B.
Figure 5.Luciferase activities in HEK-293 cells after mutating selected TFBS in promoter C. The wild-type luciferase activity is compared to 11 different mutant constructs: ETSF; DMTF; SRFF; VTBP; SP1F: Sp/KLF family of transcription factors; ZBPF: Zinc finger transcription factors (three different sites); NRF1; BPTF; P53F: p53 tumor suppressor.
Figure 6.Map of Chromosome 18, with (lines from top to bottom): the coding exons of SMAD4; promoter regions; results of ChIP-Seq for RNA Polymerase II in MCF-7 cells; DNase I hypersensitivity sites in MCF-7 cells (from ENCODE data); ChIP-Seq for RNA Polymerase II in HEK-293 cells (from ENCODE data); DNase I hypersensitivity sites in fibroblasts (from ENCODE data) and the density of CG dinucleotides. The areas under promoters A and C are highlighted in gray.