| Literature DB >> 17651478 |
Sergio Lois1, Noemí Blanco, Marian Martínez-Balbás, Xavier de la Cruz.
Abstract
BACKGROUND: Epigenetic regulators (histone acetyltransferases, methyltransferases, chromatin-remodelling enzymes, etc) play a fundamental role in the control of gene expression by modifying the local state of chromatin. However, due to their recent discovery, little is yet known about their own regulation. This paper addresses this point, focusing on alternative splicing regulation, a mechanism already known to play an important role in other protein families, e.g. transcription factors, membrane receptors, etc.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17651478 PMCID: PMC1949830 DOI: 10.1186/1471-2164-8-252
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Alternative splicing pattern of human histone methyltransferase SUV39H2. Representation of the domain structure of three isoforms of SUV39H2, together with their sizes. Shown in red are the two domains, PRE-SET and SET that constitute the catalytic unit of the enzyme. The interaction domain, chromodomain, is shown in green. This domain is seriously damaged in the second isoform, and is unlikely to play any targeting role. The catalytic unit, on the contrary, remains intact in this second isoform, but is clearly damaged in the third isoform, with 28 % of the SET domain and the whole PRE-SET domain missing.
Summary of the data utilised in this work
| Number of genes | Number of genes with AS | Number of genes with AS involving protein domains | |
| 160 | 78 | 46 | |
| 71 | 50 | 32 | |
| 31 | 21 | 10 |
Cases for which alternative splicing sequence changes mainly affect catalytic domains
| Gene name | Species | Reference Isoform Size | Alternative Isoform Size | Domains affected |
| SUV39H2 | H.s. | 410 | 230 | PRESET, SET |
| SMARCA1 (SNF2L) | H.s. | 1054 | 1033 | DEXHC |
| PRKDC (DNA-PK) | H.s. | 4127 | 4097 | PI3_PI4 KINASE |
| RPS6KA5 (MSK1) | H.s. | 802 | 549 | PKINASE |
| EZH2 | H.s. | 751 | 376 | SET |
| EHMT2 (G9a) | H.s. | 1210 | 202 | ANK, PRESET, SET* |
| CARM1 (PRMT4) | H.s. | 608 | 412 | SKB1 |
| SETDB1 | H.s. | 1290 | 397 | MBD, PRESET, SET, TUDOR |
| EHMT1 | H.s. | 1267 | 1153 | SET |
| FBXL11 (JHDM1A) | H.s. | 1162 | 856 | JMJC |
| AOF2 (LSD1) | H.s. | 876 | 852 | AMINO_OXIDASE |
| GSG2 (HASPIN) | H.s. | 798 | 314 | PKINASE |
| PRDM2 (RIZ1) | H.s. | 1718 | 1481 | SET |
| Setdb1 | M.m. | 1308 | 488 | MBD, PRESET, SET |
| Htatip | M.m | 546 | 492 | MOZ_SAS |
| Fbxl10 (Jhdm1b) | M.m. | 1309 | 776 | JMJC |
| Fbxl10 (Jhdm1b) | M.m | 1309 | 656 | JMJC |
| Jmjd1b (Jhdm2b) | M.m | 1562 | 1124 | JMJC |
| fbxl10 (Jhdm1b) | X.l. | 1259 | 738 | JMJC |
| mez2 | Z.m. | 894 | 624 | SET |
Figure 2Impact of alternative splicing in catalytic domains. In all cases the part of the protein affected by alternative splicing is shown in yellow, while the remaining of the protein is shown in blue. (A) Domain DEXHC of human chromatin remodelling SMARCA1. Alternative splicing results in the loss of a α-helix. (B) Domain PI3_PI4_KINASE of kinase PRKDC. Alternative splicing results in the loss of a sequence stretch that has very distant ends. The figures were obtained using the MOLSCRIPT software [65].
Cases for which alternative splicing sequence changes mainly affect interaction domains
| Gene name | Species | Reference Isoform Size | Alternative Isoform Size | Domains affected |
| SUV39H2 | H.s. | 410 | 350 | CHROMO |
| GCN5L2 | H.s. | 837 | 476 | PCAF_N |
| GCN5L2 | H.s. | 837 | 427 | PCAF_N |
| MYST-1 | H.s. | 467 | 300 | CHROMO |
| SMARCA2 (BRM) | H.s. | 1590 | 1572 | BROMO |
| MLL | H.s. | 3969 | 3931 | PHD |
| MORF4L1 | H.s. | 362 | 333 | CHROMO |
| MORF4L1 | H.s. | 362 | 323 | CHROMO |
| FBXL10 (JHDM1B) | H.s. | 1336 | 1326 | LRR_RI |
| FBXL10 (JHDM1B) | H.s. | 1336 | 1306 | LRR_RI |
| JMJD2B (JHDM3B) | H.s. | 1096 | 448 | PHD, TUDOR |
| MLL2 | H.s. | 5265 | 4957 | RING, PHD |
| MLL3 | H.s. | 4911 | 4029 | PHD |
| NSD1 | H.s. | 2696 | 2593 | PWWP |
| RNF40 | H.s. | 1001 | 838 | RING, ZF_C3HC4 |
| Morf4l1 | M.m. | 362 | 323 | CHROMO |
| Htatip | M.m. | 546 | 302 | CHROMO |
| Fbxl11 (Jhdm1a) | M.m. | 1161 | 494 | ZF_CXXC |
| Fbxl11 (Jhdm1a) | M.m. | 1161 | 338 | ZF_CXXC |
| Jmjd2a (Jhdm3a) | M.m. | 1064 | 1033 | PHD, TUDOR |
| Jmjd2b (Jhdm3b) | M.m. | 1086 | 1021 | TUDOR |
| cbp-1 | C.e. | 2056 | 2045 | ZNF_TAZ |
Figure 3Impact of alternative splicing in interaction domains. In all cases the part of the protein affected by alternative splicing is shown in yellow, while the remaining of the protein is shown in blue. (A) Chromodomain of human histone methyltransferase SUV39H2. One of the main strands of the β-sheet is missing in one of the alternative splice isoforms. (B) Bromodomain of human chromatin remodelling SMARCA2. One of the four helices of the helix bundle is lost in the alternative splice isoforms. The figures were obtained using the MOLSCRIPT software [65].
Figure 4Impact of alternative splicing in the ZNFTAZ domain of . A small strand (yellow) is lost in one of the alternative splice isoform. Only small changes can be expected from this deletion. The figure was obtained using the MOLSCRIPT software [65].
Cases for which alternative splicing sequence changes result in drastically affected isoforms
| Gene name | Species | Reference Isoform Size | Alternative Isoform Size | Domains affected |
| SETDB1 | H.s. | 1290 | 249 | MBD, PRESET, SET, TUDOR |
| SETDB1 | H.s. | 1290 | 151 | MBD, PRESET, SET, TUDOR |
| SMARCA2 (BRM) | H.s. | 1590 | 278 | HSA, BRK, DEXHC, HELICASE_C, BROMO |
| SMARCA2 (BRM) | H.s. | 1590 | 254 | HSA, BRK, DEXHC, HELICASE_C, BROMO |
| SMARCA2 (BRM) | H.s. | 1590 | 236 | HSA, BRK, DEXHC, HELICASE_C, BROMO |
| SMARCA2 (BRM) | H.s. | 1590 | 119 | HSA, BRK, DEXHC, HELICASE_C, BROMO |
| SMARCA4 (BRG1) | H.s. | 1679 | 628 | BRK, BROMO, DEXHC, HSA |
| SUV39H1 | H.s. | 412 | 409 | CHROMO, PRESET, SET |
| MLL | H.s. | 3969 | 511 | BROMO, FYRC, FYRN, PHD, SET, ZF-CXXC |
| ATM | H.s. | 3056 | 138 | FAT, FATC, PI3_PI4 KINASE |
| MORF4L1 | H.s. | 362 | 235 | MRG |
| EHMT1 | H.s. | 1267 | 825 | ANK, PRESET, SET |
| WBP7 (MLL4) | H.s. | 2715 | 582 | ZF_CXXC, PHD, FYRC, FYRN, SET |
| Setdb1 | M.m. | 1308 | 500 | MBD, PRESET, TUDOR |
| Stk4 | M.m. | 487 | 126 | PKINASE |
| Htatip | M.m. | 546 | 302 | CHROMO |
| Suv39h2 | M.m. | 477 | 257 | CHROMO, PRESET, SET |
| Fbxl10 (Jhdm1b) | M.m. | 1309 | 114 | JMJC, ZF_CXXC |
| Su(var)3–9 | D.m. | 635 | 475 | CHROMO, PRESET, SET |
| mez2 | Z.m. | 894 | 341 | SET |
In the "Gene name" column we list the standard names of the proteins, although in some cases we also provide alternative names that are frequently used in the literature. In the "Species" column H.s., M.m., D.m., C.e., O.s., X.l. and Z.m. mean Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Oryza sativa, Xenopus laevis and Zea mays, respectively. The sizes of the different isoforms are given in amino acid number. *In this case, although the ANK protein interaction domain is lost, the NFSP transcription factor binding domain is retained.
Figure 5Alternative splicing of human ubiquitin-conjugating enzyme E2 A UBE2A. The part of the protein affected by alternative splicing is shown in yellow, and the remaining in blue. One can see that a α-helix and a whole β-sheet are lost in one of the isoforms, with a potentially very disruptive effect. The figure was obtained using the MOLSCRIPT software [65].
Figure 6Hypothetical mechanism of regulation by alternative splicing of histone dimethyltransferase G9a function. (A) Experimental evidence indicates that histone dimethyltransferase G9a plays an important role in the silencing of neuronal genes in non-neuronal tissues [9]. In the proposed mechanism [9], shown here with red arrows, in non-neuronal tissues the transcription factor NFSP (shown in magenta) recruits the fully functional isoform of G9a (shown here with two domains: a binding domain in blue, and a catalytic domain in yellow) to a series of target genes that are subsequently silenced by G9a dimethylation of lysine-9 from histone H3. This mechanism may be inhibited/modulated by expression of the G9a short isoform (which only retains the NFSP transcription factor binding domain, Table 2), as shown here with green arrows. This isoform may behave as a dominant-negative inhibitor, as shown by the green arrows, blocking the access of the catalytically active isoform to the chromatin of the target gene. Absence of methylation marks in histone H3's lysine-9 would then result in an active gene. (B) The expression state of the target genes in both the nervous system (active, green colour) and in other tissues (silenced, red colour), as a result of the silencing, combined, action of NFSP and G9a. Co-expression of both the long and the short isoforms may result in the modification of the expression state of the target genes in non-neuronal tissues. These target genes may now show varying degrees of activity, as a result of the dominant-negative inhibitor role played by the short isoform (described in (A)).
List of genes showing alternative splicing
| Gene Symbol | Species | Function | Protein name |
| CDY-1 | H.s. | A | |
| GCN5L2 | H.s. | A | |
| HAT1 | H.s. | A | |
| HTATIP (TIP60) | H.s. | A | |
| MORF4L1 | H.s. | A | |
| MYST1 | H.s. | A | |
| NCOA-1 | H.s. | A | |
| TAF1 (TAF250) | H.s. | A | |
| CARM1 (PRMT4) | H.s. | M | |
| DOT1L | H.s. | M | |
| EHMT2 (G9a) | H.s. | M | |
| EZH2 | H.s. | M | |
| MLL | H.s. | M | |
| PRMT1 | H.s. | M | |
| SETD8 (PR-SET7, SET8) | H.s. | M | |
| SETDB1 | H.s. | M | |
| SUV39H1 | H.s. | M | |
| SUV39H2 | H.s. | M | |
| ATM | H.s. | P | |
| ATR | H.s. | P | |
| AURKB | H.s. | P | |
| MAP3K12 (DLK/ZIP) | H.s. | P | |
| PRKDC (DNA-PK) | H.s. | P | |
| RPS6KA5 (MSK1) | H.s. | P | |
| RPS6KA4 (MSK2) | H.s. | P | |
| CHD-3 | H.s. | R | |
| CHD-4 | H.s. | R | |
| SMARCA1 (SNF2L) | H.s. | R | |
| SMARCA2 (BRM) | H.s. | R | |
| SMARCA4 (BRG1) | H.s. | R | |
| UBE2A | H.s. | U | |
| CKII | H.s. | P | |
| EHMT1 | H.s. | M | |
| GSG2 (HASPIN) | H.s. | P | |
| FBXL11 (JHDM1A) | H.s. | DM | |
| FBXL10 (JHDM1B) | H.s. | DM | |
| JMJD1B (JHDM2B) | H.s. | DM | |
| JMJD2B (JHDM3B) | H.s. | DM | |
| JMJD2C (JHDM3C) | H.s. | DM | |
| AOF2 (LSD1) | H.s. | DM | |
| MLL2 | H.s. | M | |
| MLL3 | H.s. | M | |
| WBP7 (MLL4) | H.s. | M | |
| MLL5 | H.s. | M | |
| NSD1 | H.s. | M | |
| PRMT5 | H.s. | M | |
| PRDM2 (RIZ1) | H.s. | M | |
| RNF40 | H.s. | U | |
| SETDB2 | H.s. | M | |
| SIRT2 | H.s. | DA | |
| Gtf3c4 | M.m. | A | |
| Htatip | M.m. | A | |
| Morf4l1 | M.m. | A | |
| Ncoa-1 | M.m. | A | |
| Ehmt2 | M.m. | M | |
| Ezh2 | M.m. | M | |
| Prmt1 | M.m. | M | |
| Carm1 (Prmt4) | M.m. | M | |
| Setdb1 | M.m. | M | |
| Suv39h1 | M.m. | M | |
| Suv39h2 | M.m. | M | |
| Stk4 | M.m. | P | |
| Myst2 (Hbo1) | M.m. | A | |
| Fbxl11 (Jhdm1a) | M.m. | DM | |
| Fbxl10 (Jhdm1b) | M.m. | DM | |
| Jmjd1a (Jhdm2a) | M.m. | DM | |
| Jmjd1b (Jhdm2b) | M.m. | DM | |
| Jmjd2a (Jhdm3a) | M.m. | DM | |
| Jmjd2b (Jhdm3b) | M.m. | DM | |
| Ring1A | M.m. | U | |
| Rnf20 | M.m. | U | |
| Su(var)3–9 | D.m. | M | |
| trx | D.m. | M | |
| Taf1 | D.m. | P | |
| brm | D.m. | R | |
| cbp-1 | C.e. | A | |
| fbxl10 (jhdm1b) | X.l. | DM | |
| mez2 | Z.m. | M |
In the "Species" column H.s., M.m., C.e., O.s., X.l. and Z.m. mean Homo sapiens, Mus musculus, Caenorhabditis elegans, Oryza sativa, Xenopus laevis and Zea mays, respectively. In the column "Function" A, DA, DM, M, P, U and R mean Acetylation, deacetylation, demethylation, methylation, phosphorylation, ubiquitination and chromatin remodelling, respectively.
List of domains affected by alternative splicing in chromatin-modifying enzymes
| Domain name | Function | Enzyme name |
| AMINO_OXIDASE | Catalytic | AOF2 |
| ANK | Protein-Protein Interaction | EHMT1, EHMT2 |
| BRK | Unknown | SMARCA2, SMARCA4 |
| BROMO | Interaction (Acetylated Lysines) | SMARCA2, SMARCA4, MLL |
| CHROMO | Interaction (Methylated Lysines) | SUV39H1, SUV39H2, Suv39h2, Su(var)3–9, MYST-1, MORF4L1, Morf4l1, Htatip |
| DEXHC | Catalytic | SMARCA1, SMARCA2, SMARCA4 |
| FAT | Interaction/Modulate catalysis | ATM |
| FATC | Interaction/Modulate catalysis | ATM |
| FYRC | Probably not-catalytic | MLL, WBP7 |
| FYRN | Probably not-catalytic | MLL, WBP7 |
| HELICASE_C | SMARCA2 | |
| HSA | Probably DNA binding | SMARCA2, SMARCA4 |
| JMJC | Catalytic | FBXL11, fbxl10 (from Mus musculus and Xenopus laevis), jmjd1b |
| LRR_RI | Interaction | FBXL10 |
| MBD | DNA binding | SETDB1, Setdb1 |
| MOZ_SAS | Catalytic | Htatip |
| MRG | Interaction | MORF4L1 |
| PCAF_N | Interaction with CBP | GCN5L2 |
| PHD | Intra- and Intermolecular interactions | MLL, MLL2, MLL3, JMJD2B, Jmjd2a, WBP7 |
| PI3_PI4_KINASE | Catalytic | PRKDC, ATM |
| PKINASE | Catalytic | AURKB, GSG2, RPS6KA5, stk4 |
| PRESET | Interaction-Catalysis | SUV39H1, SUV39H2, Suv39h2, Su(var)3–9, SETDB1, Setdb1, EHMT1, EHMT2 |
| PWWP | Unknown | NSD1 |
| RING | Interaction | MLL2, RNF40 |
| SET | Catalytic | PRDM2, SUV39H1, SUV39H2, suv39h2, Su(var)3–9,SETDB1, Setdb1, mez2, MLL, WBP7, EHMT1, EHMT2, EZH2 |
| SKB1 | Catalytic | CARM1 |
| UBCC | Whole protein | UBE2A |
| TUDOR | Interaction | Jmjd2a, Jmjd2b, JMJD2B, SETDB1, Setdb1 |
| ZF_C3HC4 | Interaction | RNF40 |
| ZF_CXXC | Interaction | Fbxl10, Fbxl11, MLL, WBP7, |
| ZNF_TAZ | Interaction | cbp-1 |
Templates utilised for comparative modelling
| Protein name | Size Ref. | Species | Domain name | PDB code | % Seq. Id. |
| cbp-1 | 2056 | C.e. | ZNF_TAZ | 1L8C | 75 |
| SMARCA1 (SNF2L) | 1054 | H.s. | DEXHC | 1Z6A | 38 |
| SMARCA2 (BRM) | 1590 | H.s. | BROMO | 1N72 | 26 |
| SUV39H2 | 410 | H.s. | CHROMO | 1KNA | 47 |
| SUV39H2 | 410 | H.s. | SET | 1MVH | 39 |
| UBE2A | 154 | H.s. | UBCC | 1JAS | 95 |
| PRKDC (DNA-PK) | 4127 | H.s. | PI3-PI4 KINASE | 1E8Y | 29 |
In the "Species" column H.s. and C.e., mean Homo sapiens and Caenorhabditis elegans, respectively. The size of the whole protein is given in amino acid number. % Seq.Id. is the percentage of sequence identity between the target and the template sequences. The PDB code is the code of the template structure utilised for the comparative modelling in the PDB database [64].