| Literature DB >> 22749400 |
Marija Buljan1, Guilhem Chalancon, Sebastian Eustermann, Gunter P Wagner, Monika Fuxreiter, Alex Bateman, M Madan Babu.
Abstract
Alternative inclusion of exons increases the functional diversity of proteins. Among alternatively spliced exons, tissue-specific exons play a critical role in maintaining tissue identity. This raises the question of how tissue-specific protein-coding exons influence protein function. Here we investigate the structural, functional, interaction, and evolutionary properties of constitutive, tissue-specific, and other alternative exons in human. We find that tissue-specific protein segments often contain disordered regions, are enriched in posttranslational modification sites, and frequently embed conserved binding motifs. Furthermore, genes containing tissue-specific exons tend to occupy central positions in interaction networks and display distinct interaction partners in the respective tissues, and are enriched in signaling, development, and disease genes. Based on these findings, we propose that tissue-specific inclusion of disordered segments that contain binding motifs rewires interaction networks and signaling pathways. In this way, tissue-specific splicing may contribute to functional versatility of proteins and increases the diversity of interaction networks across tissues.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22749400 PMCID: PMC3437557 DOI: 10.1016/j.molcel.2012.05.039
Source DB: PubMed Journal: Mol Cell ISSN: 1097-2765 Impact factor: 17.970
Figure 1Classification of Exons and Characterization of the Protein Segments
(A) Cassette exons that are tissue-specifically spliced are in red, and alternatively spliced cassette exons not present in the set of tissue-specific exons are in dark gray. Constitutive exons are in light gray.
(B) Shown are types of data and computational approaches used to characterize the exons. See also Figure S1.
Figure 2Tissue-Specific Segments Are Enriched in Disordered Binding Motifs and Are Conserved
Percentages of protein segments encoded by the three different sets of exons that contain (A) Pfam domains, (B) disordered regions, (C) ANCHOR-binding motifs, and (D) PTM sites. Significance was assessed by chi-square test, and p values are indicated in each panel. Box plot of the distribution of conservation (percent amino acid identity) for the (E) disordered region, (F) binding motifs, and (G) binding motifs versus other regions for the three different sets of exons. Statistical significance was assessed using Mann-Whitney test. The median value within each set is shown with a thick black line. Boxes enclose values between the first and third quartile. Interquartile range (IQR) is calculated by subtracting the first quartile from the third quartile. All values that lie more than 1.5× IQR lower than the first quartile or 1.5× higher than the third quartile are considered to be outliers and were removed from the graphs to improve visualization. The smallest and highest values that are not outliers are connected with the dashed line. The notches correspond to ∼95% confidence interval for the median. See also Figure S2 and Table S1.
Figure 3Genes with Tissue-Specific Exons Play an Important Role in Protein Interaction Networks
(A) Distribution of the number of interaction partners for genes with a TS exon (TSE genes; with and without predicted binding motif) and for all other genes (non-TSE genes). See Figure 2 legend for the description of a box plot.
(B) Average number of interaction partners for the TSE (red circles) and non-TSE (gray circles) genes in different tissues. The average and median number of interactions for TSE genes was greater than for non-TSE genes in each of the tissues (p < 5 × 10−2; Mann-Whitney), except for liver and testis interstitial cells.
(C) Principle of the Jaccard similarity index. The score for a gene varies between 0 and 1, where 0 indicates that no interactions are maintained and 1 indicates that all interactions are maintained between a pair of tissues. Hence, the smaller the number is, the more tissue specific are its interactions.
(D) Distribution of Jaccard similarity indices for TSE and non-TSE genes in the tissues included in this study. Gray horizontal line in all box plots indicates median value for the non-TSE genes. Statistical significance was assessed using Mann-Whitney test. See also Figure S3 and Table S2.
Genes with Tissue-Specific Exons that Participate in More Than One Signaling Pathway
| HGNC Gene Name | Gene Description from ENSEMBL | Tissues or Cell Lines with Differential Exon Inclusion | Signaling Pathways |
|---|---|---|---|
| PRKACA | Protein kinase, cAMP-dependant, catalytic, alpha | Cerebellum and MB435 | EGF/MAPKNC, HedgehogC, WNT/WinglessC |
| GSK3B | Glycogen synthase kinase 3 beta | MCF7 and cerebellum | EGF/MAPKC, NC, HedgehogC, TGF-βC, WNT/WinglessC |
| AXIN1 | Axin 1 | Cerebellum and T47D, cerebellum and MCF7 | EGF/MAPKNC, TGF-βC,NC, WNT/WinglessC |
| DAB2 | Disabled homolog 2, mitogen-responsive phosphoprotein | Breast and colon, cerebellum and heart | TGF-βNC, WNT/WinglessNC |
| TGFBR2 | Transforming growth factor, beta receptor II | Heart and testes | EGF/MAPKC, TGF-βC |
| IRAK1 | Interleukin-1 receptor-associated kinase 1 | Cerebellum and skeletal muscle | EGF/MAPKNC, JAK/STATNC |
| DAXX | Death-domain-associated protein | Breast and cerebellum, breast and heart | EGF/MAPKNC, TGF-βNC |
Multipathway genes were obtained from the SignaLink Database (http://signalink.org/); core and noncore pathway genes are denoted by C and NC, respectively. See also Figure S4 and Table S5.
Examples Where Tissue-Specific Segments Overlap with Experimentally Verified Interaction Sites
| Protein Name (Tissue-Specifically Spliced Region) | UniProt Accession (Annotated Binding Region) | Binding Partners (PMID) | Description of Interaction |
|---|---|---|---|
| ANXA7_HUMAN (5–20) | Sorcin (9268363) | Annexin VII functions in membrane fusion events. The N-terminal repeat region of Annexin VII (residues 1–31) is required for the calcium-dependent interaction with the EF-hand protein sorcin. | |
| AXIN1_HUMAN (20–29) | Tankyrase (22307604) | The tumor suppressor Axin-1 regulates the Wnt/β catenin signaling pathways. Turnover of Axin-1 is controlled by tankyrase. Axin-1 contains two Tankyrase binding motifs (residues 18–30 and 60–80). The putative TS isoform of Axin-1 lacks both of these motifs. | |
| PF21A_HUMAN (488–535) | Histone H3 (17687328) | PF21A is a component of the BRAF-HDAC complex and mediates transcriptional repression of neuron specific genes in nonneuronal cells. The putative TS isoform of PF21A lacks a PHD zinc finger domain that recognizes unmethylated lysine 4 of histone H3. | |
| CIP4_HUMAN (293–537) | CDC42, TC10 (9210375, 12242347, 19387844) | The F-BAR protein CIP-4 has an important role in endocytosis. The second coiled-coil domain of CIP4 (residues 332–425) is required for interaction with the GTPases CDC-42 and TC-10. | |
| ILK_HUMAN (33–139) | LIMS1 [PINCH] (19074270) | Integrin-linked kinase has been implicated in cell adhesion, growth factor, and Wnt/β catenin signaling pathways. The adaptor protein PINCH is overexpressed in several types of cancers. The ankyrin repeats of ILK (residues 2–154) interact with the first LIM domain of PINCH. | |
| MASP1_HUMAN (20–278) | FCN2, MBL2 (18596036) | MASP-1 is a component of the mannan binding lectin pathway of the complement system. The putative TS exon of MASP-1 encodes a signal peptide (1–19) and part of a Cub-1 domain (20–138). The latter is involved in calcium-dependent homodimerization of MASP-1 and in the interaction with Mannan Binding Lectin 2 as well as Ficolin. | |
| NUB1_HUMAN (427–474) | NEDD8 (12816948) | Alternative splicing modulates the role of Nub-1 as an important negative regulator of the NEDD8 conjugation system. A longer Nub-1 isoform (Nub1L) possesses a third UBA motif required for Nedd8 interaction. | |
| NFX1_HUMAN (9–26) | PABPC1, PABC4 (17267499) | The interaction of the N-terminal PAM2 motif (9–26) of NFX1 with the cytoplasmic poly(A) binding proteins PABPC1 and PABC4 is important for the role of NFX1 in regulation of telomerase activity (hTERT transcription). | |
| PEX19_HUMAN (1–56) | PEX3 (21102411) | The interaction of Pex19 and Pex3 is critical for peroxisome formation and the posttranslational targeting of peroxisomal membrane proteins. Pex3 interacts with residues 17–32 of Pex19, which are disordered in the unbound state. | |
| PI51C_HUMAN (641–668) | TLN2, AP-2μ, AP-2β (12422219, 19903820) | The predominant brain splice variant of PtdInsPKI gamma contains a short C-terminal recognition motif. Interaction of this motif with the FERM domain of Talin targets and activates the kinase for focal adhesion assembly during cell migration and synaptic vesicle recycling at nerve terminals. Moreover, it has been found that the same C-terminal region of PtdInsPKI interacts with the appendage domain of AP-2β and the signal-sorting domain of AP-2μ during clathrin-mediated endocytosis. | |
| KCC2A_HUMAN, KCC2B_HUMAN (autoinhibition) | Intramolecular autoinhibition of CamKII (21884935) | CamKII functions in neuronal signaling, and the interaction of the N-terminal kinase domain with the C-terminal Hub domain is important for autoinhibition of the enzyme. The linker length between these two domains differs between the paralogues, providing a mechanism to fine-tune the strength of the interaction and autoinhibition. The TS exons encoding for same linker region are also alternatively spliced. | |
| FGFR2_HUMAN (autoinhibition) | Intramolecular autoinhibition of FGFR2 (22244757) | Uncontrolled fibroblast growth factor (FGF) signaling has been implicated in human diseases. The acidic linker region between the first and second immunoglobulin domains (D1 and D2) interacts with D2 and maintains an autoinhibitory state of FGFR. Certain isoforms lack D1 and/or the acidic linker region, leading to increased capacity of FGF signaling. | |
| CO2_HUMAN (1–20) | not annotated (7852336, 19237749) | Complement C2 is a part of the classical pathway of the complement system, and deficiencies of this protein are associated with autoimmune diseases. A short TS splice isoform of Complement C2 retains an intracellular localization, as it lacks the N-terminal signal peptide required for secretion. Moreover, it lacks a CCP domain that is involved in the interaction with C4b. | |
| RIP_HUMAN (164–180) | not annotated (16135809) | RPA functions in DNA replication, repair, and recombination. RIP interacts with RPA and determines thereby the subcellular localization of RPA. A region of RIP (164–180), required for its cytoplasmic retention, has been shown to be encoded by an alternatively spliced exon. | |
Definitions of protein regions that mediate an interaction were derived from the UniProt Database, and the functional description are summarized from the literature. PubMed identifiers (PMIDs) are provided as reference for interactions.
Figure 4Examples of Tissue-Specific Exons that Can Affect Protein Interactions
Transcripts (above) and the encoded protein sequences (below) are shown for tissues (box). Protein domains (colored regions) and tissue-specific segments (red) are shown.
(A) The PIP5K1C gene shows differential inclusion of a TS exon between cerebellum and lymph node. This segment undergoes disorder-to-order transition when interacting with the clathrin adaptor AP-2 (AP2B1). The exclusion of this segment is likely to abolish the interaction.
(B) Protein structure (3H1Z), containing interacting segments from these two proteins.
(C) PACSIN2 shows differential inclusion of a TS exon between cerebellum and breast tissue. All four annotated phosphorylation sites are inside the TS exon. Thus, the exclusion is likely to affect regulation via phosphorylation. See also Table S3.
Figure 5Alternative Inclusion of Tissue-Specific Exons Can Rewire Interaction Networks and Modulate Protein Interactions
(A) When tissue-specific splicing gives rise to isoforms that differ in the presence of interacting protein segments, this can result in the rewiring of an interaction network in the respective tissues. This is illustrated with the PIP5K1C kinase gene that has an exon with different inclusion levels in cerebellum and lymph mode. The exon encodes a binding motif (red cartoon representation), which mediates the interaction with the AP-2β appendage domain (cyan surface representation).
(B) Molecular mechanisms by which a tissue-specific segment can rewire or fine-tune protein interactions. Segments in red are encoded by TS exons; other regions encoded by the same protein are colored dark gray. Interaction partners are colored in cyan. Segments encoded by TS exons can include a region that directly interacts with other proteins, DNA, RNA, or ligands, or indirectly affect the protein's binding properties (e.g., affinity, kinetics, and selectivity). TS segments that are involved in interactions are less frequently domains and more often disordered regions that embed peptide-binding motifs. An example for the former is a TS segment encoding the complete PF21A PHD zinc finger domain that recognizes nonmethylated lysine 4 of histone H3 (2PUY). An example for the latter is the disordered segment encoded by a TS exon of PEX19, which becomes ordered upon binding to PEX3 (3MK4). An example of a splicing event that affects linker length, thereby altering its affinity to bind DNA, is the fly UBX gene (1B8I; disordered linker, red line; and DNA, light gray surface). An example of where the spliced exon encodes an intramolecular interaction motif that competes or masks an interaction interface is the acid box region of the FGFR kinases (model based on 2CKN and 1YRY; acidic linker, red line). See also Table S4.