| Literature DB >> 18055499 |
Fabian Birzele1, Gergely Csaba, Ralf Zimmer.
Abstract
Alternative splicing is thought to be one of the major sources for functional diversity in higher eukaryotes. Interestingly, when mapping splicing events onto protein structures, about half of the events affect structured and even highly conserved regions i.e. are non-trivial on the structure level. This has led to the controversial hypothesis that such splice variants result in nonsense-mediated mRNA decay or non-functional, unstructured proteins, which do not contribute to the functional diversity of an organism. Here we show in a comprehensive study on alternative splicing that proteins appear to be much more tolerant to structural deletions, insertions and replacements than previously thought. We find literature evidence that such non-trivial splicing isoforms exhibit different functional properties compared to their native counterparts and allow for interesting regulatory patterns on the protein network level. We provide examples that splicing events may represent transitions between different folds in the protein sequence-structure space and explain these links by a common genetic mechanism. Taken together, those findings hint to a more prominent role of splicing in protein structure evolution and to a different view of phenotypic plasticity of protein structures.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18055499 PMCID: PMC2241867 DOI: 10.1093/nar/gkm1054
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Distribution of 488 splicing events in the four major categories. 35% of the events fall into variable regions of the corresponding superfamily while 11% affect complete domains of multi-domain proteins; 8% of the isoforms affect larger regions (more than 50% of the structure) while 46% affect conserved regions of their corresponding superfamily which are present in all superfamily members.
The distribution of non-trivial isoforms in the eight categories defined based on evolutionary considerations with respect to different features
| Coil | α | β (p) | β (i) | αβ (p) | αβ (i) | Repeat | Large | Total | |
|---|---|---|---|---|---|---|---|---|---|
| Conserved region affected (class) | 14 | 50 | 29 | 25 | 49 | 35 | 21 | 37 | 255 |
| Isoform confirmed (protein level) | 4 | 15 | 2 | 9 | 6 | 3 | 1 | 3 | 43 |
| Function described | 2 | 7 | 1 | 5 | 5 | 3 | 1 | 2 | 26 |
| Log Odds: Function/Class | 0.36 | 0.41 | −1.16 | 0.78 | 0 | −0.2 | −0.81 | −0.67 |
(p) and (i) indicate the position of the corresponding β-strands either at internal or peripheral positions of the sheet. The ‘Conserved region affected’ row displays the number isoforms which affect conserved regions of the corresponding superfamily. The ‘Isoform confirmed’ row displays isoforms which have been confirmed in the literature (see Supplementary Data for complete list) on the protein level, while the ‘Function described’ row references isoforms in the different categories which have been described in the literature to perform a well-defined function. The ‘Function/Class’ row contains the log odd ratios of functionally described isoforms in the different structural classes (third row) versus the background class distribution (first row). All log odd ratios of the values given for ‘Isoform confirmed’ and ‘Function described’ against the background distribution of structural classes within isoforms with ‘conserved regions affected’ can be found in the Supplementary Data, Figure S1.
Figure 2.Visualization of alternative splicing events on the structure level. Substitutions are coloured in green while deletions are coloured in red. All figures have been created using PyMOL (http://www.pymol.org). (a) The removal of the carboxy-terminal part from MK14_HUMAN (Q16539-4, pdb: 1zzlA), (b) the removal of one external strand and helix motif in PPAC_HUMAN (P24666-3, pdb: 5pnt), (c) the removal of an internal strand in TF65-HUMAN (Q04206-3, pdb 1nfi), (d) the removal of a large part of the protein from TIP30_HUMAN (Q9BUP3-2, pdb: 2bkaA), (e) the removal of several strands in CASP9_HUMAN (P55211-2, pdb: 1nw9B), (f) and (g) the removal of repetitive motifs from WDR1_CAEEL (Q11176-2: pdb: 1pevA) and CD2A1_HUMAN (P42771-2, pdb 2a5e) as well as (h) the removal of one half of a TIM-barrel structure in CHIA_HUMAN (Q9BZP6-3, pdb: 1vf8A). A comprehensive database of alternative splicing events mapped onto protein structures can be found at: http://www.bio.ifi.lmu.de/ProSAS/NARSupplement.html.
Figure 3.This figure shows four examples for probable fold-changing events by non-trivial splicing events (i.e. those which cannot be accommodated in the native structure) identified by superposing the spliced structure to a different SCOP fold. The examples can also be explored interactively following this link http://www.bio.ifi.lmu.de/ProSAS/NARSupplement.html. Each column represents one example. In the first row, the splicing event is visualized on the native protein structure of the Swissprot protein. In the second row, the superposition of the spliced protein with the corresponding protein from a different fold is shown. Rows three and four display TOPS (34) diagrams of the spliced protein (row three) and the protein belonging to the different fold (row four). In the TOPS diagrams, corresponding secondary structure elements are coloured the same, elements missing in the other protein are coloured in red. Sometimes corresponding helices are split up which frequently results from breaks in the DSSP assignments. From left to right the following examples are shown: Column 1 DNMT2_HUMAN (O14717-6, Astral: d1g55a_, SCOP: c.66.1.26). The spliced protein superposes very well (TM-Score: 0.68) with d1gsoa2 (SCOP: c.30.1.1). Topologically the proteins are very similar, except for a very short strand (length 2) - helix (length 4) motif at the C-terminal end of d1gsoa2. Column 2 AUHM_MOUSE (Q9JLZ3-2, Astral: d1hzda_, SCOP: c.14.1.3) that superposes well (TM-Score: 0.56) with d1vc1a_ (SCOP: c.13.2.1). Topologically the proteins are similar, except for two small strands (both of length 2) and one short helix (length 3) missing in d1vc1a_. Additionally, the C-Terminal part of d1vc1a_ has an additional, short helix-strand motif. Column 3 HER1_CAEEL (P34704-2, Astral: d1szha_, SCOP: a.226.1.1) superposed with d1ni8a_ (SCOP: a.155.1.1, TM-Score 0.49). Only a small fragment (helix–turn–helix-motif) is left over by the splicing event. The two TOPS diagrams are similar with the two main helices being preserved while short helical parts are missing in either of the two proteins. Interestingly, d1ni8a_ is described to contribute to DNA binding after dimerization (35), which might also be the way how the isoform resulting from the HER1 splicing event is stabilized. Column 4 TIP30_HUMAN (Q9BUP3-2, PDB: 2bka, SCOP: c.2.1.2) that again superposes well with d1gsoa2 (see also DNMT2_HUMAN) from SCOP fold c.30.1.1 (TM-Score: 0.54). Topologically the two proteins are very similar according to TOPS except for two helices missing at the C- and N-terminal ends. The function of the isoform is discussed in the text (isoform TC3). Images have been created using Jmol (http://www.jmol.org) and PyMol (http://www.pymol.org).