| Literature DB >> 15096275 |
Marc N Offman1, Ramil N Nurtdinov, Mikhail S Gelfand, Dmitrij Frishman.
Abstract
BACKGROUND: Alternative splicing is an efficient mechanism for increasing the variety of functions fulfilled by proteins in a living cell. It has been previously demonstrated that alternatively spliced regions often comprise functionally important and conserved sequence motifs. The objective of this work was to test the hypothesis that alternative splicing is correlated with contact regions of protein-protein interactions.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15096275 PMCID: PMC419334 DOI: 10.1186/1471-2105-5-41
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1A schematic representation of the study. The positions of sequence regions involved in protein-protein interactions were delineated from structurally characterized protein complexes and juxtaposed with the location of putative alternatively spliced regions.
Figure 2Alternative splicing in the cyclin-dependent kinase 2 (cdk2). a. Amino acid sequence of the protein. Spliced regions are colored red. The first (N-terminal) region was identified based on the homology with the deletion type cdk2 variant in human breast cancer (protein id BAA32794.1). The second (C-terminal) region was identified based on the homology with the human cDNA sequence associated with leiomyosarcoma (GenBank accession number BQ225275). b. Graphical representation of spliced regions (red) and sequence spans involved in protein-protein interactions (blue). The full amino acid sequence is shown as black bar. c. Ribbon representation of the three-dimensional structure of cdk1 (PDB code 1fin, chain a). Regions involved in protein-protein interactions are shown in blue colour, spliced regions are red. Violet colour indicates regions where PPI and AS regions overlap.
Figure 3Alternative splicing in phosducin. a. Amino acid sequence of the protein. The spliced region was identified based on the homology with the human phosducin isoform b, also called phosducin-like orphan protein 1 (GenBank accession number NP_072098). b. Graphical representation of the spliced and interacting regions. c. Ribbon representation of the three-dimencional structure (PDB code 2TRC, chain p). Colouring as in Figure 2.
Figure 4Alternative splicing in importin-β. a. Amino acid sequence of the protein. The spliced region was identified based on the homology with the human cDNA sequence with the GenBank accession number BE744080. b. Graphical representation of the spliced and interacting regions. c. Ribbon representation of the three-dimencional structure (PDB code 1IBR, chain b). Colouring as in Figure 2.
Figure 5Alternative splicing in the human rhoa protein. a. Amino acid sequence of the protein. The splicing region was identified based on the homology with the human cDNA sequence associated with large cell carcinoma (GenBank accession number BQ231766) as well as several other cDNAs. b. Graphical representation of the spliced and interacting regions. c. Ribbon representation of the three-dimensional structure (PDB code 1CXZ, chain a). Colouring as in Figure 2.
The dataset of heterodimers.
| 1AM4-A | 199 | 856.04 | 84–88, 119, 122, 123, 126, 127, 185, 189–194, 196–202, 205, 211–213, 215–218, 220 | del., subst. | 103 | 234 | 28 |
| 1AM4-D | 174 | 958.07 | 511–513, 532–539, 556, 560–564, 566, 567, 570, 571, 586, 588, 592, 596 | del., subst. | 536 | 674 | 6 |
| del., subst. | 663 | 674 | 29 | ||||
| del., subst. | 663 | 674 | 1 | ||||
| 1BKD-R | 166 | 1655.36 | 5, 12–18, 20, 21, 25, 30–35, 37, 40, 41, 54–71, 73, 95, 98, 99, 102, 103, 105 | del., subst. | 151 | 166 | 16 |
| 1C1Y-A | 167 | 695.37 | 3, 21, 24, 25, 27, 29, 33, 34, 37, 38–42, 52, 54, 56, 63, 71 | del., subst. | 20 | 167 | 15 |
| del., subst. | 62 | 167 | 33 | ||||
| 1C1Y-B | 77 | 616.01 | 55, 57, 59, 62, 64–71, 73, 84, 85, 87–91 | del. | 70 | 131 | 0 |
| 1CXZ-A | 182 | 892.09 | 0–3, 5, 25–29, 32, 43–48, 50, 52–54, 163–169, 172 | del., subst. | 52 | 181 | 38 |
| del. | 52 | 136 | 0 | ||||
| del., subst. | 139 | 181 | 48 | ||||
| del., subst. | 52 | 181 | 1 | ||||
| 1E96-A | 178 | 579.01 | 21, 22, 24–33, 36, 40, 41, 159–162 | Ins. | 75 | 76 | 19 |
| del., subst. | 13 | 178 | 18 | ||||
| del., subst. | 76 | 178 | 40 | ||||
| 1FIN-A | 289 | 1609.88 | 37–50, 52–58, 69, 71–74, 76, 115, 116, 119–124, 150–159, 162, 179–183, 271–279 | del. | 163 | 196 | 0 |
| del. | 40 | 65 | 0 | ||||
| 163 | 196 | 0 | |||||
| 1FIN-B | 260 | 1794.40 | 173–178, 181, 182, 185, 186, 189, 228, 230, 262, 263, 265–272, 274, 275, 288, 289, 292, 295–300, 302–309, 312–317 | del., subst. | 266 | 432 | 4 |
| 1I2M-A | 165 | 1438.50 | 17–21, 23, 67–77, 91–103, 106–108, 110, 133, 134, 137, 138, 140 | del., subst. | 84 | 173 | 6 |
| Ins. | 83 | 84 | 17 | ||||
| del. | 9 | 12 | 0 | ||||
| del., ins. | 9 | 12 | 0 | ||||
| 83 | 84 | 17 | |||||
| 1I2M-B | 388 | 1341.05 | 42, 44, 45, 55, 56, 75–77, 93–96, 106, 109, 128, 129, 147–152, 181, 200, 201, 249, 250, 266, 268–271, 278, 279, 303, 304, 320, 322, 323, 325, 334, 354, 355, 371, 373, 374, 382, 384, 407–409 | del., subst. | 182 | 411 | 5 |
| Ins. | 43 | 44 | 17 | ||||
| Ins. | 43 | 44 | 31 | ||||
| ins., del., subst. | 43 | 44 | 17 | ||||
| 182 | 411 | 5 | |||||
| 1IBR-B | 458 | 1646.59 | 7, 10–15, 18, 19, 21, 22, 25, 26, 51, 52, 55–60, 62, 63, 66–69, 72, 76, 104–108, 110, 111, 114, 155–157, 159, 160, 189, 199, 200, 232, 235, 239, 246, 273–275, 277, 278, 281, 284, 285, 288, 335, 336, 339–343, 350, 354 | del. | 233 | 334 | 101 |
| del. | 263 | 334 | 71 | ||||
| 1LFD-A | 87 | 583.07 | 18, 20, 27–35, 51–54, 56 | del., subst. | 37 | 100 | 17 |
| 1WQ1-G | 320 | 1391.63 | 745, 746, 749, 750, 782–793, 795, 796, 799, 802, 803, 831, 833, 894–898, 901–904, 906, 907, 910, 911, 914, 927, 928, 931, 934, 935, 938, 939, 942, 944, 947–952 | del., subst. | 897 | 1037 | 15 |
| 2TRC-P | 212 | 2251.56 | 14–26, 28–30, 32, 33, 62–71, 77, 80, 85, 90, 93–95, 97, 99, 102, 105, 132, 135, 193, 194, 196–201, 207, 219, 220, 222–230 | del | 14 | 52 | 0 |
| 2TRC-B | 340 | 2175.25 | 8, 12, 42, 44–48, 55–57, 59, 75–77, 96–101, 116–119, 143–148, 161–164, 184–186, 188, 203–206, 226–230, 246, 266, 268, 270–274, 288–292, 295, 304, 306–317, 332, 333, 335, 337, 339 | del | 20 | 33 | 0 |
a Nooren & Thornton, 2003 b c del. = deletion of residues, subst. = substitution of residues. d residue numbers according to PDB file numbering e Length of the substituted part.
The dataset of homodimers.
| 1IKN-A | 285 | 698.19 | 195, 197–201, 211, 213–218, 242, 243, 245, 246, 248–251 | subst. | 19 | 24 | 7 |
| del. | 85 | 112 | 0 | ||||
| del. subst. | 85 | 290 | 18 | ||||
| del. | 113 | 290 | 0 | ||||
| del. subst. | 186 | 291 | 1 | ||||
| del. | 186 | 219 | 0 | ||||
| 1A15-A | 67 | 752.65 | 20–31, 35, 36, 61, 62, 64–67 | del. | 16 | 39 | 0 |
| del. subst. | 63 | 67 | 10 | ||||
| del. | 40 | 49 | 0 | ||||
| 1DOM-A | 76 | 849.23 | 3–18, 20, 29, 31, 33–38, 40–43, 49–53 | del. subst. | 42 | 76 | 1 |
| 1CNT-1 | 177 | 954.74 | 11, 19, 81, 84, 91, 92, 95, 96, 100–103, 106–111, 113–115, 117, 118, 120–122, 124, 125, 128, 129, 133–137, 140 | del. subst. | 49 | 187 | 24 |
| 1TRZ-A | 21 | 74.10 | 21 | 1 | 15 | ||
a Nooren & Thornton, 2003 b c del. = deletion of residues, subst. = substitution of residues. d residue numbers according to PDB file numbering e Length of the substituted part
Figure 6Smoothing positions of PPI regions. Amino acid sequence is represented as a bit string in which 1 indicates those residue positions that are involved in PPI, and 0 – those that are not. Subsequently smoothing is conducted such that contact segments separated by spacers shorter than N residues are merged, and then the segments shorter than M are deleted. In this example a sequence fragment of the human importin β-chain (PDB code 1ibr; residue positions from 7 to 77) is shown which interacts with the GTP-binding nuclear protein RAN. The interaction involves residues situated on one side of two α-helices. After smoothing with N = 5 and M = 1 entire helices are considered interacting regions.
Figure 7Counting coinciding PPI and AS regions. For clarity, no smoothing was made. a. By individual residues. In this example, there are 20 amino acid positions involved both in PPI and AS (PPI/AS, red colour), 19 non-PPI/AS positions (blue colour), 10 PPI/non-AS positions (green colour), and 6 non-PPI/non-AS positions (black colour). b. By entire sequence segments. In this example there are three PPI regions entirely covered by AS regions (red), one PPI region partially overlapping with an AS region (blue), and one PPI region not overlapping with AS.