| Literature DB >> 19308253 |
Karen A Power1, James P McRedmond, Andreas de Stefani, William M Gallagher, Peadar O Gaora.
Abstract
Alternative splicing (AS) is an intrinsic regulatory mechanism of all metazoans. Recent findings suggest that 100% of multiexonic human genes give rise to splice isoforms. AS can be specific to tissue type, environment or developmentally regulated. Splice variants have also been implicated in various diseases including cancer. Detection of these variants will enhance our understanding of the complexity of the human genome and provide disease-specific and prognostic biomarkers. We adopted a proteomics approach to identify exon skip events - the most common form of AS. We constructed a database harboring the peptide sequences derived from all hypothetical exon skip junctions in the human genome. Searching tandem mass spectrometry (MS/MS) data against the database allows the detection of exon skip events, directly at the protein level. Here we describe the application of this approach to human platelets, including the mRNA-based verification of novel splice isoforms of ITGA2, NPEPPS and FH. This methodology is applicable to all new or existing MS/MS datasets.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19308253 PMCID: PMC2654914 DOI: 10.1371/journal.pone.0005001
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Workflow for the identification of novel exon skip events.
A rectangle represents a program, rhombus represents a program output, cylinder represents a data source and circle represents a program input.
Figure 2Generation and usage of the SkipE database.
(A) Generation of representative transcripts. Each box represents an exon and each line is an intron. i), ii) and iii) represent transcripts from a single gene. iv) shows the full length representative transcript used to generate the junction peptides. (B) Structure of a junction peptide. The top two boxes represent the translated sequences of two separate, non-adjacent exons. The tryptic cleavage sites are represented by dashed vertical bars. The C-terminal sequence of the upstream exon from the final tryptic site is spliced to the N-terminus of the downstream exon and extends to the first tryptic cleavage site of the downstream exon.
Figure 3Characteristics of the exon skip events detected in human platelets.
(A) and (B) describe the distribution of the SkipE and IPI peptides respectively, across the different subcellular compartments for both resting and activated platelet samples.
Description of the 89 genes identified in both SkipE and IPI.
| Gene Symbol | Ensembl Gene ID | Exon ID 1 | Exon ID 2 | Description |
| ACLY | ENSG00000131473 | ENSE00000898911 | ENSE00000898879 | ATP-citrate synthase |
| ACOX1 | ENSG00000161533 | ENSE00001222343 | ENSE00001117984 | Acyl-coenzyme A oxidase 1, peroxisomal |
| ACTN4 | ENSG00000130402 | ENSE00000895798 | ENSE00000895787 | Alpha-actinin-4 |
| ALOX12 | ENSG00000108839 | ENSE00000905333 | ENSE00000887238 | Arachidonate 12-lipoxygenase, 12S-type |
| AMPD2 | ENSG00000116337 | ENSE00001153152 | ENSE00000913099 | AMP deaminase 2 |
| AP1B1 | ENSG00000100280 | ENSE00000652055 | ENSE00000652051 | AP-1 complex subunit beta-1 |
| APOB | ENSG00000084674 | ENSE00000932268 | ENSE00000718984 | Apolipoprotein B-100 precursor |
| APOL1 | ENSG00000100342 | ENSE00000935990 | ENSE00001369317 | Apolipoprotein-L1 precursor |
| ARHGEF7 | ENSG00000102606 | ENSE00000686804 | ENSE00000686825 | Rho guanine nucleotide exchange factor 7 |
| ARHGEF7 | ENSG00000102606 | ENSE00001236980 | ENSE00000686833 | Rho guanine nucleotide exchange factor 7 |
| ATIC | ENSG00000138363 | ENSE00001363573 | ENSE00001146950 | Bifunctional purine biosynthesis protein PURH |
| ATP5C1 | ENSG00000165629 | ENSE00001481323 | ENSE00001094820 | ATP synthase gamma chain, mitochondrial precursor |
| C2 | ENSG00000204364 | ENSE00001467298 | ENSE00001467293 | Complement C2 precursor |
| C21orf33 | ENSG00000160221 | ENSE00001506662 | ENSE00001506660 | ES1 protein homolog, mitochondrial precursor |
| C3 | ENSG00000125730 | ENSE00001053527 | ENSE00000858107 | Complement C3 precursor |
| C3 | ENSG00000125730 | ENSE00001053551 | ENSE00000858104 | Complement C3 precursor |
| CCT5 | ENSG00000150753 | ENSE00001082664 | ENSE00001082663 | T-complex protein 1 subunit epsilon |
| CD109 | ENSG00000156535 | ENSE00001144336 | ENSE00001144250 | CD109 antigen precursor |
| CD109 | ENSG00000156535 | ENSE00001144243 | ENSE00001084417 | CD109 antigen precursor |
| CLTC | ENSG00000141367 | ENSE00000948100 | ENSE00000948105 | Clathrin heavy chain 1 |
| CLTCL1 | ENSG00000070371 | ENSE00000596272 | ENSE00001343357 | Clathrin heavy chain 2 |
| COL14A1 | ENSG00000187955 | ENSE00001022732 | ENSE00001090753 | Collagen alpha-1 |
| COL14A1 | ENSG00000187955 | ENSE00000702894 | ENSE00001476378 | Collagen alpha-1 |
| COPB1 | ENSG00000129083 | ENSE00000886038 | ENSE00000703797 | Coatomer subunit beta |
| COPB2 | ENSG00000184432 | ENSE00001322263 | ENSE00001311447 | Coatomer subunit beta |
| CP | ENSG00000047457 | ENSE00001008190 | ENSE00000779559 | Ceruloplasmin precursor |
| CSE1L | ENSG00000124207 | ENSE00000845497 | ENSE00000845507 | Exportin-2 |
| CYFIP1 | ENSG00000068793 | ENSE00000883355 | ENSE00000883353 | Cytoplasmic FMR1-interacting protein 1 |
| DCTN1 | ENSG00000204843 | ENSE00001261315 | ENSE00001199793 | Dynactin-1 |
| ENO1 | ENSG00000074800 | ENSE00000739712 | ENSE00000738913 | Alpha-enolase |
| FAM62A | ENSG00000139641 | ENSE00000939452 | ENSE00000939471 | Protein FAM62A |
| FH | ENSG00000091483 | ENSE00000961691 | ENSE00001069123 | Fumarate hydratase, mitochondrial precursor |
| FLII | ENSG00000177731 | ENSE00001289389 | ENSE00001289270 | Protein flightless-1 homolog. |
| FLNA | ENSG00000196924 | ENSE00000678331 | ENSE00000868362 | Filamin-A |
| GLUD1 | ENSG00000148672 | ENSE00000986500 | ENSE00000986506 | Glutamate dehydrogenase 1, mitochondrial precursor |
| GPD2 | ENSG00000115159 | ENSE00000924640 | ENSE00001188495 | Glycerol-3-phosphate dehydrogenase, mitochondrial precursor |
| GUCY1A3 | ENSG00000164116 | ENSE00001231799 | ENSE00001081588 | Guanylate cyclase soluble subunit alpha-3 |
| HD | ENSG00000197386 | ENSE00000854949 | ENSE00000854981 | Huntington disease protein |
| HD | ENSG00000197386 | ENSE00000854965 | ENSE00001251513 | Huntington disease protein |
| HD | ENSG00000197386 | ENSE00000854958 | ENSE00000854991 | Huntington disease protein |
| HD | ENSG00000197386 | ENSE00000854979 | ENSE00000855002 | Huntington disease protein |
| HERC2 | ENSG00000128731 | ENSE00000672196 | ENSE00001275912 | Probable E3 ubiquitin-protein ligase HERC2 |
| HERC2 | ENSG00000128731 | ENSE00000672179 | ENSE00001275876 | Probable E3 ubiquitin-protein ligase HERC2 |
| HERC2 | ENSG00000128731 | ENSE00000908550 | ENSE00000908562 | Probable E3 ubiquitin-protein ligase HERC2 |
| HK1 | ENSG00000156515 | ENSE00001145338 | ENSE00001276961 | Hexokinase-1 |
| HSD17B4 | ENSG00000133835 | ENSE00001143964 | ENSE00000972282 | Peroxisomal multifunctional enzyme type 2 |
| HSD17B4 | ENSG00000133835 | ENSE00001143927 | ENSE00000972282 | Peroxisomal multifunctional enzyme type 2 |
| HSD17B4 | ENSG00000133835 | ENSE00001169924 | ENSE00001144014 | Peroxisomal multifunctional enzyme type 2 |
| HSD17B4 | ENSG00000133835 | ENSE00001143964 | ENSE00001143927 | Peroxisomal multifunctional enzyme type 2 |
| HYOU1 | ENSG00000149428 | ENSE00001195270 | ENSE00000990519 | 150 kDa oxygen-regulated protein precursor |
| IQGAP2 | ENSG00000145703 | ENSE00000971759 | ENSE00001030776 | Ras GTPase-activating-like protein IQGAP2. |
| ITGA2 | ENSG00000164171 | ENSE00001082079 | ENSE00001082066 | Integrin alpha-2 precursor |
| ITGA2 | ENSG00000164171 | ENSE00001082085 | ENSE00001082079 | Integrin alpha-2 precursor |
| ITGB3 | ENSG00000056345 | ENSE00000947489 | ENSE00000735016 | Integrin beta-3 precursor |
| ITIH2 | ENSG00000151655 | ENSE00001415117 | ENSE00001395332 | Inter-alpha-trypsin inhibitor heavy chain H2 precursor |
| ITPR1 | ENSG00000150995 | ENSE00001072653 | ENSE00001122088 | Inositol 1,4,5-trisphosphate receptor type 1 |
| KIF5B | ENSG00000170759 | ENSE00001163763 | ENSE00001163716 | Kinesin heavy chain |
| KRT16 | ENSG00000186832 | ENSE00001118312 | ENSE00001118295 | Keratin, type I cytoskeletal 16 |
| KTN1 | ENSG00000126777 | ENSE00001292736 | ENSE00000867340 | Kinectin |
| LCP2 | ENSG00000043462 | ENSE00000769281 | ENSE00000812799 | Lymphocyte cytosolic protein 2 |
| LRRFIP2 | ENSG00000093167 | ENSE00000825531 | ENSE00000760563 | Leucine-rich repeat flightless-interacting protein 2 |
| LTBP1 | ENSG00000049323 | ENSE00000932484 | ENSE00000932488 | Latent-transforming growth factor beta-binding protein, isoform 1L precursor |
| LTBP1 | ENSG00000049323 | ENSE00000932483 | ENSE00001006678 | Latent-transforming growth factor beta-binding protein, isoform 1L precursor |
| LTBP1 | ENSG00000049323 | ENSE00000932485 | ENSE00000744639 | Latent-transforming growth factor beta-binding protein, isoform 1L precursor |
| LTBP1 | ENSG00000049323 | ENSE00000809557 | ENSE00000744639 | Latent-transforming growth factor beta-binding protein, isoform 1L precursor |
| MACF1 | ENSG00000127603 | ENSE00001041391 | ENSE00001079474 | Microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 |
| MACF1 | ENSG00000127603 | ENSE00001408360 | ENSE00001218066 | Microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 |
| MACF1 | ENSG00000127603 | ENSE00001411283 | ENSE00001218029 | Microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 |
| MACF1 | ENSG00000127603 | ENSE00001411283 | ENSE00001041391 | Microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 |
| MMRN1 | ENSG00000138722 | ENSE00001003940 | ENSE00001003943 | Multimerin-1 precursor |
| MTCH2 | ENSG00000109919 | ENSE00000714864 | ENSE00001267224 | Mitochondrial carrier homolog 2 |
| MTHFD1 | ENSG00000100714 | ENSE00000658410 | ENSE00000658424 | C-1-tetrahydrofolate synthase, cytoplasmic |
| MTHFD1 | ENSG00000100714 | ENSE00000658406 | ENSE00000658420 | C-1-tetrahydrofolate synthase, cytoplasmic |
| MYH4 | ENSG00000141048 | ENSE00000907666 | ENSE00000907657 | Myosin-4 |
| NID2 | ENSG00000087303 | ENSE00000854715 | ENSE00000657316 | Nidogen-2 precursor |
| NID2 | ENSG00000087303 | ENSE00000657316 | ENSE00000854708 | Nidogen-2 precursor |
| NPEPPS | ENSG00000141279 | ENSE00001138170 | ENSE00001138132 | Puromycin-sensitive aminopeptidase |
| NRBP1 | ENSG00000115216 | ENSE00000809167 | ENSE00000733215 | Nuclear receptor-binding protein. |
| OGDH | ENSG00000105953 | ENSE00000681534 | ENSE00000681548 | 2-oxoglutarate dehydrogenase E1 component, mitochondrial precursor |
| PDIA5 | ENSG00000065485 | ENSE00001149277 | ENSE00001353839 | Protein disulfide-isomerase A5 precursor |
| PICALM | ENSG00000073921 | ENSE00000742961 | ENSE00001376469 | Phosphatidylinositol-binding clathrin assembly protein |
| PIP5K2A | ENSG00000150867 | ENSE00000996551 | ENSE00000996552 | Phosphatidylinositol-4-phosphate 5-kinase type-2 alpha |
| PKHD1L1 | ENSG00000205038 | ENSE00001477427 | ENSE00001477413 | fibrocystin L |
| PKHD1L1 | ENSG00000205038 | ENSE00001477417 | ENSE00001477394 | fibrocystin L |
| PKHD1L1 | ENSG00000205038 | ENSE00001477471 | ENSE00001477421 | fibrocystin L |
| PKHD1L1 | ENSG00000205038 | ENSE00001477455 | ENSE00001477439 | fibrocystin L |
| PKHD1L1 | ENSG00000205038 | ENSE00001477437 | ENSE00001477347 | fibrocystin L |
| PKHD1L1 | ENSG00000205038 | ENSE00001477474 | ENSE00001477449 | fibrocystin L |
| PLEC1 | ENSG00000178209 | ENSE00001244151 | ENSE00001244041 | Plectin-1 |
| PLEC1 | ENSG00000178209 | ENSE00001244070 | ENSE00001295392 | Plectin-1 |
| PLG | ENSG00000122194 | ENSE00000828808 | ENSE00001315450 | Plasminogen precursor |
| PLXDC2 | ENSG00000120594 | ENSE00001137970 | ENSE00000996527 | Plexin domain-containing protein 2 precursor |
| PROS1 | ENSG00000184500 | ENSE00001142430 | ENSE00001142413 | Vitamin K-dependent protein S precursor. |
| PSMC6 | ENSG00000100519 | ENSE00000657442 | ENSE00000657448 | 26S protease regulatory subunit S10B |
| PTPN18 | ENSG00000072135 | ENSE00000436095 | ENSE00000776192 | Tyrosine-protein phosphatase non-receptor type 18 |
| RAB8A | ENSG00000167461 | ENSE00001113277 | ENSE00001277163 | Ras-related protein Rab-8A |
| RASA3 | ENSG00000185989 | ENSE00001334941 | ENSE00001334928 | Ras GTPase-activating protein 3 |
| RTN2 | ENSG00000125744 | ENSE00000858227 | ENSE00000858223 | Reticulon-2 |
| SNX17 | ENSG00000115234 | ENSE00000734775 | ENSE00000734780 | Sorting nexin-17 |
| SNX17 | ENSG00000115234 | ENSE00000962998 | ENSE00000734785 | Sorting nexin-17 |
| SPTBN1 | ENSG00000115306 | ENSE00001036038 | ENSE00001036017 | Spectrin beta chain, brain 1 |
| SRC | ENSG00000197122 | ENSE00001390472 | ENSE00000661882 | Proto-oncogene tyrosine-protein kinase Src |
| STOM | ENSG00000148175 | ENSE00000983575 | ENSE00001262522 | Erythrocyte band 7 integral membrane protein |
| THBS1 | ENSG00000137801 | ENSE00000883758 | ENSE00000883772 | Thrombospondin-1 precursor |
| TMEM33 | ENSG00000109133 | ENSE00001489658 | ENSE00000712706 | Transmembrane protein 33 |
| TMOD3 | ENSG00000138594 | ENSE00001170748 | ENSE00001102815 | Tropomodulin-3 |
| TPD52L2 | ENSG00000101150 | ENSE00000663594 | ENSE00001391722 | Tumor protein D54 |
| UBASH3B | ENSG00000154127 | ENSE00001014167 | ENSE00001014158 | Suppressor of T-cell receptor signaling 1 |
| UBE1L | ENSG00000182179 | ENSE00001305417 | ENSE00001306981 | Ubiquitin-activating enzyme E1 homolog |
| UGCGL1 | ENSG00000136731 | ENSE00001148961 | ENSE00001206051 | UDP-glucose:glycoprotein glucosyltransferase 1 precursor |
| UGP2 | ENSG00000169764 | ENSE00001189522 | ENSE00001165982 | UTP–glucose-1-phosphate uridylyltransferase 2 |
| UNC13D | ENSG00000092929 | ENSE00001227797 | ENSE00001406672 | Unc-13 homolog D |
| UNC13D | ENSG00000092929 | ENSE00001227615 | ENSE00001430590 | Unc-13 homolog D |
| USP14 | ENSG00000101557 | ENSE00001208659 | ENSE00001252715 | Ubiquitin carboxyl-terminal hydrolase 14 |
| VPS13A | ENSG00000197969 | ENSE00001024130 | ENSE00000803886 | Vacuolar protein sorting-associated protein 13A |
| VPS13A | ENSG00000197969 | ENSE00001024085 | ENSE00000708339 | Vacuolar protein sorting-associated protein 13A |
| VPS13A | ENSG00000197969 | ENSE00000708190 | ENSE00000708458 | Vacuolar protein sorting-associated protein 13A |
| VPS13A | ENSG00000197969 | ENSE00001171911 | ENSE00000803905 | Vacuolar protein sorting-associated protein 13A |
| VPS13A | ENSG00000197969 | ENSE00001024110 | ENSE00000708298 | Vacuolar protein sorting-associated protein 13A |
| VPS13A | ENSG00000197969 | ENSE00001024141 | ENSE00001024126 | Vacuolar protein sorting-associated protein 13A |
| VPS13A | ENSG00000197969 | ENSE00001024130 | ENSE00000707929 | Vacuolar protein sorting-associated protein 13A |
| VPS13C | ENSG00000129003 | ENSE00001124918 | ENSE00000885044 | Vacuolar protein sorting-associated protein 13C. |
| VPS13C | ENSG00000129003 | ENSE00001124912 | ENSE00001364815 | Vacuolar protein sorting-associated protein 13C. |
| VPS13C | ENSG00000129003 | ENSE00000449795 | ENSE00001380396 | Vacuolar protein sorting-associated protein 13C. |
| VPS13C | ENSG00000129003 | ENSE00000885045 | ENSE00001368990 | Vacuolar protein sorting-associated protein 13C. |
| VPS13C | ENSG00000129003 | ENSE00000885061 | ENSE00001484949 | Vacuolar protein sorting-associated protein 13C. |
| VPS13C | ENSG00000129003 | ENSE00000885061 | ENSE00000885051 | Vacuolar protein sorting-associated protein 13C. |
| WAS | ENSG00000015285 | ENSE00000669947 | ENSE00001255082 | Wiskott-Aldrich syndrome protein |
| WDR44 | ENSG00000131725 | ENSE00000899838 | ENSE00000899846 | WD repeat protein 44 |
The Gene symbol, Ensembl gene and exon identifiers and the gene descriptions are listed for all 129 junctions found in 89 genes. The exon identifiers one and two indicate the exons involved in the junction peptide identified in SkipE.
Figure 4Exon skip events verified at mRNA level.
Each numbered box represents an exon and the position in the gene. The skip event is indicated by the diagonal lines. The parallelograms enclose the portion of amino acid sequence that is absent from the novel splice isoform. The bold and underlined form the junction peptides.
Figure 5Frequency distribution of exon skip lengths identified in the SkipE database.
The number of occurrences of each skip length identified is shown.
Design of exon-junction-spanning PCR primers.
| Gene | Junction Peptide | Junction Primer | Product Length (bp) | ||||||||
|
| AQELDALDNSHPIEAR | T | CCT | ATT | GAA | – | GCT | CGA | GCT | G | 200 |
| P | I | E | A | R | |||||||
|
| MPEFSGYVQQVK | AA | CGC | ATG | CCA | – | GAA | TTT | AGT | G | 165 |
| M | P | E | F | S | |||||||
|
| ELIPLMIMKPDEK | CC | AAA | GAA | TTG | – | ATT | CCC | CTG | A | 115 |
| E | L | I | P | L | |||||||
The gene symbol, junction peptide sequence, junction primer and product length are shown. The junction primer column indicates, with a dash, the exon-exon boundary.
Figure 6Agarose gel electrophoresis of PCR products.
Lanes 1–5 show products amplified from megakaryocyte cDNA; samples in lanes 6–10 contained template from platelets. Molecular size markers are in lane 11.