| Literature DB >> 34857024 |
Markus List1, Olga Tsoy2, Zakaria Louadi3,4, Maria L Elkjaer5,6,7, Melissa Klug3,8,9, Chit Tong Lio3,4, Amit Fenn3,4, Zsolt Illes5,6,7, Dario Bongiovanni8,9,10, Jan Baumbach4,11, Tim Kacprowski12,13.
Abstract
Alternative splicing (AS) is an important aspect of gene regulation. Nevertheless, its role in molecular processes and pathobiology is far from understood. A roadblock is that tools for the functional analysis of AS-set events are lacking. To mitigate this, we developed NEASE, a tool integrating pathways with structural annotations of protein-protein interactions to functionally characterize AS events. We show in four application cases how NEASE can identify pathways contributing to tissue identity and cell type development, and how it highlights splicing-related biomarkers. With a unique view on AS, NEASE generates unique and meaningful biological insights complementary to classical pathways analysis.Entities:
Keywords: Alternative splicing; Differential splicing; Dilated cardiomyopathy; Disease pathways; Functional enrichment; Multiple sclerosis; Platelet activation; Protein-protein interactions; Systems biology
Mesh:
Substances:
Year: 2021 PMID: 34857024 PMCID: PMC8638120 DOI: 10.1186/s13059-021-02538-1
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Overview of NEASE. A Annotated exons are mapped to Pfam domains, motifs, and residues. The joint graph of PPIs, DDIs, DMIs, and co-resolved structure is used to identify the interactions mediated by these features. B For a list of exons/events, NEASE identifies interactions mediated by the spliced protein features and pathways that are significantly affected by those interactions. C NEASE provides a corrected p value, in addition to an enrichment score (NEASE Score) for every pathway (see the “Methods” section). The user can further focus on an individual pathway, where NEASE can prioritize genes and find new biomarkers. In this example, the gene G3 was not part of the enriched pathway A but it has the largest number of affected interactions with genes from the pathway
Fig. 2Analysis of tissue-specific exons. A Heatmap and hierarchical clustering of standardized PSI values obtained from VastDB. The heatmap only shows events with a standard deviation of PSI values ≥ 20. The heatmap shows that clusters of exons upregulated in neural tissues and muscle/heart tissues are dominant (clusters C1 and C3). B NEASE analysis shows that 28% and 27% for both neural and muscle upregulated exons, respectively, are encoding protein features: domains, linear motifs, and residues. For these subgroups of events, the exact protein complexes involved can be identified, and NEASE enrichment can be performed. C, D Comparison between gene-level enrichment and NEASE enrichment for the two sets of exons
Fig. 3NEASE visually highlights the impact of the AS regulation at the interactome level. The gray nodes represent proteins from the pathway and the red nodes represent genes with AS events. Red edges represent the affected interactions for the nodes with known DDIs, DMIs, or co-resolved structures. The visualization of the pathway “Synaptic vesicle cycle” from the KEGG database for the exons upregulated in the neural tissues shows that the splicing in the genes CLTA and CLTB is co-regulated and affects the interactions of the same complex. Similarly, NEASE highlights the importance of the domain ATP6V0A1 which is upregulated in neural tissues and binds seven proteins from the “Synaptic vesicle cycle” pathway
Fig. 4A 15 % of differentially spliced exons, between reticulated and mature platelets, are known to encode protein features. For this subset of exons, NEASE enrichment can be performed. B Gene level enrichment of all differentially spliced exons in the Reactome database fails to capture the most relevant pathways. C In contrast, NEASE shows an enrichment of the GPCR downstream signaling and other related pathways that are well known to be important in platelet activation. D A further look at the genes driving the enrichment of the GPCR pathway shows the most relevant genes affected by AS
NEASE enrichment obtained from AS comparison between normal-appearing white matter and acute lesions, from multiple sclerosis patients. The highly enriched pathways belong to Neurotransmitter receptors, MAPK, and bacterial infection. Most of these pathways are hallmarks of MS. The NEASE score is obtained after combining the p value with the number of significant genes. The latter is obtained after individual tests for each gene in the column “Spliced genes” (see the “Methods” section)
| Pathway name | Spliced genes (number of interactions affecting the pathway) | adj | NEASE score | |
|---|---|---|---|---|
| Neurotransmitter receptors and postsynaptic signal transmission | GRIA1 (7), ATP2B1 (2), BRAF (4), MAP2K4 (1), GRIN1 (4) | 4.38e−09 | 0.000004 | 16.71 |
| Uptake and function of anthrax toxins | ATP2B1 (1), BRAF (5), MAP2K4 (3) | 2.98e−09 | 0.000004 | 14.76 |
| Transmission across chemical synapses | GRIA1 (7), ATP2B1 (2), BRAF (4), MAP2K4 (1), GRIN1 (4) | 5.65e−08 | 0.000010 | 14.49 |
| Uptake and actions of bacterial toxins | ATP2B1 (1), BRAF (5), MAP2K4 (3) | 3.46e−08 | 0.000009 | 12.92 |
| Neuronal system | GRIA1 (7), ATP2B1 (2), BRAF (4), MAP2K4 (1), GRIN1 (4) | 8.71e−07 | 0.000122 | 12.11 |
| MAPK family signaling cascades | MYH10 (2), ATP2B1 (1), BRAF (17), MAP2K4 (5), GRIN1 (3) | 1.52e−06 | 0.000184 | 10.07 |
| Activation of NMDA receptor and postsynaptic events | GRIA1 (2), ATP2B1 (1), BRAF (4), MAP2K4 (1), GRIN1 (3) | 2.12e−06 | 0.000241 | 9.82 |
| FCERI mediated MAPK activation | MYH10 (1), BRAF (7), MAP2K4 (8) | 2.52e−07 | 0.000038 | 9.33 |
| RAF/MAP kinase cascade | MYH10 (1), ATP2B1 (1), BRAF (16), MAP2K4 (4), GRIN1 (3) | 1.00e−06 | 0.000130 | 8.48 |
| Signaling by moderate kinase activity BRAF mutants | MYH10 (1), BRAF (14), MAP2K4 (2) | 8.30e−09 | 0.000004 | 8.08 |
Enrichment of the pathway “Dilated cardiomyopathy (DCM)” from KEGG for the exons differentially used in DCM patients. The table shows the most significant genes (p value< 0.05) (see the “Methods” section)
| Differentially spliced genes | DCM associated | Percentage of affected edges associated with DCM | Affected binding (edges) associated with DCM | |
|---|---|---|---|---|
| MYO19 | No | 6/51 | 0.000002 | MYL2, TPM4, TPM3, TPM2, TPM1, ACTG |
| OBSCN | No | 1/2 | 0.014 | TTN |
| USP49 | No | 1/4 | 0.028 | PRKACA |
| CACNA1C | Yes | 1/4 | 0.028 | RYR2 |