| Literature DB >> 26437396 |
Elvezia Maria Paraboschi1, Giulia Cardamone2, Valeria Rimoldi3,4, Donato Gemmati5, Marta Spreafico6, Stefano Duga7,8, Giulia Soldà9,10, Rosanna Asselta11,12.
Abstract
Abnormalities in RNA metabolism and alternative splicing (AS) are emerging as important players in complex disease phenotypes. In particular, accumulating evidence suggests the existence of pathogenic links between multiple sclerosis (MS) and altered AS, including functional studies showing that an imbalance in alternatively-spliced isoforms may contribute to disease etiology. Here, we tested whether the altered expression of AS-related genes represents a MS-specific signature. A comprehensive comparative analysis of gene expression profiles of publicly-available microarray datasets (190 MS cases, 182 controls), followed by gene-ontology enrichment analysis, highlighted a significant enrichment for differentially-expressed genes involved in RNA metabolism/AS. In detail, a total of 17 genes were found to be differentially expressed in MS in multiple datasets, with CELF1 being dysregulated in five out of seven studies. We confirmed CELF1 downregulation in MS (p=0.0015) by real-time RT-PCRs on RNA extracted from blood cells of 30 cases and 30 controls. As a proof of concept, we experimentally verified the unbalance in alternatively-spliced isoforms in MS of the NFAT5 gene, a putative CELF1 target. In conclusion, for the first time we provide evidence of a consistent dysregulation of splicing-related genes in MS and we discuss its possible implications in modulating specific AS events in MS susceptibility genes.Entities:
Keywords: alternative splicing; autoimmune neurodegenerative disorder; microarray; multiple sclerosis; peripheral blood mononuclear cells
Mesh:
Substances:
Year: 2015 PMID: 26437396 PMCID: PMC4632709 DOI: 10.3390/ijms161023463
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Characteristics of multiple sclerosis (MS)-related datasets included in the study.
| Dataset (accession number) | Array Type | MS Cases/Controls | Origin | % Females (Cases/Controls) | Age (Years) | Notes on MS Patients Included | Tissue | Ref. |
|---|---|---|---|---|---|---|---|---|
| GSE21942 | Affymetrix Human Genome U133 Plus 2.0 Array | 10/15 | Finland | 100/100 | Mean age cases: 54.2 Mean age controls: 71.6 | 4 patients under treatment | Peripheral blood mononuclear cells | [ |
| GSE41848 | Affymetrix Human Exon 1.0 ST Array | 54/38 (discovery dataset) | n.a. | 71/74 | Median age cases: | Only RR-MS untreated cases included in the analyses | Whole blood | [ |
| GSE41849 | Affymetrix Human Exon 1.0 ST Array | 21/22 (replication dataset) | 66/59 | Median age cases: | ||||
| GSE41890 | Affymetrix Human Gene 1.0 ST Array | 22/24 | n.a. | 56/52 | Mean age cases: | RR-MS patients in remitting phase | Peripheral blood leukocytes | [ |
| GSE17048 | Illumina HumanHT-12 V3.0 Expression BeadChip | 36/45 | Australia | 80/64 | Mean age cases: | Only RR-MS untreated cases included in the analyses | Whole blood | [ |
| GSE43592 | Affymetrix Human Genome U133 Plus 2.0 Array | 10/10 | Sweden | 40/40 | Mean age cases: 38 Mean age controls: 40 | RR-MS | Peripheral blood T-cells | [ |
| GSE13732 | Affymetrix Human Genome U133 Plus 2.0 Array | 37 */28 | 92% Caucasian | 74/64 | Mean age cases: 37 Mean age controls: 35 | - | Naïve CD4+ T cells | [ |
* Clinically isolated syndrome (CIS), the earliest clinical manifestation of MS. Abbreviations: n.a., not available; RR, relapsing remitting MS.
Summary of the DAVID results.
| Dataset (Accession Number) | No. of Identified Cluster * | No. of Significantly Enriched Clusters ** | Enrichment Score Range (max–min) *** | “Nuclear mRNA Splicing” Category | “NADH Dehydrogenase Complex” Category | ||
|---|---|---|---|---|---|---|---|
| Cluster Rank | Enrichment Score | Cluster Rank | Enrichment Score | ||||
| GSE21942 | 218 | 27 | 12–1.34 | 1 | 12 **** | 8 | 3 **** |
| GSE41848 | 246 | 37 | 4.2–1.32 | 7 | 2.67 | 6 | 2.97 |
| GSE41849 | 214 | 9 | 5.52–1.35 | - | - | 1 and 2 | 5.52 and 5.1 **** |
| GSE41890 | 185 | 10 | 5.1–1.47 | - | - | - | - |
| GSE17048 | 174 | 10 | 2.41–1.34 | 4 | 2.16 | 10 | 1.34 |
| GSE43592 | 222 | 23 | 5.24–1.32 | 10 | 1.82 | - | - |
| GSE13732 | 199 | 25 | 9.06–1.33 | 3 | 4.34 **** | - | - |
* Total number of groups of functionally-related genes identified by DAVID through the integration of data from dozens of databases (or details see [27]); listed numbers include both significantly and not-significantly enriched groups; ** Clusters significantly enriched are those showing a DAVID enrichment score >1.3 (corresponding to p < 0.05 calculated through a modified Fisher exact test, one tailed [27]); *** It is provided the score range for the significantly enriched groups; and **** Cluster surviving the correction for multiple testing (Benjamini correction). DAVID, Database for Annotation, Visualization and Integrated Discovery (http://david.abcc.ncifcrf.gov/).
The 17 splice-regulatory genes found dysregulated in at least three MS datasets enriched for the gene-ontology category “nuclear mRNA splicing”.
| Gene Symbol | Gene Name | No. of Datasets in Which the Gene Is DE | Combined T Stat * | Combined |
|---|---|---|---|---|
| CELF1/CUGBP1 | CUG triplet repeat, RNA binding protein 1 | 5 out of 5 | −38.352 | 4.0144 × 10−5 |
| MBNL1 | muscleblind-like splicing regulator 1 | 4 out of 5 | 36.567 | 7.6664 × 10−5 |
| CSTF3 | cleavage stimulation factor, 3ʹ pre-RNA, subunit 3 | 3 out of 5 | 41.24 | 1.4073 × 10−5 |
| SRSF10 | FUS interacting protein (serine/arginine-rich) 1 | n.c. | n.c. | |
| GEMIN6 | gem (nuclear organelle) associated protein 6 | −34.092 | 0.00018533 | |
| HNRNPA2B1 | heterogeneous nuclear ribonucleoprotein A2/B1 | 43.464 | 6.2483 × 10−6 | |
| HNRNPH1 | heterogeneous nuclear ribonucleoprotein H1 (H) | 39.348 | 2.7932 × 10−5 | |
| HNRNPH3 | heterogeneous nuclear ribonucleoprotein H3 (2H9) | 32.941 | 0.00027984 | |
| DDX39B | HLA-B associated transcript 1 | n.c. | n.c. | |
| PCBP1 | poly(rC) binding protein 1 | −35.428 | 0.0001146 | |
| POLR2E | polymerase (RNA) II (DNA directed) polypeptide E, 25 kDa | n.c. | n.c. | |
| POLR2J | polymerase (RNA) II (DNA directed) polypeptide J, 13.3 kDa | −27.037 | 0.0021827 | |
| RPL36AP51 | ribosomal protein L36a pseudogene 51 | n.c. | n.c. | |
| SRSF1 | splicing factor, arginine/serine-rich 1 | 46.327 | 2.1799 × 10−6 | |
| SCAF11 | splicing factor, arginine/serine-rich 2, interacting protein | 26.307 | 0.0028008 | |
| SRSF3 | splicing factor, arginine/serine-rich 3 | −42.054 | 1.0493 × 10−5 | |
| YTHDC1 | YTH domain containing 1 | n.c. | n.c. |
Combined T stats and p values were calculated using the INMEX software. Datasets included in this analysis were: GSE21942, GSE17048, GSE43592, and GSE13732. * The minus sign indicates decreased expression levels in cases with respect to controls. DE: differentially expressed; n.c.: not calculated (not surviving the threshold of significance of 0.05).
Figure 1CUGBP Elav-like family member 1 (CELF1) expression levels in peripheral blood mononuclear cells (PBMCs) of MS patients and controls. Expression levels were measured by semi-quantitative real-time RT-PCRs in 30 MS patients and 30 controls, using primers located in exons 10 and 11, as depicted in the scheme above boxplots. Boxplots show CELF1 expression levels according to disease status. Boxes define the interquartile range; the thick line refers to the median. Results are presented as normalized rescaled values. Significance level for differences between groups was calculated by a t-test. **: p < 0.005.
Figure 2Splicing pattern analysis of the NFAT5 gene. (A) Schematic representation of the NFAT5 gene between exons 1 and 5. Exons and introns are represented by boxes and lines, and are approximately drawn to scale. Exons indicated in black are those undergoing alternatively skipping (alone or in combination). The positions of the primer couple used for competitive fluorescent RT-PCR experiments are shown by arrows (the forward primer was labeled with the FAM fluorophore). The C > T transition (rs12599391) abolishing the intron-2 ISE element is also indicated; (B) Fluorescent RT-PCR products, obtained from the RNA extracted from PBMCs of a control individual (heterozygous CT for the rs12599391 polymorphism), were separated by using capillary electrophoresis. The panel on the left is a close-up view of the GeneMapper window encompassing all peaks corresponding to different isoforms. A schematic representation of the obtained RT-PCR products is also depicted on the right; (C) Boxplot diagram showing the percentage of the exon-2-including transcripts among all NFAT5 mRNAs, according to disease status and by stratifying individuals on the basis of their rs12599391 genotype. Expression levels were measured by fluorescent competitive RT-PCRs. Significance levels of t-tests and of the one-way ANOVA analyses, are shown above and below the boxplots, respectively. * p < 0.05; ns: not significant.
Figure 3A single network includes most splice regulatory genes dysregulated in MS. The network was generated through the Ingenuity Pathway Analysis software, using as input the list of genes in Table 3. The genes in the original list are represented by hexagonal nodes, whereas those integrated into the computationally-generated network on the basis of the evidence stored in the IPA knowledge memory are indicated as circles. Node color indicates differential expression levels: Yellow: for genes upregulated in MS cases, blue: downregulated in MS cases, and grey: not differentially expressed (Table 3 data). Edges represent gene relationships, with red arrows: indicating CELF1 targets in resting T cells [45].