| Literature DB >> 35047017 |
Ferdinando Clarelli1, Nadia Barizzone2, Eleonora Mangano3, Miriam Zuccalà2, Chiara Basagni2, Santosh Anand4, Melissa Sorosina1, Elisabetta Mascia1, Silvia Santoro1, Franca Rosa Guerini5, Eleonora Virgilio6, Antonio Gallo7, Alessandro Pizzino2, Cristoforo Comi6, Vittorio Martinelli8, Giancarlo Comi9, Gianluca De Bellis3, Maurizio Leone10, Massimo Filippi8,9,11,12, Federica Esposito1,8, Roberta Bordoni3, Filippo Martinelli Boneschi13,14, Sandra D'Alfonso2.
Abstract
Genome-wide association studies identified over 200 risk loci for multiple sclerosis (MS) focusing on common variants, which account for about 50% of disease heritability. The goal of this study was to investigate whether low-frequency and rare functional variants, located in MS-established associated loci, may contribute to disease risk in a relatively homogeneous population, testing their cumulative effect (burden) with gene-wise tests. We sequenced 98 genes in 588 Italian patients with MS and 408 matched healthy controls (HCs). Variants were selected using different filtering criteria based on allelic frequency and in silico functional impacts. Genes showing a significant burden (n = 17) were sequenced in an independent cohort of 504 MS and 504 HC. The highest signal in both cohorts was observed for the disruptive variants (stop-gain, stop-loss, or splicing variants) located in EFCAB13, a gene coding for a protein of an unknown function (p < 10-4). Among these variants, the minor allele of a stop-gain variant showed a significantly higher frequency in MS versus HC in both sequenced cohorts (p = 0.0093 and p = 0.025), confirmed by a meta-analysis on a third independent cohort of 1298 MS and 1430 HC (p = 0.001) assayed with an SNP array. Real-time PCR on 14 heterozygous individuals for this variant did not evidence the presence of the stop-gain allele, suggesting a transcript degradation by non-sense mediated decay, supported by the evidence that the carriers of the stop-gain variant had a lower expression of this gene (p = 0.0184). In conclusion, we identified a novel low-frequency functional variant associated with MS susceptibility, suggesting the possible role of rare/low-frequency variants in MS as reported for other complex diseases.Entities:
Keywords: EFCAB13; burden test; multiple sclerosis; pool sequencing; rare variants
Year: 2022 PMID: 35047017 PMCID: PMC8762330 DOI: 10.3389/fgene.2021.800262
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Features of the three cohorts.
| Discovery | Replication | Array cohort | |||||
|---|---|---|---|---|---|---|---|
| MS | HC | MS | HC | MS | HC | ||
| N | — | 588 | 408 | 504 | 504 | 1,298 | 1,430 |
| Sex | % females | 60.33 | 57.84 | 68.05 | 32.14 | 66.49 | 33.08 |
| Disease course | % BOMS | 90.10 | — | 90.59 | — | 79.79 | — |
| % PPMS | 9.90 | — | 9.41 | - | 20.21 | — | |
| Age at onset (years) | Mean | 29.97 | — | 30.31 | — | 32.19 | — |
| Min–max | 10–65 | — | 10–62 | — | 10–69 | — | |
| SD | 10.03 | — | 9.96 | — | 10.80 | — | |
| Disease duration (years) | Mean | 11.33 | — | 11.68 | — | 10.18 | — |
| Min–max | 0–45 | — | 0–43 | — | 0–44 | — | |
| SD | 8.16 | — | 8.66 | — | 7.28 | — | |
| EDSS | Median | 2.5 | — | 2 | — | 2 | — |
| Range | 0–9.5 | — | 0–9.0 | — | 0–9.5 | — | |
Clinical and demographic features of MS patients, and HC, for the three cohorts; BOMS, bout-onset MS, PPMS, primary progressive MS, EDSS, expanded disability status scale; SD, standard deviation.
Number of SNVs and the top-associated gene in each of the seven filters in the discovery cohort.
| WSS | C-alpha | |||||||
|---|---|---|---|---|---|---|---|---|
| Filter | No. of SNVs | Tested genes | Top gene | No. of SNVs | P | Top gene | No. of SNVs | P |
|
| 17 | 1 |
| 8 | <0.0001 |
| 8 | 0.0023 |
|
| 598 | 75 |
| 13 | 0.001 |
| 13 | 0.021 |
|
| 624 | 77 |
| 16 | <0.0001 |
| 16 | 0.007 |
|
| 367 | 58 |
| 7 | 0.0093 |
| 7 | 0.0125 |
|
| 381 | 61 |
| 10 | 0.0002 |
| 10 | 0.0054 |
|
| 598 | 83 |
| 22 | 0.0012 |
| 22 | 0.022 |
|
| 1,393 | 84 |
| 21 | 0.0213 |
| 141 | 0.0011 |
Genes nominated for replication.
| Symbol | Chr | Disr | MisDisr_01 | MisDisr_05 | DamDisr_01 | DamDisr_05 | SynMisDisr_01 | Reg_01 |
|---|---|---|---|---|---|---|---|---|
|
| 1 | — |
|
| 0.6824 | 0.6824 | 0.0723 | 0.8001 |
|
| 1 | — | 0.3668 | 0.3668 | — | — |
|
|
|
| 3 | — | 0.7969 | 0.7969 | 0.5683 | 0.5683 | 0.5977 |
|
|
| 6 | — | 0.5763 | 0.5100 | 0.8311 | 0.8311 |
| 0.2600 |
|
| 6 | — |
|
|
|
|
| 0.8813 |
|
| 6 | — | 0.0727 | 0.3667 | 0.1780 | 0.1780 |
| 0.9128 |
|
| 7 | — | 0.1757 | 0.1757 | 0.1757 | 0.1757 |
| 0.6864 |
|
| 8 | — | 0.5695 | 0.5695 | 0.7601 | 0.7601 | 0.5426 |
|
|
| 8 | — | 0.2675 |
| 0.7708 |
| 0.1548 | 0.5452 |
|
| 11 | — | 0.9656 | 0.9656 | 0.6077 | 0.6077 |
| 0.5679 |
|
| 12 | — | 0.5840 |
| — | — | 0.584 | 0.5223 |
|
| 13 | — | 1 | 1 | — | — | 0.1918 |
|
|
| 16 | — | — | — | — | — | — |
|
|
| 17 |
|
|
|
|
|
| 0.8669 |
|
| 17 | — | 0.6933 | 0.6933 | 0.6933 | 0.6933 | 0.2196 |
|
|
| 17 | — |
|
| 0.3283 | 0.3283 | 0.2611 | 0.6951 |
|
| 22 | — | 0.9688 | 0.9688 | 0.9688 | 0.9688 |
| — |
p-values for the six filters for the genes that were nominated for replication. p-values were computed upon the application of the hybrid test in the discovery cohort. For nomenclature of filters, see the Methods section. Significant associations at p < 0.05 for the various filters are labeled in bold.
FIGURE 1Bar plots of minor allele frequency for EFCAB13 disruptive variants. Bar plots reporting minor allele frequencies for SNVs classified as disruptive (stop-loss, stop-gain, and splice site) located in the EFCAB13 gene for the three investigated cohorts. Red: MS patients, blue: healthy controls. (A) discovery cohort (N = 8 SNVs), (B) replication cohort (N = 6 SNVs), (C) array-based cohort (N = 5 SNVs), and (D) forest plot for meta-analyzed odd ratios estimated for stop-gain SNV rs118004742.
FIGURE 2Genomic structure of the two main isoforms of the EFCAB13 gene. GeneBank references (Ensembl) of EFCAB13 isoforms are ENST00000331493 (Isoform A) and ENST00000517484 (isoform B), respectively. The positions of the eight disruptive variants are displayed in red (stop-gain variants) or green (splicing variants).
FIGURE 3Analysis of the EFCAB13 transcript. (A–C) Electropherograms derived from Sanger sequencing on a heterozygous healthy subject for the variant rs118004742. (A) Sequence derived from RT-PCR designed to amplify cDNA of only isoform B of the EFCAB13 transcript; (B) sequence derived from RT-PCR designed to amplify cDNA of both A and B EFCAB13 transcripts; (C) sequence derived from PCR on genomic DNA of the same subject. The rs118004742 variant is indicated by an arrow. (D) EFCAB13 cDNA expression levels by ddPCR analysis. Box plots display the expression levels by ddPCR analysis in 23 subjects, including 10 carriers (TG) and 13 non-carriers (TT) of the stop-gain rs118004742-G variant. The ddPCR assay tests the expression of both A and B EFCAB13 isoforms. The p-value was derived from the Mann–Whitney test.