| Literature DB >> 34681001 |
Nadia Barizzone1, Rachele Cagliani2, Chiara Basagni1, Ferdinando Clarelli3, Laura Mendozzi4, Cristina Agliardi4, Diego Forni2, Martina Tosi1, Elisabetta Mascia3, Francesco Favero5, Davide Corà5, Lucia Corrado1, Melissa Sorosina3, Federica Esposito3, Miriam Zuccalà1, Domizia Vecchio6, Maria Liguori7, Cristoforo Comi6, Giancarlo Comi8, Vittorio Martinelli9, Massimo Filippi8,9,10,11,12, Maurizio Leone13, Filippo Martinelli-Boneschi14,15, Domenico Caputo4, Manuela Sironi2, Franca Rosa Guerini4, Sandra D'Alfonso1.
Abstract
Known multiple sclerosis (MS) susceptibility variants can only explain half of the disease's estimated heritability, whereas low-frequency and rare variants may partly account for the missing heritability. Thus, here we sought to determine the occurrence of rare functional variants in a large Italian MS multiplex family with five affected members. For this purpose, we combined linkage analysis and next-generation sequencing (NGS)-based whole exome and whole genome sequencing (WES and WGS, respectively). The genetic burden attributable to known common MS variants was also assessed by weighted genetic risk score (wGRS). We found a significantly higher burden of common variants in the affected family members compared to that observed among sporadic MS patients and healthy controls (HCs). We also identified 34 genes containing at least one low-frequency functional variant shared among all affected family members, showing a significant enrichment in genes involved in specific biological processes-particularly mRNA transport-or neurodegenerative diseases. Altogether, our findings point to a possible pathogenic role of different low-frequency functional MS variants belonging to shared pathways. We propose that these rare variants, together with other known common MS variants, may account for the high number of affected family members within this MS multiplex family.Entities:
Keywords: NGS; linkage study; multiple sclerosis; multiplex families; rare variants
Mesh:
Year: 2021 PMID: 34681001 PMCID: PMC8535321 DOI: 10.3390/genes12101607
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Index family pedigree. Individuals with shaded symbols are affected by MS.
Figure 2Study workflow.
Figure 3wGRS comparison between index family members, non-familial MS cases, and healthy controls. wGRS analysis of known common MS risk alleles. wGRS was constructed with results of 42 non-HLA SNPs and 8 HLA markers (risk alleles DRB1*15:01, 03:01, 13:03, 08:01, and the protective alleles A*02:01, B*44:02, B*38:01; B*55:01). Within the family member category, two healthy members (I.2 and III.3) were included in the distribution (bright pink dots), but they were excluded from the statistical analysis.
Gene based burden test analysis of the multiplex cohort.
| Gene | Chr | No. of Variants a | Burden b |
|
|---|---|---|---|---|
|
| 15 | 4 | −1.6295 | 0.103214 |
|
| 9 | 4 | −1.3859 | 0.165765 |
|
| 10 | 5 | −1.3563 | 0.174993 |
|
| 9 | 2 | 1.2021 | 0.229316 |
|
| 6 | 7 | −1.0906 | 0.275457 |
|
| 18 | 10 | −0.781 | 0.434797 |
|
| 8 | 3 | −0.6585 | 0.510208 |
|
| 21 | 6 | 0.4399 | 0.659993 |
|
| 17 | 5 | 0.3498 | 0.726526 |
|
| 18 | 4 | −0.1488 | 0.88168 |
|
| 2 | 2 | 0.1461 | 0.883868 |
|
| 15 | 2 | 0.1461 | 0.883868 |
|
| 12 | 3 | −0.1299 | 0.896626 |
|
| 1 | 4 | 0.0279 | 0.977741 |
a Total number of low-frequency functional variants observed in the multiplex sample set. b A positive stat means that the number of rare functional variants is increased in MS patients compared to healthy individuals.
Genes presenting with only one low-frequency functional variant in a multiplex sample set. For these genes, the burden test was not performed, as they only displayed one rare variant with a predicted functional effect.
| Gene | Chr | rs ID | POS (hg19) | REF | ALT | n MS a | n CT a | Effect | Ann | AF | CADD |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 1 | rs3831366; | 1334052 | CTAGAG | C | 6 f | 5 f | Splice acceptor variant | HIGH | 0.0487 | NA |
|
| 18 | chr18:66358530 | 66358530 | A | G | 2 | 0 | Splice donor variant | HIGH | −1 | 24.5 |
a For every family, only one subject was considered, independently of their health status. b Predicted impact on protein function. c Allelic frequency on Europeans. e The CCNL2 was observed also in the index family. f The deletion was observed in 12 MS patients belonging to 6 different multiplex families, 5 healthy relatives of the patients, and 5 unrelated healthy subjects. d Phred CADD score.
Enrichment analysis. The table shows the results obtained using ToppGene software.
| ToppGene | |||||||
|---|---|---|---|---|---|---|---|
| Category | GO Term | Description | Genes | No. of Selected Genes | No. of Hits in Genome |
| |
| Biological processes a | GO:0051028 | mRNA transport |
| 5 | 157 | 6.2 × 10−6 | 0.011 |
| Molecular function | GO:0003774 | motor activity |
| 5 | 134 | 3.44 × 10−6 | 9.50 × 10−4 |
| Molecular function | GO:0003713 | transcription coactivator activity b |
| 5 | 285 | 1.29 × 10−4 | 3.57 × 10−2 |
| Disease | C0524851 | Neurodegenerative Disorders |
| 11 | 1480 | 1.01 × 10−5 | 0.0156 |
| Disease | C0027765 | nervous system disorder c |
| 9 | 942 | 1.12 × 10−5 | 0.0173 |
| Transcription factors | RFX1_01 d |
| 4 | 194 | 7.82 × 10−5 | 0.0375 | |
a The same five genes also resulted significantly enriched in the mRNA transport pathway according to Metascape software: No. of hits in genome = 152; p = 1.05 × 10−6; p adjusted = 0.024. b Two genes (MMS19 and PPRC1) are located in the same linkage region. Enrichment analysis was re-run excluding one of the genes. The term still shows a nominal enrichment (p = 0.00105, but it is no longer significant after applying Bonferroni correction. c Two genes (ABCA1 and RNF20) are located in the same linkage region. Enrichment analysis was re-run excluding one of the genes. The term still shows a nominal enrichment (p = 3.03 × 10−4), but it is no longer significant after applying Bonferroni correction. d Two genes (PPP2R2B and YIPF5) are located in the same linkage region. Enrichment analysis was re-run excluding one of the genes, but it is not significant. No. of hits in genome: number of genes in genome linked to the specific GO Term. For p adjusted, Bonferroni correction was performed, multiplying for the number of annotations in the category.