| Literature DB >> 32032478 |
Iris M de Lange1, Flip Mulder1, Ruben van 't Slot1, Anja C M Sonsma1, Marjan J A van Kempen1, Isaac J Nijman1, Robert F Ernst1, Nine V A M Knoers1,2, Eva H Brilstra1, Bobby P C Koeleman1.
Abstract
BACKGROUND: SCN1A is one of the most important epilepsy-related genes, with pathogenic variants leading to a range of phenotypes with varying disease severity. Different modifying factors have been hypothesized to influence SCN1A-related phenotypes. We investigate the presence of rare and more common variants in epilepsy-related genes as potential modifiers of SCN1A-related disease severity.Entities:
Keywords: zzm321990SCN1Azzm321990; Dravet; GEFS+; epilepsy; modifier genes; phenotypic variability
Mesh:
Substances:
Year: 2020 PMID: 32032478 PMCID: PMC7196470 DOI: 10.1002/mgg3.1103
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Number of variants found in the ExAC database and in the current cohort for different groups of patients and different categories of variants
| Gene set | Type C variants (MAF | Type D variants (MAF <0.05) | Type E variants (MAF <0.1) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ExAC | Complete cohort | Extreme patients | Intermediate patients | ExAC | Complete cohort | Extreme patients | Intermediate patients | ExAC | Complete cohort | Extreme patients | Intermediate patients | |
| Epilepsy genes | 300,155 | 851 | 337 | 161 | 556,293 | 1,876 | 662 | 335 | 805,686 | 2,455 | 888 | 481 |
| Control 1 | 299,750 | 858 | 305 | 176 | 641,440 | 1,778 | 626 | 371 | 950,857 | 2,608 | 893 | 546 |
| Control 2 | 90,285 | 278 | 130 | 49 | 185,549 | 497 | 218 | 83 | 260,406 | 765 | 327 | 140 |
| Control 3 | 260,659 | 707 | 270 | 147 | 513,655 | 1,378 | 501 | 300 | 812,452 | 2,165 | 798 | 467 |
| Control 4 | 261,763 | 702 | 247 | 131 | 565,998 | 1,603 | 579 | 331 | 853,276 | 2,291 | 824 | 473 |
| Control 1–4 (total) | 912,457 | 2,545 | 952 | 503 | 1,906,642 | 5,256 | 1,924 | 1,085 | 2,876,991 | 7,829 | 2,842 | 1,626 |
| ID genes | 557,769 | 1,572 | 582 | 355 | 1,099,721 | 3,172 | 1,132 | 708 | 1,707,615 | 4,919 | 1,739 | 1,071 |
Minor allele frequency; only variants with a frequency below this threshold in both the exomes and genomes in the gnomAD database are included.
Overrepresentation of variants in epilepsy genes in the cohort, calculated based on different sets of control genes
| Assessed group of genes | Based on control set | Over‐ or underrepresentation of epilepsy gene variants in cohort (% more or less than expected | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Type C variants (MAF | Type D variants (MAF <0.05) | Type E variants (MAF <0.1) | ||||||||
| Complete cohort | Extreme patients | Intermediate patients | Complete cohort | Extreme patients | Intermediate patients | Complete cohort | Extreme patients | Intermediate patients | ||
| Epilepsy genes | 1 | −1 (.846) | 10 (.221) | 9 (.414) |
| 22 (<.0005) | 4 (.598) |
| 17 (.001) | 4 (.552) |
| 2 | −8 (.229) | −22 (.017) | −1 (.941) |
| 1 (.904) | 35 (.016) | 4 (.388) | −12 (.044) | 11 (.283) | |
| 3 | 5 (.387) | 8 (.329) | −5 (.690) |
| 22 (.001) | 3 (.721) |
| 12 (.019) | 4 (.581) | |
| 4 | 6 (.285) | 19 (.038) | 7 (.597) |
| 16 (.008) | 3 (.727) |
| 14 (.007) | 8 (.257) | |
| 1–4 (total) | 2 (.677) | 8 (.245) | −3 (.785) |
|
| 6 (.374) |
| 12 (.005) | 6 (.292) | |
| ID genes | 1 | −2 (.718) | 3 (.750) | 8 (.042) | 4 (.184) | 5 (.288) | 11 (.101) | 5 (.045) | 8 (.051) | 9 (.097) |
| 2 | −8 (.178) | −28 (.001) | 17 (.315) | 8 (.127) | −12 (.073) | 44 (.002) | −2 (.612) | −19 (.001) | 17 (.092) | |
| 3 | 4 (.405) | 1 (.940) | 13 (.231) | 8 (.025) | 6 (.326) | 10 (.166) | 8 (.003) | 4 (.407) | 9 (.120) | |
| 4 | 5 (.280) | 11 (.193) | 27 (.020) | 2 (.561) | 1 (.918) | 10 (.159) | 7 (.006) | 5 (.216) | 13 (.025) | |
| 1–4 (total) | 1 (.784) | 0 (1.000) | 15 (.041) | 5 (.045) | 2 (.599) | 13 (.011) | 6 (.002) | 3 (.320) | 11 (.008) | |
| Control set genes | 3 | 6 (.298) | −2 (.835) | 4 (.738) | 3 (.370) | 0 (1.000) | −1 (.908) | 3 (.323) | −4 (.367) | 0 (1.000) |
| 4 | 7 (.204) | 8 (.393) | 17 (.170) | −2 (.535) | −5 (.419) | −1 (.910) | 2 (.465) | −3 (.578) | 4 (.594) | |
Numbers represent the percentage of over‐ or underrepresentation of variants in epilepsy genes, based on a comparison of ratios of variants found in epilepsy genes and in different control groups, in the ExAC database and the current cohort (e.g. the first “7” means that 107% of the expected number of epilepsy genes was identified in our cohort, based on the ratio of variants present in epilepsy genes and control 1‐genes in the ExAC database, and the number of variants in control‐1 genes present in our cohort).
p‐values are based on Fishers' exact tests on the ratios of variants found in the ExAC database and the current cohort. Significant values (p < .0003968) are bolded.
Minor allele frequency; only variants with a frequency below this threshold in both the exomes and genomes in the gnomAD database are included.
Figure 1Overrepresentation of variants in epilepsy genes in the cohort. Bars represent the percentage of over‐ or underrepresentation of variants in the different patient groups, based on the ratio of variants found in epilepsy genes and different control groups (ctrl 1, 2, 3, 4 and 1–4), compared to ratios in the ExAC database. (a) Variants in epilepsy genes compared to different control groups; (b) variants in intellectual disability genes compared to different control groups; (c) variants in control group 1 genes compared to control group 3 and 4 (negative control). Results are presented for categories of variants with different allele frequency cut‐offs (<0.01, <0.05, <0.1). Significant values are depicted by asterisks
Distribution of variants in the epilepsy genes between groups of patients, for different categories of genes
| Group of patients | Type A variants (CADD | Type B variants (CADD >10/MAF <0.01) | Type C variants (all CADD/MAF <0.01) | Type D variants (all CADD/MAF <0.05) | Type E variants (all CADD/MAF <0.10) | |
|---|---|---|---|---|---|---|
| % of variant alleles (based on total number of alleles per group) | Mild ( | 1.79 | 2.33 | 2.76 | 4.38 | 5.70 |
| Severe ( | 2.04 | 2.33 | 2.76 | 4.52 | 5.81 | |
| Mild + severe ( | 1.96 | 2.33 | 2.76 | 4.47 | 5.78 | |
| Intermediate ( | 1.95 | 2.06 | 2.35 | 4.03 | 5.57 | |
|
| Mild versus severe | .456 | 1 | .997 | .73 | .812 |
| Mild versus intermediate | .663 | .379 | .195 | .334 | .765 | |
| Severe versus intermediate | .783 | .294 | .112 | .109 | .491 | |
| Mild + severe versus intermediate | .998 | .259 | .089 | .113 | .727 |
PHRED‐scaled CADD (combined annotation dependent depletion). A score of >20 represents the top 1% deleterious substitutions in the human genome.
Minor allele frequency; only variants with a frequency below this threshold in both the exomes and genomes in the gnomAD database are included.
Figure 2Distribution of variants in epilepsy genes among different groups of patients. Distribution of variants in epilepsy genes among different groups of patients, depicted for different types of variants (A–E). 1: (type A variants) CADD >20/MAF <0.01; 2: (type B variants) CADD >10/MAF <0.01; 3: (type C variants) all CADD/MAF <0.01; 4: (type D variants) all CADD/MAF <0.05; 5: (type E variants) all CADD/MAF <0.10. Percentages of variants relative to the total number of alleles per group are shown
Top 5 genes with an overrepresentation of variants in mild or severe patients, per category of variants
| Type A variants (CADD | Type B variants (CADD >10/MAF <0.01) | Type C variants (all CADD/MAF <0.01) | Type D variants (all CADD/MAF <0.05) | Type E variants (all CADD/MAF <0.10) | |
|---|---|---|---|---|---|
| Genes with an excess of variants in mild patients (gene name [ |
|
|
| MOCS2 (.003) | MOCS2 (.003) |
|
|
|
| KCNH1 (.008) | KCNH1 (.008) | |
|
|
|
| EFHC1 (.01) | DRD4 (.009) | |
|
|
|
| SLC6A8 (.013) | SLC6A8 (.013) | |
|
|
|
| MYT1 (.027) | CTSD (.026) | |
| Genes with an excess of variants in severe patients (gene name [ |
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
| |
|
|
|
|
| ||
|
|
|
|
|
PHRED‐scaled CADD (combined annotation dependent depletion). A score of >20 represents the top 1% deleterious substitutions in the human genome.
Minor allele frequency; only variants with a frequency below this threshold in both the exomes and genomes in the gnomAD database are included.
p‐values are based on Fishers' exact test.
Rare and predicted deleterious variants present in only relatively mildly or severely affected members of families with varying SCN1A‐related phenotyes
| Family | Gene | Established epilepsy gene | Variant | MAF | CADD‐phred score | |
|---|---|---|---|---|---|---|
| Relatively severely affected family members | 1 |
| c.452T > C|p.Ile151Thr (missense) | 0 | 26.1 | |
| 1 |
| c.1435G > A|p.Val479Ile (missense) | 0.0047 | 22.2 | ||
| 2 |
| c.3151G > T|p.Asp1051Tyr (missense) | 0.0021 | 26 | ||
| 3 |
| c.1016G > A|p.Gly339Asp (missense) | 0.0015 | 25 | ||
| 3 |
| Yes | c.3551T > A|p.Leu1184Gln (missense) | 0 | 32 | |
| 3 |
| Yes | c.3434C > T|p.Ser1145Phe (missense) | 0.00007785 | 21.6 | |
| 3 |
| c.685C > A|p.His229Asn (missense) | 0.0003 | 25.5 | ||
| 3 |
| c.166G > A|p.Val56Met (missense) | 0.0019 | 27 | ||
| 4 |
| c.977C > T|p.Pro326Leu (missense & splice region) | 0.0007 | 23.5 | ||
| 5 |
| c.1648G > A|p.Ala550Thr (missense) | 0.00002394 | 24.3 | ||
| 5 |
| c.2201T > C|p.Phe734Ser (missense) | 0.0073 | 32 | ||
| 6 (proband and father) |
| c.359T > C|p.Leu120Pro (missense) | 0.0003 | 23.6 | ||
| 6 (proband and father) |
| c.566C > T|p.Ser189Leu (missense) | 0.0002 | 23.6 | ||
| 6 (proband and father) |
| c.22G > A|p.Gly8Arg (missense) | 0.0017 | 29.3 | ||
| 6 (proband and father) |
| c.194C > T|p.Ser65Phe (missense) | 0.000005 | 28.4 | ||
| 6 (only proband) |
| Yes | c.275A > T|p.Glu92Val (missense) | 0.001 | 25.9 | |
| 6 (only proband) |
| Yes | c.725C > T|p.Pro242Leu (missense) | 0.003 | 24.3 | |
| 6 (only proband) |
| Yes | c.766C > T|p.Arg256Trp (missense) | 0 | 29.3 | |
| Relatively mildly affected family members | 1 |
| Yes | c.2266A > C|p.Ile756Leu (missense) | 0 | 23.3 |
| 1 |
| c.76G > A|p.Asp26Asn (missense) | 0.0084 | 33 | ||
| 1 |
| c.9650C > T|p.Ala3217Val (missense) | 0.0091 | 23.6 | ||
| 1 |
| Yes | c.1379_1382delGAAT|p.Arg460fs (frameshift) | 0.0021 | 25.4 | |
| 1 |
| Yes | c.3982_3983insCCCCCCCG|p.Arg1328fs (frameshift) | 0 | 34 | |
| 1 |
| c.2171G > A|p.Arg724His (missense) | 0.0059 | 23.3 | ||
| 1 |
| c.805G > A|p.Val269Ile (missens) | 0.0025 | 28 | ||
| 2 (father and brother of proband) |
| c.4451A > G|p.Tyr1484Cys (missense) | 0.000008183 | 25.5 | ||
| 3 |
| Yes | c.647C > A|p.Pro216His (missense) | 0.0005 | 26.2 | |
| 3 |
| c.5074G > T|p.Gly1692Trp (missense) | 0.0002 | 34 | ||
| 4 |
| Yes | c.388G > A|p.Val130Met (missense) | 0.000004062 | 21 | |
| 4 |
| Yes | c.8384C > G|p.Thr2795Arg (missense) | 0.000008129 | 27.5 | |
| 6 |
| c.1165C > T|p.Arg389* (stop) | 0.00003253 | 37 | ||
| 6 |
| Yes | c.509A > G|p.Asn170Ser (missense) | 0.0003 | 23.3 | |
| 6 |
| c.5446G > A|p.Val1816Met (missense) | 0 | 22.5 | ||
| 6 |
| Yes | c.1243C > A|p.Leu415Ile (missense) | 0.0025 | 20.1 |
* indicates a stop mutation, as per the HGVS nomenclature guidelines that Molecular Genetics & Genomic Medicine requires to be used.
Members from families 1–6, as described in Data S2. The upper part of the table represents the patients who are relatively severely affected, compared to their other family members; the lower part of the table represents the participants who are relatively mildly affected, compared to their other family members.
Genes were considered established epilepsy genes when present in the diagnostic epilepsy gene panel of the University Medical Center Utrecht.
Highest frequency of the variant observed in both exomes and genomes in the gnomAD database (Exome Aggregation Consortium et al., 2015).
Combined annotation dependent depletion (Kircher et al., 2014): numbers represent PHRED‐scaled CADD scores. CADD scores of >20 represent the top 1% deleterious substitutions in the human genome.
Predicted most deleterious, rare variants in extremely mild and extremely severe patients
| Patient (IQ at the age of 6) | Established epilepsy genes | Candidate modifier genes | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Gene | Variant | CADD | MAF | Gene | Variant | CADD | MAF | ||
| Mild patients | 1 (89.1) |
|
c.1925C > T| p.Thr642Met | 27.5 | 0 |
|
c.661C > T| p.Arg221Cys | 33 | 0.0009 |
| 2 (72.8) |
|
c.367C > T| p.His123Tyr | 31 | 0.0036 |
|
c.496C > T| p.Arg166Trp | 34 | 0.0001 | |
| 3 (91.4) |
|
c.3770A > G| p.Asn1257Ser | 23.9 | 0.0044 |
|
c.1376C > T| p.Ser459Phe | 29.3 | 0.00004 | |
| Both 4 and 5 (brothers) (76.6 and 98.2) |
|
c.496G > A| p.Val166Met | 25.2 | 0 | — | — | — | — | |
| Both 6 and 7 (twins) (83.7 and 93.3) |
|
c.1353T > G| p.Cys451Trp | 26.7 | 0.00003964 |
|
c.4849G > T| p.Val1617Phe | 28.8 | 0.00007 | |
| 8 (100) |
|
c.5036C > T| p.Ala1679Val | 22.2 | 0.0006 | — | — | — | — | |
| 9 (75.7) |
|
c.1123C > T| p.Arg375Cys | 24.7 | 0.00003229 |
|
c.1123C > T| p.Arg375Cys | 32 | 0.0044 | |
| 10 (81.9) |
|
c.4972C > G| p.Arg1658Gly | 29.6 | 0.0001 |
|
c.2470G > T| p.Ala824Ser | 22.8 | 0.00001 | |
| Extremely severe patients | 1 (18.9) |
|
c.203G > A| p.Arg68Gln | 34 | 0.000005615 |
|
c.2716G > A| p.Glu906Lys | 28.8 | 0.0028 |
| 2 (22.7) |
|
c.5383C > T| p.Arg1795Cys | 31 | 0.0015 |
|
c.1779G > T| p.Trp593Cys | 34 | 0 | |
| 3 (25.4) |
|
c.1706A > G| p.Asp569Gly | 29.4 | 0.00002031 |
|
c.2117G > T| p.Arg706Leu | 32 | 0.0026 | |
| 4 (25.4) |
|
c.8350G > T| p.Val2784Phe | 32 | 0.0025 |
| c.6048 + 2_6048+5delTAGG | 35 | 0.00003 | |
| 5 (26.7) |
|
c.2353C > T| p.Arg785Trp | 24.4 | 0.0029 |
|
c.22G > A| p.Gly8Arg | 29.3 | 0.0017 | |
| 6 (27.6) | — | — | — | — |
|
c.1135G > A| p.Val379Met | 28.6 | 0.0054 | |
| 7 (28.3) |
|
c.1735A > G| p.Met579Val | 25.1 | 0.0000488 |
|
c.1414A > G| p.Ile472Val | 26.8 | 0.0001 | |
A distinction was made between established monogenetic epilepsy genes (when present in the diagnostic epilepsy gene panel of the University Medical Center Utrecht) and candidate genes (all other genes in the epilepsy group).
Patients with an interpolated IQ at the age of six years old >70 (all patients with an SCN1A variant that is predicted to cause complete LoF, or a variant that has been previously described in Dravet syndrome patients).
IQ‐ and developmental assessment scores, conducted at different ages, were interpolated by linear regression, to obtain approximate scores at age 6 years of age as previsously described (de Lange, Gunning, et al., 2018).
Combined annotation dependent depletion (Kircher et al., 2014): numbers represent PHRED‐scaled CADD scores (a score of >20 represents the top 1% deleterious substitutions in the human genome).
Highest frequency of the variant observed in both exomes and genomes in the gnomAD database (Exome Aggregation Consortium et al., 2015).
Patients with an interpolated IQ at the age of six years old <30.