Literature DB >> 32337343

Polygenic risk scores of several subtypes of epilepsies in a founder population.

Claudia Moreau1, Rose-Marie Rébillard1, Stefan Wolking1, Jacques Michaud1, Frédérique Tremblay1, Alexandre Girard1, Joanie Bouchard1, Berge Minassian1, Catherine Laprise1, Patrick Cossette1, Simon L Girard1.   

Abstract

OBJECTIVE: Polygenic risk scores (PRSs) are used to quantify the cumulative effects of a number of genetic variants, which may individually have a very small effect on susceptibility to a disease; we used PRSs to better understand the genetic contribution to common epilepsy and its subtypes.
METHODS: We first replicated previous single associations using 373 unrelated patients. We then calculated PRSs in the same French Canadian patients with epilepsy divided into 7 epilepsy subtypes and population-based controls. We fitted a logistic mixed model to calculate the variance explained by the PRS using pseudo-R2 statistics.
RESULTS: We show that the PRS explains more of the variance in idiopathic generalized epilepsy than in patients with nonacquired focal epilepsy. We also demonstrate that the variance explained is different within each epilepsy subtype.
CONCLUSIONS: Globally, we support the notion that PRSs provide a reliable measure to rightfully estimate the contribution of genetic factors to the pathophysiologic mechanism of epilepsies, but further studies are needed on PRSs before they can be used clinically.
Copyright © 2020 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the American Academy of Neurology.

Entities:  

Year:  2020        PMID: 32337343      PMCID: PMC7164970          DOI: 10.1212/NXG.0000000000000416

Source DB:  PubMed          Journal:  Neurol Genet        ISSN: 2376-7839


In the last decade, many groups have been working on different genetic techniques and statistics to better understand the complex genetic mechanisms underlying epilepsy.[1] Last year, a large genome-wide association study on epilepsy identified 16 loci associated with the disease, and many of these were already known or suspected.[2] Despite these efforts, there is still a substantial missing heritability component in epilepsy genetics.[3] It is likely that a wide spectrum of genetic factors is in play, ranging from very rare mutations with large effects to relatively rare variants with medium effect sizes and finally to common variants with smaller risk effects. Polygenic risk scores (PRSs) aim to quantify the cumulative effects of a number of variants, which may individually have a very small effect on susceptibility. They have been used previously in many common traits and diseases such as heart disease[4-6] and in neurologic disorders.[7-10] In this study, we aim to use PRSs to see whether this method can explain more of the epilepsy genetics than the classical methods did. We take advantage of the recent meta-analysis GWAS metrics[2] to calculate PRSs in 373 unrelated French Canadian (FC) patients with epilepsy divided into 7 subtypes. The population is known for its well-documented recent (400 years) founder effect and its particular genetic background, which makes it an ideal population for genetic studies. French Canadians are also closely related to the European population, which is predominant in the GWAS.[2]

Methods

Standard protocol approvals, registrations, and patient consents

This study was approved by the Centre de Recherche du Centre Hospitalier Universitaire de Montréal ethics committee, and written informed consent was obtained for all patients.

Phenotyping of patients

The epilepsy cohort was composed of families with at least 3 affected individuals with idiopathic generalized epilepsy (IGE) or nonacquired focal epilepsy (NAFE) previously collected and diagnosed by neurologists. The clinical epilepsy phenotype is defined based on the Classification of the Epilepsy Syndromes established by the International League Against Epilepsy (ILAE).[11] More specifically, the operational definitions of the epilepsy phenotypes studied in the project for NAFE were as follows: (1) patients were aged at least 5 years and had experienced at least 2 unprovoked seizures with focal EEG abnormalities or clear clinical focal semiology in the 6 months before starting treatment AND (2) an MRI scan of the brain that did not demonstrate any potentially epileptogenic lesion (no lesion) OR (3) documented hippocampal sclerosis and lesion other than mesial temporal sclerosis (other lesion). For IGE, patients were at least 4 years of age at the diagnosis and the IGE subtype (childhood absence epilepsy, juvenile absence epilepsy, juvenile myoclonic epilepsy or IGE not otherwise specified) was determined according to the 1989 ILAE syndrome definitions using clinical and EEG characteristics. In IGE, we also included patients with epilepsy with eyelid myoclonia (Jeavons), which is an idiopathic generalized form of reflex epilepsy characterized by childhood-onset, unique seizure manifestations, striking light sensitivity, and possible occurrence of generalized tonic-clonic seizures alone. Our epilepsy cohort consisted of 643 patients diagnosed with familial epilepsy. We used 1 patient per family for the analyses for a total of 373 not closely related FC patients, 192 with IGE and 132 with NAFE (and 49 unclassified epilepsies because of a lack of information). We validated that the remaining patients were not closely related (first cousins or more related) using PLINK. FC ancestry was assessed by self-declared ethnicity and principal component (PC) analysis (figure e-1, links.lww.com/NXG/A253). Table 1 shows the different subtypes of epilepsy that are represented in our cohort. In addition, we selected 954 FC individuals from a reference population data set.[12]
Table 1

Basic statistics for different epilepsy subtypes

Basic statistics for different epilepsy subtypes

Data availability

The patients' genotype data used in the present study will be available on request.

Genotyping and imputation

For this study, we used whole-genome genotyping data for the patient and the French Canadian control[12] cohorts. All samples were processed on either the Illumina Omni Express (number of single nucleotide polymorphisms = 710,000) or the Illumina Omni 2.5 (number of single nucleotide polymorphisms = 2,500,000 including the Omni Express core) depending on the availability of the chip regardless if they were controls or patients. Genotypes of all samples were merged, and only positions present on both chips were kept. We performed cleaning steps to remove individuals having more than 2% missing genotypes among all SNPs, SNPs with more than 2% missing SNPs over all individuals, and SNPs with Hardy-Weinberg p value <0.001 using PLINK software.[13] We then removed 121 individuals of non-FC descent using the first 2 principal components in addition to self-identification of patients whenever this information was available. Principal component analysis (figure e-1, links.lww.com/NXG/A253) was performed using Eigensoft[14] on pruned SNPs (pairwise r2 < 0.2 in sliding windows of size 50 shifting every 5 SNPs) at 5% minor allele frequency (MAF). We finally aligned the data set to the GRCh37 genome build for further imputation following the method described here.[15] The Sanger Imputation Service was used to conduct whole-genome imputation of SNPs.[16] We selected the Human Reference Consortium data set as the reference panel. Postimputation quality control filters were applied to remove SNPs within imputed data with an imputation info score of <0.9 or HWE p value of <1 e−6, and only biallelic SNPs at MAF 1% or higher were kept for further analyses.

Association analysis

We used PLINK software for the logistic association analysis with the first 10 PCs and sex as covariates. Associations were only tested for the 20 SNPs found significant in the ILAE study and only in the epilepsy subtypes in which they were originally reported. We used a p value threshold of 0.0025 to account for multiple testing (n = 20).

PRS calculation

PRSs were calculated with PRSice software[17] using ILAE meta-analysis on epilepsy summary statistics.[2] Because the BETA was not provided for the METAL analyses (all, generalized, and focal epilepsies' analyses), we used the formula from reference 18 to calculate it. We used the first 10 PCs in addition to sex as covariates, recalculating eigenvectors for each patient subset including controls using SNPs at MAF 0.05 pruned (as described above). PRSs were standardized for graphs.

Statistical analyses

We fitted a logistic regression mixed model using R. We then calculated the Nagelkerke pseudo-R2 (using the PRS at the p value threshold that best predicts the phenotype) with and without the PRS as the full and null model. Note that pseudo-R2 is reported on the observed scale to avoid overfitting.

Results

We used whole-genome genotyping on 373 unrelated patients having epilepsy and 954 population controls. All individuals were confirmed with French Canadian ancestry. FC control individuals used in this study have already been demonstrated to cluster with Western Europeans.[19] First, we wanted to assess whether the associations found by the ILAE study[2] were valid for our cohort. Table e-1, links.lww.com/NXG/A253, presents the statistics of the association analysis. One locus was found to be significant, and 3 were close to significance (p value threshold = 0.0025). These results show that our founder FC population shares a portion of the epilepsy genetic risks with the Western European populations studied by the ILAE. Next, to assess whether SNPs taken together could explain a portion the epilepsy phenotype, we used the basic statistics of the ILAE study[2] to construct the PRS. Figure 1 shows the density plots of standardized PRS values of patients compared with controls for the 3 broad epilepsy types; best-fit p values are shown in figure e-2 (links.lww.com/NXG/A253). Our first observation was that the PRS distribution is more shifted to the right in IGE than in NAFE, which is consistent with the heritability estimates reported in the ILAE study.[2] Figures 2 and 3 show the same analysis for the IGE and NAFE subtypes. The best-fits are shown in figures e-3 and e-4 (links.lww.com/NXG/A253).
Figure 1

PRS density for broad epilepsy types

PRS density plots for French Canadian controls and (A) all patients with epilepsy, (B) patients with IGE, and (C) patients with NAFE. IGE = idiopathic generalized epilepsy; NAFE = nonacquired focal epilepsy; PRS = polygenic risk score.

PRS density for broad epilepsy types

PRS density plots for French Canadian controls and (A) all patients with epilepsy, (B) patients with IGE, and (C) patients with NAFE. IGE = idiopathic generalized epilepsy; NAFE = nonacquired focal epilepsy; PRS = polygenic risk score.

PRS density for IGE subtypes

PRS density plots for French Canadian controls and patients with IGE syndrome (A) CAE, (B) GTCS, (C) JME, and (D) JAE. CAE = childhood absence epilepsy; GTCS = generalized tonic-clonic seizures alone; IGE = idiopathic generalized epilepsy; JAE = juvenile absence epilepsy; JME = juvenile myoclonic epilepsy; PRS = polygenic risk score.

PRS density for NAFE subtypes

PRS density plots for French Canadian controls and patients with NAFE with (A) HS, (B) no documented lesion, and (C) lesions other than HS. HS = documented hippocampal sclerosis; NAFE = nonacquired focal epilepsy; PRS = polygenic risk score. The next logical question was to investigate whether the PRS could be used to discriminate between a patient with epilepsy and a control. Table 2 presents the logistic mixed model statistics and the variance explained by the PRS calculated using the Nagelkerke pseudo-R2 (on the observed scale) for patients and controls for all epilepsy subtypes. The variance explained by the PRS varies among epilepsy subtypes, but is generally higher for IGE types than for NAFE types. This is reflected by the higher odds ratio and variance explained by the PRS in IGE broad subtype compared with NAFE and corroborates what was found in a recent study.[20]
Table 2

PRS and pseudo-R2 statistics based on the logistic mixed model

PRS and pseudo-R2 statistics based on the logistic mixed model

Discussion

The strongest association in our FC cohort was observed with the SNP rs1402398. This SNP is located in the noncoding region surrounding genes FANCL and BCL11A. These genes have been linked with epilepsies through association studies,[21] but no other functional or clinical evidence highlights their roles in the disease. Although we successfully replicated associations, we believe that the biggest contribution of our study lies in the PRSs that were established for each epilepsy type. This is, to our knowledge, one of the first documented examples of how PRSs can be used for epilepsy genetic studies for different subtypes of epilepsy, although a recent study has shown that for broad epilepsy subtypes.[20] Although this measure cannot yet be translated into clinical use, our analysis shows that the additive value of common variants can be used to better understand the disease. One definite pitfall of our study is the small size of our cohort. The initial GWAS was performed on more than 15,000 patients with epilepsy. Our study only included 373 patients with epilepsy and thus cannot have the same outreach as the initial one. This is why we did not report genome-wide association statistics and focused only on the replication of associated SNPs. We believe that the small size of our cohort also affects the PRS calculations, but to a smaller degree. For these reasons, we have to take the variance explained by the PRS with caution. However, for the broad phenotypes, we explain 4 times more of the variance for patients with IGE than what we explain for patients with NAFE, as shown elsewhere.[20] This also supports the fact that epilepsy should be divided into subtypes when studying the genetic mechanism underlying the disease, as some epilepsy types were reasonably well explained by the PRS (i.e., childhood absence epilepsy). This study was conducted on a documented founder population. The FC population is well known for its high prevalence of specific disease-causing mutations.[22,23] For epilepsy, although we cannot exclude that some of the associations found were driven by rare haplotypes, we show here that the genetic etiology of the disease is consistent with that of the general European population. In future work, we will try to assess whether the strong PRS found in some epilepsy subtypes could be explained by rarer haplotypes, as we would expect in a founder population. Globally, we support the notion that PRSs provide a reliable measure to rightfully estimate the contribution of genetic factors to the pathophysiologic mechanism of epilepsies.
  20 in total

1.  Common polygenic variation and risk for childhood-onset schizophrenia.

Authors:  K Ahn; S S An; Y Y Shugart; J L Rapoport
Journal:  Mol Psychiatry       Date:  2014-12-16       Impact factor: 15.992

Review 2.  What do genetic studies tell us about the heritable basis of common epilepsy? Polygenic or complex epilepsy?

Authors:  Bobby P C Koeleman
Journal:  Neurosci Lett       Date:  2017-03-25       Impact factor: 3.046

3.  Revised terminology and concepts for organization of seizures and epilepsies: report of the ILAE Commission on Classification and Terminology, 2005-2009.

Authors:  Anne T Berg; Samuel F Berkovic; Martin J Brodie; Jeffrey Buchhalter; J Helen Cross; Walter van Emde Boas; Jerome Engel; Jacqueline French; Tracy A Glauser; Gary W Mathern; Solomon L Moshé; Douglas Nordli; Perrine Plouin; Ingrid E Scheffer
Journal:  Epilepsia       Date:  2010-02-26       Impact factor: 5.864

Review 4.  Human genetics: lessons from Quebec populations.

Authors:  C R Scriver
Journal:  Annu Rev Genomics Hum Genet       Date:  2001       Impact factor: 8.929

Review 5.  The hidden genetics of epilepsy-a clinically important new paradigm.

Authors:  Rhys H Thomas; Samuel F Berkovic
Journal:  Nat Rev Neurol       Date:  2014-04-15       Impact factor: 42.937

6.  PRSice: Polygenic Risk Score software.

Authors:  Jack Euesden; Cathryn M Lewis; Paul F O'Reilly
Journal:  Bioinformatics       Date:  2014-12-29       Impact factor: 6.937

7.  Genome-wide mega-analysis identifies 16 loci and highlights diverse biological mechanisms in the common epilepsies.

Authors: 
Journal:  Nat Commun       Date:  2018-12-10       Impact factor: 14.919

8.  Validation of Genome-Wide Polygenic Risk Scores for Coronary Artery Disease in French Canadians.

Authors:  Florian Wünnemann; Ken Sin Lo; Alexandra Langford-Avelar; David Busseuil; Marie-Pierre Dubé; Jean-Claude Tardif; Guillaume Lettre
Journal:  Circ Genom Precis Med       Date:  2019-06-11

9.  Polygenic burden in focal and generalized epilepsies.

Authors:  Costin Leu; Remi Stevelink; Alexander W Smith; Slavina B Goleva; Masahiro Kanai; Lisa Ferguson; Ciaran Campbell; Yoichiro Kamatani; Yukinori Okada; Sanjay M Sisodiya; Gianpiero L Cavalleri; Bobby P C Koeleman; Holger Lerche; Lara Jehi; Lea K Davis; Imad M Najm; Aarno Palotie; Mark J Daly; Robyn M Busch; Dennis Lal
Journal:  Brain       Date:  2019-11-01       Impact factor: 13.501

10.  Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies.

Authors: 
Journal:  Lancet Neurol       Date:  2014-07-30       Impact factor: 44.182

View more
  3 in total

1.  Genetic Diagnosis Spectrum and Multigenic Burden of Exome-Level Rare Variants in a Childhood Epilepsy Cohort.

Authors:  Ruen Yao; Yunqing Zhou; Jie Tang; Niu Li; Tingting Yu; Yingzhong He; Cuijin Wang; Jiwen Wang; Jian Wang
Journal:  Front Genet       Date:  2021-12-21       Impact factor: 4.599

Review 2.  Epilepsy Genetics and Precision Medicine in Adults: A New Landscape for Developmental and Epileptic Encephalopathies.

Authors:  Álvaro Beltrán-Corbellini; Ángel Aledo-Serrano; Rikke S Møller; Eduardo Pérez-Palma; Irene García-Morales; Rafael Toledano; Antonio Gil-Nagel
Journal:  Front Neurol       Date:  2022-02-17       Impact factor: 4.003

3.  Role of Common Genetic Variants for Drug-Resistance to Specific Anti-Seizure Medications.

Authors:  Stefan Wolking; Ciarán Campbell; Caragh Stapleton; Mark McCormack; Norman Delanty; Chantal Depondt; Michael R Johnson; Bobby P C Koeleman; Roland Krause; Wolfram S Kunz; Anthony G Marson; Josemir W Sander; Graeme J Sills; Pasquale Striano; Federico Zara; Sanjay M Sisodiya; Gianpiero L Cavalleri; Holger Lerche
Journal:  Front Pharmacol       Date:  2021-06-09       Impact factor: 5.810

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.