Caitlin A Bennett1, Slavé Petrovski1, Karen L Oliver1, Samuel F Berkovic1. 1. Department of Medicine (C.A.B., S.P., K.L.O., S.F.B.), Epilepsy Research Centre; and Department of Medicine (S.P.), Royal Melbourne Hospital, University of Melbourne, Victoria, Australia.
Abstract
OBJECTIVE: To assist the interpretation of genomic data for common epilepsies, we asked whether variants implicated in mild epilepsies in autosomal dominant families are present in the general population. METHODS: We studied 12 genes for the milder epilepsies and identified published variants with strong segregation support (de novo germline mutation or ≥4 affected family members). These variants were checked in the Exome Aggregation Consortium (ExAC), a database of genetic variation in over 60,000 individuals. We subsequently evaluated variants in these epilepsy genes that lacked strong segregation support. To determine whether the findings in epilepsies were representative of other diseases, we also assessed the presence of variants in other dominant neurologic disorders (e.g., CADASIL). RESULTS: Published epilepsy variants with strong segregation support (n = 65) were absent (n = 61) or present once (n = 4) in ExAC. By contrast, of 46 published epilepsy variants without strong segregation support, 8 occurred recurrently (2-186 times). Similarly, none of the 45 disease-associated variants from other neurologic disorders with strong segregation support occurred more than once in ExAC. Reanalysis using the larger ExAC V2 plus gnomAD reference cohort showed consistent results. CONCLUSIONS: Variants causing autosomal dominant epilepsies are ultra-rare in the general population. Variants observed more than once in ExAC were only found among reports without strong segregation support, suggesting that they may be benign. Clinicians are increasingly faced with the interpretation of genetic variants of unknown significance. These data illustrate that variants present more than once in ExAC are less likely to be pathogenic, reinforcing the valuable clinical role of ExAC.
OBJECTIVE: To assist the interpretation of genomic data for common epilepsies, we asked whether variants implicated in mild epilepsies in autosomal dominant families are present in the general population. METHODS: We studied 12 genes for the milder epilepsies and identified published variants with strong segregation support (de novo germline mutation or ≥4 affected family members). These variants were checked in the Exome Aggregation Consortium (ExAC), a database of genetic variation in over 60,000 individuals. We subsequently evaluated variants in these epilepsy genes that lacked strong segregation support. To determine whether the findings in epilepsies were representative of other diseases, we also assessed the presence of variants in other dominant neurologic disorders (e.g., CADASIL). RESULTS: Published epilepsy variants with strong segregation support (n = 65) were absent (n = 61) or present once (n = 4) in ExAC. By contrast, of 46 published epilepsy variants without strong segregation support, 8 occurred recurrently (2-186 times). Similarly, none of the 45 disease-associated variants from other neurologic disorders with strong segregation support occurred more than once in ExAC. Reanalysis using the larger ExAC V2 plus gnomAD reference cohort showed consistent results. CONCLUSIONS: Variants causing autosomal dominant epilepsies are ultra-rare in the general population. Variants observed more than once in ExAC were only found among reports without strong segregation support, suggesting that they may be benign. Clinicians are increasingly faced with the interpretation of genetic variants of unknown significance. These data illustrate that variants present more than once in ExAC are less likely to be pathogenic, reinforcing the valuable clinical role of ExAC.
Recent advances in DNA technologies have increased the influence that genomic data have in epilepsy clinical practice. Finding a genetic cause can have important implications for diagnosis and prognosis, treatment and genetic counseling, as well as the psychological and financial benefits associated with ending the diagnostic journey.[1,2]Clinical laboratories are understandably conservative in ascribing pathogenicity; however, the elevated reporting of variants of unknown significance (VOUS) makes interpretation complex and renders follow-up efforts, such as segregation analyses and functional studies, impractical at a large scale. Novel missense variants are particularly challenging. Establishing the frequency of a VOUS in control populations is currently among the most reliable considerations in variant interpretation.The Exome Aggregation Consortium (ExAC) database contains exome data for more than 60,000 individuals compiled at the Broad Institute.[3] For de novo mutations affecting established epilepsy genes, absence from ExAC is widely accepted as one supportive argument for pathogenicity and this is particularly true for epilepticencephalopathies and most neurodevelopmental disorders.[4] The question is whether this utility extends to the milder epilepsies where negative selection may not have been as strong.The milder presentations and the observation of reduced penetrance within dominant families raise the possibility that some causative variants might be present in the general population at low frequencies. Here, to assess how informative ExAC might be to ascribing pathogenicity to inherited variants among milder epilepsies, we evaluated the frequency in ExAC of published epilepsy variants, where de novo status or segregation in large families supported pathogenicity.
METHODS
A list of 12 genes was generated for the “milder” epilepsies (table 1). We excluded PRRT2, as it is prone to alignment errors in next-generation sequencing.
Table 1
Genes for autosomal dominant epilepsy syndromes
Genes for autosomal dominant epilepsy syndromes
Literature search.
We attempted an exhaustive PubMed search for missense and splice variants deemed causative for dominant familial epilepsies, published in English, on the basis of strong segregation support, which we defined as prior linkage analysis or familial segregation in at least 4 affected family members. We note that segregation in 4 individuals is inadequate to support gene discovery but, in the context of an established gene for an appropriate phenotype, such segregation provides considerable support for the role of the variant in the particular family. Sanger validated de novo variants found in our list of 12 epilepsy genes, among sporadic cases of “mild” epilepsies, were also included. We deliberately limited our search to the literature published prior to 2015, as this literature can realistically be considered to precede the launch of the ExAC database. A separate review was conducted for each of the 12 genes by searching the gene name and “epilepsy.” For genes associated with both milder and more severe phenotypes, such as genetic epilepsy with febrile seizures plus (GEFS+) and Dravet syndrome with SCN1A, only variants found among families or individuals with the milder phenotype were included. Variants reported exclusively among the epilepticencephalopathies were excluded.A second list of variants without strong segregation support was generated. Lack of strong segregation support was defined as segregation data based on less than 4 affected individuals or not performed at all. We hypothesized that this group of variants might be more enriched for background genetic variants in epilepsy genes.To determine whether findings in mild epilepsies were reflective of other autosomal dominant neurologic diseases, we evaluated variants with strong segregation support from 45 published families with familial hemiplegic migraine, CADASIL, autosomal dominant Alzheimer disease or frontotemporal dementia. The genes examined were ATP1A2, CACNA1A, SCN1A, NOTCH3, APP, PSEN1, PSEN2, GRN, MAPT, and TARDBP1.Finally, after the study was completed, the expanded ExAC V2 plus gnomAD was released and the analysis was rerun. This effectively doubled the size of the ExAC control cohort, with whole-exome sequencing (WES) and whole-genome sequencing data available on 123,136 individuals and 15,496 individuals, respectively.[3]
RESULTS
Sixty-five missense or predicted splice variants for the milder familial epilepsies with strong segregation support were identified. Of these, 61 (94%) were absent from ExAC and 4 (6%) were present in ExAC once (a single allele was reported) (figure; table e-1 at Neurology.org/ng).
Figure
Frequency distribution of allele counts in ExAC for variants in epilepsy families
Variants with strong segregation support, including confirmed de novo mutations among sporadic common epilepsies, are shown in blue (n = 65); variants without strong segregation support are shown in red (n = 46); and variants from other neurologic disorders are shown in green (n = 45). ExAC = Exome Aggregation Consortium.
Frequency distribution of allele counts in ExAC for variants in epilepsy families
Variants with strong segregation support, including confirmed de novo mutations among sporadic common epilepsies, are shown in blue (n = 65); variants without strong segregation support are shown in red (n = 46); and variants from other neurologic disorders are shown in green (n = 45). ExAC = Exome Aggregation Consortium.By contrast, of the 46 missense or splice variants contained in the same publications but lacking the strong segregation support, 13 (28%) were present in ExAC. Of these, 8 variants were observed among 2–186 carriers (figure; table e-1). These 8 variants are listed in table 2.
Table 2
Eleven variants without linkage analysis or strong segregation support, which are represented in ExAC V2 plus gnomAD more than once
Eleven variants without linkage analysis or strong segregation support, which are represented in ExAC V2 plus gnomAD more than onceOf the 45 missense variants with strong segregation support for other neurologic disorders, 43 (96%) were absent from ExAC and 2 (4%) were present once (figure; table e-1).Reanalysis with ExAC V2 and gnomAD datafiles showed that the epilepsy variants in our current study with strong segregation support are present no more than once, with the exception of 2 variants in SCN1B associated with febrile seizures and GEFS+ which are present 3 and 4 times (table e-1).For variants without strong segregation support, their frequency in ExAC V2 approximately doubled, as expected, compared with the initial analysis and 3 new variants were observed multiple times in ExAC V2 (table 2).
DISCUSSION
The interpretation of putative epilepsy genetic variants remains complex for researchers and clinicians alike. Here, we assessed the allele count of literature-reported epilepsy variants in the ExAC database and found that variants with strong segregation evidence for pathogenicity were present no more than once in the ExAC collection of over 60,000 individuals, illustrating that these variants are ultra-rare in the general population. This pattern was also seen in the variants found in other neurologic disorders that were reported based on strong segregation support. However, the literature-reported epilepsy variants without strong segregation support were more likely to be present in more than 1 carrier among the ExAC collection (figure), suggesting that many of these variants (table 2) are unlikely to be penetrant disease-causing alleles, but rather background variation in established epilepsy genes or possibly risk alleles contributing to a more complex genetic architecture that is not currently well understood.WES in patients with epilepsy can be a powerful diagnostic tool, but interpretation of the data remains complex. In the field of epilepsy genetics, this complexity is highlighted by the likely false attribution of pathogenicity to 2 genes, EFHC1 and SRPX2. SRPX2 was deemed a pathogenic gene in 2006 in 2 families with epilepsy, speech dyspraxia, and intellectual impairment, including one from our centre.[5] The key family was subsequently shown to have a GRIN2A variant, and in fact the SRPX2 variant from this family appears in ExAC 47 times. Similarly, the pathogenicity of EFHC1 as a gene for juvenile myoclonic epilepsy has been disputed.[6]Even for widely accepted epilepsy genes, the interpretation of VOUS is complex. A large contribution to false-positive epilepsy variants comes from variants identified through “secondary” studies, wherein pathogenicity is largely inferred based on the knowledge of prior disease-causing variants within the gene. These variants are often not subject to the same rigorous evaluation of pathogenicity as variants from the original discovery,[6] resulting in more permissive pathogenicity assignments. Indeed, we know that epilepsy genes carry background variation in the general population. For example, the variants in table 2, which are in established epilepsy genes, were published as pathogenic and some of these are present in ExAC at relatively high frequencies, emphasizing the caution necessary when assessing pathogenicity solely on the basis of a known epilepsy gene. This is one of the most important considerations in the interpretation of results from clinical gene panels.The issue of variant misclassification extends well beyond the field of epilepsy to other monogenic diseases. It has recently been demonstrated that each ExAC participant carries, on average, ∼54 variants that have previously been reported as disease causing.[3] Of these, ∼41 variants have a minor allele frequency of >1%.[3] It is therefore likely that many of these variants are either benign variants or contributing to complex inheritance.The frequency of VOUS in control samples is thus an invaluable line of evidence in interpretation, and the ExAC database offers a control cohort of unprecedented size. ExAC is not enriched for pathogenic variants in genes that are commonly tested in a clinical diagnostics setting, thus supporting its utility in classifying variants.[7] In a recent study, the ExAC database was found to have an excess of ostensibly pathogenic variants for prion disease but found that the variants with the strongest independent evidence of pathogenicity were absent from ExAC, which is consistent with our findings.[8] The authors attributed the excess allele frequency for variants with weak evidence for pathogenicity to the incorrect attribution of pathogenicity for certain variants or reduced penetrance of these alleles, an interpretation shared here.Thus, the interrogation of ExAC provides a highly useful way of assessing the relevance of VOUS for patients with epilepsy and is likely to be widely applicable to other neurologic disorders. We have shown that disease-causing variants in epilepsy are very rare, and thus variants that are present more than once in the current ExAC cohort are unlikely to contribute to disease in a large way. It is important that the inverse is definitely not true—we cannot infer that a variant's absence in ExAC is sufficient evidence of pathogenicity.The use of large reference cohorts as a clinical tool is not without limitations. As the minor allele frequency resolution improves with additional samples added to large reference cohorts (like ExAC), it is possible that clinically relevant variants for the milder epilepsies might begin to appear more than once. Indeed, our results with ExAC were supported with ExAC V2 plus gnomAD where the sample size was doubled. The only exception was 2 variants in SCN1B associated with febrile seizures and GEFS+ which are present 3 and 4 times (table e-1). A possible explanation for this slightly elevated frequency is that variants causing febrile seizures (a mild phenotype often resolving with age) are expected to have a subtler impact on reproductive fitness than variants causing other common genetic generalized and nonacquired focal epilepsies, and this is also coupled with the higher rates of febrile seizures in the general population than other common epilepsies. Thus, it is unlikely that variants with a minor allele frequency >0.0005% (approximate current ExAC V2 resolution) have a major contribution to autosomal dominant forms of mild epilepsies.Our study and the message presented here are independently supported by our recent Epi4K study, where we found that when comparing to a control population, the epilepsy risk signals identified in epilepsy genes among large collections of epilepsy samples were driven by ultra-rare variants absent in ExAC.[9] Our current study takes an alternative approach, comparing the frequency in ExAC of variants with and without strong segregation support, to reach the same conclusion. It is important to note that although European and some other genetic ancestries are well represented in ExAC, many ethnic minorities are underrepresented or not represented at all, posing additional issues in interpretation when assessing data from a patient from an ethnic background not well represented in these databases.[10]
Authors: Patrice Roll; Gabrielle Rudolf; Sandrine Pereira; Barbara Royer; Ingrid E Scheffer; Annick Massacrier; Maria-Paola Valenti; Nathalie Roeckel-Trevisiol; Sarah Jamali; Christophe Beclin; Caroline Seegmuller; Marie-Noëlle Metz-Lutz; Arnaud Lemainque; Marc Delepine; Christophe Caloustian; Anne de Saint Martin; Nadine Bruneau; Danièle Depétris; Marie-Geneviève Mattéi; Elisabeth Flori; Andrée Robaglia-Schlupp; Nicolas Lévy; Bernd A Neubauer; Rivka Ravid; Christian Marescaux; Samuel F Berkovic; Edouard Hirsch; Mark Lathrop; Pierre Cau; Pierre Szepetowski Journal: Hum Mol Genet Date: 2006-02-23 Impact factor: 6.150
Authors: Leanne M Dibbens; Boukje de Vries; Simona Donatello; Sarah E Heron; Bree L Hodgson; Satyan Chintawar; Douglas E Crompton; James N Hughes; Susannah T Bellows; Karl Martin Klein; Petra M C Callenbach; Mark A Corbett; Alison E Gardner; Sara Kivity; Xenia Iona; Brigid M Regan; Claudia M Weller; Denis Crimmins; Terence J O'Brien; Rosa Guerrero-López; John C Mulley; Francois Dubeau; Laura Licchetta; Francesca Bisulli; Patrick Cossette; Paul Q Thomas; Jozef Gecz; Jose Serratosa; Oebele F Brouwer; Frederick Andermann; Eva Andermann; Arn M J M van den Maagdenberg; Massimo Pandolfo; Samuel F Berkovic; Ingrid E Scheffer Journal: Nat Genet Date: 2013-03-31 Impact factor: 38.330
Authors: Todor Arsov; Saul A Mullen; Sue Rogers; A Marie Phillips; Kate M Lawrence; John A Damiano; Hadassa Goldberg-Stern; Zaid Afawi; Sara Kivity; Chantal Trager; Steven Petrou; Samuel F Berkovic; Ingrid E Scheffer Journal: Ann Neurol Date: 2012-11 Impact factor: 10.422
Authors: Jack A Kosmicki; Kaitlin E Samocha; Daniel P Howrigan; Stephan J Sanders; Kamil Slowikowski; Monkol Lek; Konrad J Karczewski; David J Cutler; Bernie Devlin; Kathryn Roeder; Joseph D Buxbaum; Benjamin M Neale; Daniel G MacArthur; Dennis P Wall; Elise B Robinson; Mark J Daly Journal: Nat Genet Date: 2017-02-13 Impact factor: 38.330
Authors: Monkol Lek; Konrad J Karczewski; Eric V Minikel; Kaitlin E Samocha; Eric Banks; Timothy Fennell; Anne H O'Donnell-Luria; James S Ware; Andrew J Hill; Beryl B Cummings; Taru Tukiainen; Daniel P Birnbaum; Jack A Kosmicki; Laramie E Duncan; Karol Estrada; Fengmei Zhao; James Zou; Emma Pierce-Hoffman; Joanne Berghout; David N Cooper; Nicole Deflaux; Mark DePristo; Ron Do; Jason Flannick; Menachem Fromer; Laura Gauthier; Jackie Goldstein; Namrata Gupta; Daniel Howrigan; Adam Kiezun; Mitja I Kurki; Ami Levy Moonshine; Pradeep Natarajan; Lorena Orozco; Gina M Peloso; Ryan Poplin; Manuel A Rivas; Valentin Ruano-Rubio; Samuel A Rose; Douglas M Ruderfer; Khalid Shakir; Peter D Stenson; Christine Stevens; Brett P Thomas; Grace Tiao; Maria T Tusie-Luna; Ben Weisburd; Hong-Hee Won; Dongmei Yu; David M Altshuler; Diego Ardissino; Michael Boehnke; John Danesh; Stacey Donnelly; Roberto Elosua; Jose C Florez; Stacey B Gabriel; Gad Getz; Stephen J Glatt; Christina M Hultman; Sekar Kathiresan; Markku Laakso; Steven McCarroll; Mark I McCarthy; Dermot McGovern; Ruth McPherson; Benjamin M Neale; Aarno Palotie; Shaun M Purcell; Danish Saleheen; Jeremiah M Scharf; Pamela Sklar; Patrick F Sullivan; Jaakko Tuomilehto; Ming T Tsuang; Hugh C Watkins; James G Wilson; Mark J Daly; Daniel G MacArthur Journal: Nature Date: 2016-08-18 Impact factor: 49.962
Authors: Mahmoud Koko; Joshua E Motelow; Kate E Stanley; Dheeraj R Bobbili; Ryan S Dhindsa; Patrick May Journal: Epilepsia Date: 2022-01-15 Impact factor: 5.864
Authors: Kameryn M Butler; Olivia A Moody; Elisabeth Schuler; Jason Coryell; John J Alexander; Andrew Jenkins; Andrew Escayg Journal: Brain Date: 2018-08-01 Impact factor: 13.501
Authors: Amy J Osborne; Matteo Breno; Nicolo Ghiringhelli Borsa; Fengxiao Bu; Véronique Frémeaux-Bacchi; Daniel P Gale; Lambertus P van den Heuvel; David Kavanagh; Marina Noris; Sheila Pinto; Pavithra M Rallapalli; Giuseppe Remuzzi; Santiago Rodríguez de Cordoba; Angela Ruiz; Richard J H Smith; Paula Vieira-Martins; Elena Volokhina; Valerie Wilson; Timothy H J Goodship; Stephen J Perkins Journal: J Immunol Date: 2018-03-02 Impact factor: 5.422
Authors: Tommaso Pippucci; Laura Licchetta; Sara Baldassari; Caterina Marconi; Monica De Luise; Candace Myers; Elena Nardi; Federica Provini; Cinzia Cameli; Raffaella Minardi; Elena Bacchelli; Lucio Giordano; Giovanni Crichiutti; Giuseppe d'Orsi; Marco Seri; Giuseppe Gasparre; Heather C Mefford; Paolo Tinuper; Francesca Bisulli Journal: Ann Clin Transl Neurol Date: 2019-02-25 Impact factor: 4.511