Literature DB >> 33949685

Modeling seizures in the Human Phenotype Ontology according to contemporary ILAE concepts makes big phenotypic data tractable.

David Lewis-Smith1,2, Peter D Galer3,4,5,6, Ganna Balagura7, Hugh Kearney8,9, Shiva Ganesan3,4,5, Mahgenn Cosico3,4, Margaret O'Brien3,4, Priya Vaidiswaran3,4, Roland Krause10, Colin A Ellis4,6, Rhys H Thomas1,2, Peter N Robinson11,12, Ingo Helbig3,4,5,6.   

Abstract

OBJECTIVE: The clinical features of epilepsy determine how it is defined, which in turn guides management. Therefore, consideration of the fundamental clinical entities that comprise an epilepsy is essential in the study of causes, trajectories, and treatment responses. The Human Phenotype Ontology (HPO) is used widely in clinical and research genetics for concise communication and modeling of clinical features, allowing extracted data to be harmonized using logical inference. We sought to redesign the HPO seizure subontology to improve its consistency with current epileptological concepts, supporting the use of large clinical data sets in high-throughput clinical and research genomics.
METHODS: We created a new HPO seizure subontology based on the 2017 International League Against Epilepsy (ILAE) Operational Classification of Seizure Types, and integrated concepts of status epilepticus, febrile, reflex, and neonatal seizures at different levels of detail. We compared the HPO seizure subontology prior to, and following, our revision, according to the information that could be inferred about the seizures of 791 individuals from three independent cohorts: 2 previously published and 150 newly recruited individuals. Each cohort's data were provided in a different format and harmonized using the two versions of the HPO.
RESULTS: The new seizure subontology increased the number of descriptive concepts for seizures 5-fold. The number of seizure descriptors that could be annotated to the cohort increased by 40% and the total amount of information about individuals' seizures increased by 38%. The most important qualitative difference was the relationship of focal to bilateral tonic-clonic seizure to generalized-onset and focal-onset seizures. SIGNIFICANCE: We have generated a detailed contemporary conceptual map for harmonization of clinical seizure data, implemented in the official 2020-12-07 HPO release and freely available at hpo.jax.org. This will help to overcome the phenotypic bottleneck in genomics, facilitate reuse of valuable data, and ultimately improve diagnostics and precision treatment of the epilepsies.
© 2021 The Authors. Epilepsia published by Wiley Periodicals LLC on behalf of International League Against Epilepsy.

Entities:  

Keywords:  big data; classification; epilepsy; genetics

Mesh:

Year:  2021        PMID: 33949685      PMCID: PMC8272408          DOI: 10.1111/epi.16908

Source DB:  PubMed          Journal:  Epilepsia        ISSN: 0013-9580            Impact factor:   6.740


INTRODUCTION

Information about the clinical features of research participants is essential for diagnostic interpretation of genetic findings and discovery research in epilepsy genetics.[1,2] To fully exploit these data, information must be harmonized to allow reliable comparison of individuals to diagnostic criteria, or to each other.[3] Interpretation of phenotypic data is challenging in epilepsy, particularly because the central clinical features—seizures— are diverse, dynamic, and can occur in different combinations and in different developmental and comorbidity contexts.[4-9] Crucially, the same seizure can be classified at various levels of detail according to clinical need, or to accommodate incomplete information.[5,6,8] Even a single individual’s clinical data can require harmonization between different formal classifications because of the preferences of their various health care providers, the design of study-specific research data collection forms, or because their history may span several decades.[6,8,10] For example, different sources may describe the same seizure as a focal aware non-motor seizure,[6] focal seizure,[6,9,11] aura,[8,9,12] sensory seizure,[9] or simple partial seizure.[10] Harmonization traditionally occurs manually during data entry, dependent on the clinical expertise of the contributor completing a form specific to a particular diagnostic service or research study. Manual harmonization of data places a large burden on clinicians and risks overly constraining the depth and quality of the phenotypic details provided, or even case ascertainment. In addition, the format of clinical data collected for one study may not map onto the variables required for another study, limiting the ease and yield of reuse. The need for rapid and accurate automated phenotypic harmonization of large multicenter data sets is increasingly important as diagnostic services and research studies grow to analyze data from tens of thousands of individuals.[1,3,13-19] Ontologies formally model the concepts and relationships that exist in a particular expert domain.[20] The Human Phenotype Ontology (HPO) is a tool that has become the lingua franca for clinical and research geneticists for concise communication and harmonization of clinical features (human-phenotype-ontology.org).[21] The HPO is central to data capture in the Deciphering Developmental Disorders study [13] and the National Institutes of Health Research BioResource for the 100 000 Genomes Project,[14] and for the upcoming National Health Service England Genomic Medicine Service clinical genome sequencing service.[19] Through incremental development with contributions from experts across the breadth of clinical domains since 2008, the HPO has come to include 15 247 terms describing distinct clinical phenotypic concepts.[21] Each concept has a name, unique identifier (eg, HP:0001250), and definition, and may have synonyms (by which it may be searched for), comments, cross-referenced concepts from other databases such as OMIM (Online Mendelian Inheritance in Man, omim.org)[22] or SNOMED-CT (Systematized Nomenclature of Medicine – Clinical Terms, snomed.org),[23] and PubMed identifiers of supporting references. Current projects include translation of the HPO into multiple languages and improvements in the cross-referencing to clinical diagnostic and research resources such as Orphanet (http://www.orphadata.org) and ontologies of model organism phenotypes.[21] Each relevant clinical feature of an individual is annotated as the most conceptually specific HPO terms necessary to describe it fully. To distinguish HPO terms from formal seizure types or semiological descriptions within the current manuscript, we refer to HPO concepts in italics by their capitalized name and HPO identifier. For example, if an individual has focal aware motor seizures, these should be coded as Focal aware motor seizure (HP:0020217). Automated reasoning algorithms can then infer from the is_a relationship within the HPO that the individual has a type of Focal aware seizure (HP:0002349) and Focal motor seizure (HP:0011153), and from either of these that they have a Focal-onset seizure (HP:0007359), and from this a Seizure (HP:0001250). Hence, from a single input datum, an individual’s clinical features can be automatically described across the breadth of applicable descriptive terms, terms that span phenotypic depth. Consequently, coupled with natural language processing and thesauruses of synonyms, these algorithms can translate medical records and research data into the same HPO format and compare the clinical features of individuals to each other for research,[24-28] or to the clinical features associated with a particular genetic disorder or syndrome in a database for variant prioritization (Figure 1).[29-31] As is the case with clinical records spanning classifications, data coded as HPO terms may span multiple versions because the HPO is regularly updated to incorporate advances in phenotypic conceptualization. Unique term identifiers map to consistent concepts to facilitate translation from one HPO version to another.
FIGURE 1

(A) The Human Phenotype Ontology (HPO) can harmonize phenotypic data from large cohorts to identify phenotypic associations with a particular genetic (or other) categorical factor, in this case de novo variants in SCN1A. Data from Galer et al. 2020;[25] p-values and odds ratio obtained using Fisher’s exact test. (B) Once the set of HPO terms associated with this form of epilepsy is known, the clinical features of patients can be translated into HPO terms to assess how closely they match phenotypically. GTCS, generalized tonic-clonic seizure; PSW, polyspike-wave; sz, seizure

Recognizing the importance of the HPO in genomics, the pertinence of seizure types to the conceptualization of epilepsies,[4,8,32] and the recent revision of seizure classification (ILAE 2017),[6] we have created a new ontology of contemporary seizure concepts for the HPO. Herein we demonstrate harmonization of three cohorts using the HPO, and compare the information captured by our new seizure subontology (release 2020–12-07) to that from a prior iteration (release 2017–12-12).

METHODS

Cohorts providing seizure data

The local epilepsy cohort for this study comprises 150 individuals with genetic generalized epilepsy (GGE, n = 65), focal epilepsy (n = 53), and developmental and epileptic encephalopathy (DEE, n = 32) recruited from Children’s Hospital of Philadelphia for the Epi25 Collaborative study.[18] We used seizure data within the Epi25 format but did not use primary data of the Epi25 Collaborative. Informed consent for participation was obtained from participants themselves or, where necessary, their parents in agreement with the Declaration of Helsinki, and the study was completed per protocol using de-identified data with local approval by the Children’s Hospital of Philadelphia Institutional Review Board (IRB 15–12226). Data from two further and independent cohorts with DEE were reused to demonstrate harmonization of distinct data sets. These have been published previously: Helbig et al. 2019[24] comprising 306 individuals and a subcohort from Galer et al. 2000[25] comprising 335 individuals from the Epilepsy Phenome/Genome Project (EPGP). We included only unique individuals documented as having any form of epileptic seizure. The local cohort’s data were in the form of categorical seizure variables from three Epi25 Collaborative data collection forms mapping to each of GGE, focal epilepsy, and DEE (epi-25.org/epi25-forms). Data from Helbig et al. 2019 were based on HPO release 2017–12-12 and those from Galer et al. 2020 were based on HPO release 2018–11-08.

Harmonization of data using the HPO

For each cohort, we created a dictionary translating seizure data from their raw format into the specified version of the HPO. Once data had been translated into HPO annotations the is_a relationship between terms of the HPO were used to infer the other, conceptually less specific HPO seizure descriptors that must apply to the individual, as described with focal aware motor seizures above. We call this process of capturing all the terms applicable to an initial annotation, comprehensively describing a phenotypic feature at different levels of detail, propagation.[33] The set of propagated terms includes both the initial term annotated through translation using a dictionary and those inferred by logic within the ontology.

Statistical procedures

All statistical tests and ontology-based automatic inferences were performed using the R Statistical Framework,[34] including the tidyverse collection of packages[35] and ontologyIndex.[36]

The process of revising the HPO seizure subontology

We set up an international collaboration to build a seizure subontology for the HPO based on contemporary seizure concepts at the EMBO workshop on Phenotyping neurological syndromes for systems genetics, in Luxembourg, October 4–10, 2018. Our group met fortnightly by videoconference to coordinate ontology design using the online tool WebProtégé (webprotege.stanford.edu),[37] under the supervision of I.H. and R.K. (Epilepsiome Task Force of the Genetics Commission of the International League Against Epilepsy) and P.N.R. (creator of the HPO). We considered feedback and comments provided by HPO users on the HPO GitHub page (github.com/obophenotype/human-phenotype-ontology).

RESULTS

Human Phenotype Ontology–based inference harmonizes seizure descriptions across different levels of precision, semiologies, and classifications

We translated categorical seizure data in Epi25 format from 150 people in our local cohort and HPO release 2019–11-08 seizure annotations of 335 participants in EPGP (Galer 2020 cohort)[25] into HPO release 2017–12-12 (HPO 2017) terms. This harmonized their data with those already coded according to HPO 2017 from 306 individuals (Helbig 2019 cohort).[24] We compared the number of seizure descriptors that could be annotated to each of these 791 individuals prior to and following propagation using this version of the HPO (Figure 2A). We found that the median number of terms by which each individual’s seizures could be described increased from 2 to 6 as a result of propagation, with a median increase of 3 and Wilcoxon signed rank test two-sided p-value <.001.
FIGURE 2

(A) The increase in the number of seizure concepts by which data collected from 791 individuals can be described as a result of automated inference using the Human Phenotype Ontology (HPO) version release 2017–12-12. Green lines indicate an increase in the number of seizure descriptors annotated by inference after translation. The blue line indicates eight individuals whose only HPO term translated from input data was Seizure (HP:0001250), which is insufficient to infer descriptions. (B) Comparison of the three independent cohorts according to the percentage of individuals annotated with each of 16 HPO seizure terms, comprising the 10 most common seizure concepts in each cohort. Note that in this HPO version, Generalized tonic-clonic seizure (HP:0002069) is indifferent to onset. p-values are two-sided from Fisher’s exact test with Holm-Bonferroni adjustment for 16 comparisons; 95% confidence intervals were calculated from the Poisson distribution

As a consequence of having harmonized data from these three independent cohorts onto the same conceptual model, we were able to compare them by the proportion of individuals annotated with particular seizure descriptions (Figure 2B). The relative abundance of seizure types in each cohort reflects the phenotypic focuses of each study. A greater proportion of individuals in the previously published DEE cohorts than in the local cohort had Generalized myoclonic seizures (HP:0002123) and Generalized tonic seizures (HP:0010818). The Helbig 2019 cohort had a high burden of Status epilepticus (HP:0002133) and a prevalence of Focal seizures (HP:0007359) similar to the mixed cohort, but a low prevalence of Generalized tonic-clonic seizures without focal onset (HP:0025190). Epileptic spasms (HP:0011097), Infantile spasms (HP:0012469), and Atypical absences (HP:0007270) were particularly frequently annotated in the Galer 2020 cohort. Conversely, more individuals in the mixed cohort had Dialeptic seizures (HP:0011146), both Typical absences (HP:0011147) and Focal seizures with impairment of awareness (HP:0002384).

Creation of a new subontology of contemporary seizure concepts for the Human Phenotype Ontology

At the time that the ILAE 2017 was published, the seizure subontology of the HPO lacked the lexicon to distinguish between some seizures types yet included redundant terms and poorly integrated concepts. For example, Generalized tonic-clonic seizures with focal onset (HP:0007334) (a focal to bilateral tonic-clonic seizure) was implied to be a type of generalized-onset rather than focal-onset seizure. Recognizing such existing discrepancies, we generated a new ontology of seizure types and then mapped pre-existing HPO terms across synonymous concepts where possible to allow historical HPO data to be mapped to modern concepts for reuse. The core of the new ontology was based on the ILAE 2017, with subsequent incorporation of other classifications and commonly used concepts (Table 1). We selected preferred names for seizure concepts that describe their features concisely for visualization in ontology browsers such as hpo.jax.org. We generally favored terminology from ILAE classifications where applicable and followed precedence where not. However, we preferred the name “Bilateral tonic-clonic seizure with generalized onset” to “Generalized tonic-clonic seizure” to minimize misinterpretation by people who use the latter irrespective of onset, contrary to the ILAE 2017. HPO terms can be searched for by preferred name, identifier, or synonym. We provided ILAE and historical classification terminology as synonyms where necessary.
TABLE 1

Classifications and definitions consulted to generate the HPO 2020–12-07 seizure subontology

DomainReference
Seizure types and semiological descriptionsOperational classification of seizure types by the ILAE: Position Paper of the ILAE Commission for Classification and Terminology. Fisher et al. 2017[6]Instruction manual for the ILAE 2017 operational classification of seizure types. Fisher et al. 2017[5]EpilepsyDiagnosis.org[42]Glossary of descriptive terminology for ictal semiology: Report of the ILAE Task Force on Classification and Terminology. Blume et al. 2001[9]Semiological seizure classification. Lüders et al. 1998[12]Classification of paroxysmal events and the four-dimensional epilepsy classification system. Lüders et al. 2019[8]
Status epilepticusA definition and classification of status epilepticus: Report of the ILAE Task Force on Classification of Status Epilepticus. Trinka et al. 2015[38]
Neonatal seizure typesThe ILAE classification of seizures and the epilepsies: Modification for seizures in the neonate. Position Paper by the ILAE Task Force on Neonatal Seizures. Pressler et al. 2021[43]
Febrile and mild gastroenteritis-associated seizuresPredictors of epilepsy in children who have experienced febrile seizures. Nelson and Ellenberg 1976[44]Febrile seizures: Clinical practice guideline for the long-term management of the child with simple febrile seizures. American Academy of Pediatrics 2008[45]Generalized epilepsy with febrile seizures plus. A genetic disorder with heterogeneous clinical phenotypes. Scheffer and Berkovic 1997[46]Clinical features of benign convulsions with mild gastroenteritis. Uemura et al. 2002[47]Benign infantile seizures with mild gastroenteritis: Study of 22 patients. Caraballo et al. 2009[48]Clinical outcome of recurrent afebrile seizures in children with benign convulsions associated with mild gastroenteritis. Chen et al. 2018[49]
Reflex seizuresA proposed diagnostic scheme for people with epileptic seizures and with epilepsy: Report of the ILAE Task Force on Classification and Terminology. Engel Jr 2001[50]

Although it is clinically important for classifications to include explicit terms to denote diagnostic uncertainty, such as unknown onset tonic-clonic seizure, this explicit uncertainty does not pertain to the seizure itself (the tonic-clonic seizure must either be generalized or focal in onset), but rather to the clinician’s ability to classify the seizure based on the available evidence: “the term ‘unknown onset’ is a placeholder – not a characteristic of the seizure, but of ignorance.” [6] HPO terms represent concepts of phenotypes themselves, rather than the limitations of classification in a particular instance. Accordingly, within the HPO a seizure that is classified as unknown onset tonic-clonic seizure should be mapped to an HPO term that implies that this seizure is either a Bilateral tonic-clonic seizure with generalized onset (HP:0025190) or Bilateral tonic-clonic seizure with focal onset (HP:0007334)—generalized tonic-clonic seizure or focal to bilateral tonic-clonic seizure in the ILAE 2017—rather than a third, intrinsically different seizure type, such as an absence seizure. Hence, the seizure subontology includes additional terms such as Bilateral tonic-clonic seizure (HP:0002069) without specifying onset. This term applies to the ILAE 2017 concepts of unknown onset tonic-clonic seizure, generalized tonic-clonic seizure, and focal to bilateral tonic-clonic seizure. The onset-specific forms have their respective onset-specific is_a relationships to Generalized-onset motor seizure (HP:0032677) and Focal-onset seizure (HP:0007359), and in addition have a second is_a relationship to this onset-agnostic term. Consequently, individuals with unknown onset tonic-clonic seizures are annotated with the onset-agnostic term in the absence of the onset-specific terms. This arrangement of concepts is analogous to the ILAE 2017 classification of focal-onset seizures, which allows a particular instance of a focal-onset seizure to be classified by either awareness or motor versus non-motor onset independently without explicit reference to the other when necessary, rather than requiring all focal seizures to be classified according to both of these dimensions. In addition, it allows optimal integration of the proposed classification of convulsive status epilepticus.[38] So-called “dual parentage” can also help to mitigate misclassification in diagnostic decision support tools using HPO-based algorithms. In some circumstances, distinguishing between atypical absence seizures and focal impaired awareness seizures can be difficult, and nonspecialists often mislabel focal impaired awareness seizures as “absences” or even “petit-mal.” The pre-existing HPO term Dialeptic seizure (HP:0011146) recognizes the semiological similarity between Absence seizure (HP:0002121) and Focal impaired awareness seizure (HP:0002384).[12] Without a parent term in the HPO spanning electrographic onsets, these two concepts would appear no more similar to each other than Absence seizure (HP:0002121) does to Bilateral tonic-clonic seizure with focal onset (HP:0007334): their most specific common ancestor would be Seizure (HP:0001250). Thus, were an individual with a focal impaired awareness seizure to be annotated incorrectly with Absence seizure (HP:0002121), an algorithm comparing this to a reference of Focal impaired awareness seizure (HP:0002384) can use “fuzzy matching” to recognize that a correct (yet imprecise) match would have been made, if the individual was coded with a parent of their actual annotation: Dialeptic seizure (HP:0011146). We extended this strategy of creating dual parentage independent of electrographic onset beyond its clinical scope for mitigating uncertainty and misclassification to handle those semiologies that can occur with either focal or generalized onset. Researchers may be interested in identifying individuals whose seizures share semiological features independent of electrographic onset in order to investigate shared associations and mechanisms. Dual parentage allows the ontology to recognize that a focal tonic seizure is similar to other types of focal motor seizure but also that it is more similar to a generalized tonic seizure than it is to a generalized myoclonic seizure by introducing a second parent term, Tonic seizure (HP:0032792), which is indifferent to onset. Users not wishing to use these or other concepts, need not do so simply by overlooking them in their data dictionaries and sets of propagated terms. A major component of the phenotypic bottleneck in clinical genetic diagnostics and research is data entry. We provide very specific seizure concepts and relationships that allow the ontology to infer as much information as possible about the seizures from the provision of the minimum number of input terms. For example, on provision of the single highly specific term Focal aware cognitive seizure with deja vu/jamais vu (HP:0032883) a reasoning algorithm can use the ontology to infer the applicability of 12 other ontological seizure descriptors capturing its focal onset, cognitive ictal manifestation, and the retention of awareness throughout in different combinations and at different degrees of granularity. Pre-existing HPO terms were retired only if they could not be mapped among current seizure concepts or were redundant (Table S1). Otherwise, terms were integrated into the new ontology, with modification of their definition or relationships if required, or as a synonym for a new term.

The 2020 HPO seizure subontology captures more seizure information than the 2017 version

After completing our integration of the new seizure subontology into the HPO release 2020–12-07 (HPO 2020), we compared this to the version released 2017–12-12 (HPO 2017) prior to commencement of our revision according to the number of terms by which seizures could be described and the relationships between these concepts (Table 2). The HPO 2020 contained 5-fold as many terms and 7-fold as many relationships.
TABLE 2

Description of the HPO 2017 and the HPO 2020 seizure subontologies and the frequency of the 32 most frequent seizure concepts in the combined cohort of 791 individuals according to each. Terms with frequency “NA” were not conceptualized

HPO subontology
Metadata of the seizure subontologies 2017 release 2020 release
Number of seizure terms71347
Number of is_a relationships between two seizures terms73518
Number of is_a relationships between a seizure and non-seizure term34

Frequency of different seizure concepts captured from the cohort of 791 individuals

Term identifier Seizure descriptor from 2020 release Frequency in 2017 release Frequency in 2020 release
HP:0001250Seizure11
HP:0020219Motor seizureNA0.8078
HP:0002197Generalized-onset seizure0.79520.7320
HP:0032677Generalized-onset motor seizureNA0.4994
HP:0011146Dialeptic seizure0.44500.4450
HP:0011097Epileptic spasm0.40710.4071
HP:0002069Bilateral tonic-clonic seizure0.36920.3704
HP:0012469Infantile spasms0.34770.3477
HP:0033259Non-motor seizureNA0.3350
HP:0007359Focal-onset seizure0.31230.3211
HP:0032794Myoclonic seizureNA0.3148
HP:0002123Generalized myoclonic seizure0.32490.3135
HP:0002121Generalized non-motor (absence) seizure0.30470.3047
HP:0032792Tonic seizureNA0.2617
HP:0010818Generalized tonic seizure0.24400.2440
HP:0010819Atonic seizure0.23260.2326
HP:0002384Focal impaired awareness seizure0.18960.1896
HP:0025190Bilateral tonic-clonic seizure with generalized onset0.18710.1871
HP:0007270Atypical absence seizure0.18080.1795
HP:0002133Status epilepticus0.15930.1593
HP:0011153Focal motor seizure0.13150.1315
HP:0032892Infection-related seizureNA0.1302
HP:0032894Seizure precipitated by febrile infectionNA0.1302
HP:0002373Febrile seizure (within age range 3 months – 6 years)0.12260.1226
HP:0020221Clonic seizureNA0.0872
HP:0007334Bilateral tonic-clonic seizure with focal onset0.06700.0670
HP:0011147Typical absence seizure0.06700.0670
HP:0002266Focal clonic seizure0.06320.0632
HP:0011170Generalized myoclonic-atonic seizure0.04930.0493
HP:0032679Focal non-motor seizureNA0.0442
HP:0011175Focal motor seizure with version0.03410.0341
HP:0002349Focal aware seizure0.01520.0303
Applying this, we harmonized categorical data from our local cohort and HPO 2017 data from the previously published DEE cohorts into the HPO 2020 subontology and then compared the annotated seizure data from the combined cohort of 791 individuals by HPO version (Table 2). Several new terms depict novel concepts and thus were not captured by the HPO 2017. Focal-onset seizures (HP:0007359) appear more common and Generalized-onset seizures (HP:0002197) appear less so in the HPO 2020. This is a result of revised relationships to make Bilateral tonic-clonic seizure with focal onset (HP:0007334) a form of Focal-onset seizure (HP:0007359) rather than Generalized-onset seizure (HP:0002197). The increase in frequency of Bilateral tonic-clonic seizure (HP:0002069), which is indifferent to onset, results from the addition of a specific term for Convulsive status epilepticus (HP:0032660) as a prolonged form of the former. Phenotypic depth can be conceptualized as the amount of information provided about an individual’s or cohort’s phenotypes. We compared the depth of phenotypic information about our combined cohort encoded using each seizure subontology (Table 3). The number of seizure terms annotated to an individual after translation is a measure of the concepts within the raw data that can be captured by the ontology without inference. The number of terms annotated after propagation measures the total number of potential ways that the seizures in the raw data can be described, or categorized, by the ontology after inference. We found a modest detectable increase in annotations to the cohort after translation: the HPO 2020 increased the number of annotations in 46 individuals and reduced the number of annotations in 13. Compared to the HPO 2017, the HPO 2020 increased the number of propagated annotations in the combined cohort by 40%. The median increase in the number of propagated terms per individual was 50% (Table 3 and Figure 3A).
TABLE 3

Comparison of the amount of phenotypic information captured about the seizures of 791 individuals using the HPO 2017 and the HPO 2020

ComparisonHPO subontology
Difference (% change or median, p-value)
2017 release2020 release
Number of input annotations
 Entire cohort (total)22752308Increase by 1.5%
 Per individual (median)220, p < .001
Number of annotations after propagation (inference)
 Entire cohort (total)50297063Increase by 40%
 Per individual (median)683, p < .001
Total information content (bits)
 Entire cohort (total)828011 403Increase by 38%
 Per individual (median)7.9011.433.15, p < .001

Differences between total cohort measures are given as percentages of the HPO 2017 figures; paired differences between individuals are given as median absolute differences; p-values are two-sided from a Wilcoxon signed-rank test.

FIGURE 3

Comparison of the phenotypic information encoded from the same data from 791 individuals using the two seizure subontologies. (A) Green lines indicate an increase, red lines a decrease, and blue lines no difference in the number of seizure descriptors annotated after inference. (B) The total amount of information encoded about each individual’s seizure types according to each Human Phenotype Ontology (HPO) seizure subontology. Numbers of individuals shown correspond to those falling within 2 bit by 2 bit bins, the dashed gray line indicates equality, and the regression line is shown in purple

However, the number of annotations does not distinguish between common and rare features of seizures. Types of seizure that are encountered more rarely are intuitively of greater value for distinguishing an individual from the rest of the cohort than common types of seizure. In other words, the presence of a rare seizure type is more informative than that of a common seizure type. In information theory this can be quantified as information content, usually measured in bits and defined as the negative logarithm of their frequency in the cohort (IC = −log(f)). This gives annotations of rare seizure terms more bits of information than those that are more common. Because annotations are propagated from conceptually narrow to broader parent terms, information content tends to increase as one goes from broad to more detailed terms and thus is a good measure of specificity. Accordingly, after propagation one can measure the true frequency of each seizure term and calculate the information content of each term annotated in a cohort. The total amount of information captured by the ontology describing an individual’s seizure repertoire can be calculated as the sum of the information content of each of their seizure annotations. We found that the total amount of information about the cohort’s seizures increased by 38% when using HPO 2020 (Table 3 and Figure 3B). The information about each individual’s seizures increased for 686 individuals and decreased for 86, demonstrating a large effect size (|z/√N| = 0.80). Although there was a small reduction in the quantity of information about 86 individuals, this was attributable to the changes described earlier, correcting the relationships of the concept of Bilateral tonic-clonic seizure with focal onset (HP:0007334) and introducing Convulsive status epilepticus (HP:0032660) as a type of Bilateral tonic-clonic seizure (HP:0002069). Hence, despite a reduction in quantity of information in a minority of individuals, the quality of information captured by HPO 2020 is greater.

DISCUSSION

We describe the revision of the seizure domain within the HPO to align it with contemporary classification concepts, and the resulting effect on precise seizure description in 791 individuals. We demonstrate how seizure data can be encoded in HPO terms, automatically classifying each seizure according to multiple conceptually broader descriptors, and how it can harmonize data provided in different formats. We show that compared to a previous iteration, the new subontology increased the amount of information that could be encoded in HPO terms about an individual’s seizure repertoire, thereby facilitating combination with other data sets, or analysis, based on automated inference of seizure concepts. Our newly developed seizure subontology is fully implemented in the official HPO release as of 2020–12-07 and freely available at hpo.jax.org. The new subontology encompasses the breadth of epileptic seizure types encountered in clinical practice, taking evidence from formal classifications, consensus statements, and pertinent peer-reviewed literature that describe particular seizure types that define clinical entities. It is not a clinical classification and serves a different purpose: to facilitate automated inference of clinical seizure data for harmonized analysis or comparison. Although we did not amend classification concepts themselves, we used ontological approaches to retain as much information as possible about seizures that can be classified only incompletely. This allows semiological descriptions of seizures in medical records that do not specify electroclinical onset (for example, “she continues to have tonic seizures”) to be coded without assuming the electrographic onset or losing all description of the clinical manifestation. Similarly, the subontology includes semiological terms that can code seizures without 80% certainty of onset.[6] For example, the term Non-motor seizure (HP:0033259) may be relevant for people with epilepsy who are unable to disclose sufficient subjective experience to confirm a specific type of focal non-motor seizure, and in whom insufficient supporting evidence can be obtained to differentiate between a focal and absence seizure, thus including the ILAE 2017 concept of an unknown onset behavior arrest seizure. The ability to code seizures informatively according to only a limited patient or witness history optimizes harmonization of data emanating from health care settings with limited access to resources such as video-electroencephalography. The HPO continues to include older concepts such as Epileptic aura (HP:0033348) and Dialeptic seizure (HP:0011146) that some epileptologists find useful, and we have integrated these with ILAE 2017 concepts where possible.[5,8,9,12,32] This maximizes the precision of information from historical data that can be harmonized with current ILAE concepts using the HPO. A priority when developing the new subontology was to try to make the encoding of seizure types as efficient as possible to reduce the burden of manual data entry and dictionary design. Single input terms allow harmonization to seizure types and descriptors at different levels of detail and also unambiguously bind together features such as level of awareness and initial manifestation for a particular instance of a seizure. An exception to this is the concept of a focal to bilateral tonic-clonic seizure, which the ILAE 2017 considers a propagation pattern rather than unitary seizure type.[5] Similarly, we have represented this concept as a single independent term, Bilateral tonic-clonic seizure with focal onset (HP:0007334), which avoids making the number of terms in the subontology unwieldy. Consistent with the ILAE 2017, the structure of the new HPO subontology was based on electrographic onset and prioritization of initial manifestations. An ontology-based clinical diagnostic support tool using ILAE 2017 terms to suggest syndromic or genetic diagnoses has been developed.[39] Although the list of seizure concepts in this ontology was published, the relationships between specific concepts are not openly available for comparison with our subontology. The semiological evolution of a focal seizure adds granularity that is important for mapping its spread across eloquent cortex and can be recorded using descriptors from the ILAE 2017,[5] ILAE glossary,[9] or a sequence of components within the semiology classification of the Four-dimensional Epilepsy Classification.[12] Beyond loss of awareness and bilateral propagation, different semiological evolutions of the same unitary focal seizure type are indistinguishable in the HPO. This may be insufficient for detailed surgical and network analyses. However, the HPO is predominantly a tool of genomics, and phenotypic analysis in epilepsy genetics has focused on seizure types rather than differences in the evolution of seizures of the same type. The Epilepsy Syndrome Seizure Ontology (version 1.03, bioportal.bioontology.org/ontologies/ESSO) and Epilepsy and Seizure Ontology (version 1.0, bioportal.bioontology.org/ontologies/EPSO) predate the ILAE 2017, and their multidimensional representation of seizures (aligned to the Four-dimensional Epilepsy Classification) may be better suited to such tasks.[8,40] Another ontological approach to semiology prior to ILAE 2017 has shown that analysis of combinations of semiological features can be used for classification of temporal lobe and extra-temporal lobe epilepies.[41] Advances in understanding and classification of focal seizures may facilitate prioritization of important sequences of semiological components for inclusion in the HPO. Currently, as in this study, HPO terms are usually annotated manually on review of the medical records or through bulk translation of structured categorical data from research or clinical genetic clinical data forms. However, the burden of HPO coding will decrease as the availability of electronic medical record systems with integrated HPO annotation modules increases, and with advances in natural language processing. Although this coordinated revision of the HPO seizure subontology has improved the integration of its terms and its consistency with current ILAE classifications, scope remains for refinement, including integration of concepts from other classifications to maximize its value for harmonization of non-ILAE 2017 data, such as that from historical cohorts. Continuing expert engagement (we encourage suggestions via https://github.com/obophenotype/human-phenotype-ontology/issues) will lead to incremental improvements in HPO domains pertinent to epilepsy that will benefit clinical diagnostic algorithms and research for people with epilepsy over years to come.
  43 in total

Review 1.  Could the 2017 ILAE and the four-dimensional epilepsy classifications be merged to a new "Integrated Epilepsy Classification"?

Authors:  Felix Rosenow; Naoki Akamatsu; Thomas Bast; Sebastian Bauer; Christoph Baumgartner; Selim Benbadis; Adriana Bermeo-Ovalle; Stefan Beyenburg; Andrew Bleasel; Alireza Bozorgi; Milan Brázdil; Mar Carreño; Norman Delanty; Michael Devereaux; John Duncan; Guadalupe Fernandez-Baca Vaca; Stefano Francione; Naiara García Losarcos; Lauren Ghanma; Antonio Gil-Nagel; Hajo Hamer; Hans Holthausen; Shirin Jamal Omidi; Philippe Kahane; Giri Kalamangalam; Andrés Kanner; Susanne Knake; Stjepana Kovac; Karsten Krakow; Günter Krämer; Gerhard Kurlemann; Nuria Lacuey; Patrick Landazuri; Shi Hui Lim; Luisa V Londoño; Giorgio LoRusso; Hans Luders; Jayanti Mani; Riki Matsumoto; Jonathan Miller; Soheyl Noachtar; Rebecca O'Dwyer; André Palmini; Jun Park; Philipp S Reif; Jan Remi; Americo C Sakamoto; Bettina Schmitz; Susanne Schubert-Bast; Stephan Schuele; Asim Shahid; Bernhard Steinhoff; Adam Strzelczyk; C Akos Szabo; Nitin Tandon; Kiyohito Terada; Manuel Toledo; Walter van Emde Boas; Matthew Walker; Peter Widdess-Walsh
Journal:  Seizure       Date:  2020-03-05       Impact factor: 3.184

2.  Operational classification of seizure types by the International League Against Epilepsy: Position Paper of the ILAE Commission for Classification and Terminology.

Authors:  Robert S Fisher; J Helen Cross; Jacqueline A French; Norimichi Higurashi; Edouard Hirsch; Floor E Jansen; Lieven Lagae; Solomon L Moshé; Jukka Peltola; Eliane Roulet Perez; Ingrid E Scheffer; Sameer M Zuberi
Journal:  Epilepsia       Date:  2017-03-08       Impact factor: 5.864

3.  Ultra-Rare Genetic Variation in the Epilepsies: A Whole-Exome Sequencing Study of 17,606 Individuals.

Authors: 
Journal:  Am J Hum Genet       Date:  2019-07-18       Impact factor: 11.025

4.  A proposed diagnostic scheme for people with epileptic seizures and with epilepsy: report of the ILAE Task Force on Classification and Terminology.

Authors:  J Engel
Journal:  Epilepsia       Date:  2001-06       Impact factor: 5.864

5.  Clinical features of benign convulsions with mild gastroenteritis.

Authors:  Naoko Uemura; Akihisa Okumura; Tamiko Negoro; Kazuyoshi Watanabe
Journal:  Brain Dev       Date:  2002-12       Impact factor: 1.961

6.  Generalized epilepsy with febrile seizures plus. A genetic disorder with heterogeneous clinical phenotypes.

Authors:  I E Scheffer; S F Berkovic
Journal:  Brain       Date:  1997-03       Impact factor: 13.501

7.  Benign infantile seizures with mild gastroenteritis: study of 22 patients.

Authors:  Roberto H Caraballo; Laura Gañez; Cecilia De los Santos; Alberto Espeche; Ricardo Cersósimo; Natalio Fejerman
Journal:  Seizure       Date:  2009-10-13       Impact factor: 3.184

8.  PhenoTips: patient phenotyping software for clinical and research use.

Authors:  Marta Girdea; Sergiu Dumitriu; Marc Fiume; Sarah Bowdin; Kym M Boycott; Sébastien Chénier; David Chitayat; Hanna Faghfoury; M Stephen Meyn; Peter N Ray; Joyce So; Dimitri J Stavropoulos; Michael Brudno
Journal:  Hum Mutat       Date:  2013-05-24       Impact factor: 4.878

9.  Clinical diagnostics in human genetics with semantic similarity searches in ontologies.

Authors:  Sebastian Köhler; Marcel H Schulz; Peter Krawitz; Sebastian Bauer; Sandra Dölken; Claus E Ott; Christine Mundlos; Denise Horn; Stefan Mundlos; Peter N Robinson
Journal:  Am J Hum Genet       Date:  2009-10       Impact factor: 11.025

10.  Whole-genome sequencing of patients with rare diseases in a national health system.

Authors:  Ernest Turro; William J Astle; Karyn Megy; Stefan Gräf; Daniel Greene; Olga Shamardina; Hana Lango Allen; Alba Sanchis-Juan; Mattia Frontini; Chantal Thys; Jonathan Stephens; Rutendo Mapeta; Oliver S Burren; Kate Downes; Matthias Haimel; Salih Tuna; Sri V V Deevi; Timothy J Aitman; David L Bennett; Paul Calleja; Keren Carss; Mark J Caulfield; Patrick F Chinnery; Peter H Dixon; Daniel P Gale; Roger James; Ania Koziell; Michael A Laffan; Adam P Levine; Eamonn R Maher; Hugh S Markus; Joannella Morales; Nicholas W Morrell; Andrew D Mumford; Elizabeth Ormondroyd; Stuart Rankin; Augusto Rendon; Sylvia Richardson; Irene Roberts; Noemi B A Roy; Moin A Saleem; Kenneth G C Smith; Hannah Stark; Rhea Y Y Tan; Andreas C Themistocleous; Adrian J Thrasher; Hugh Watkins; Andrew R Webster; Martin R Wilkins; Catherine Williamson; James Whitworth; Sean Humphray; David R Bentley; Nathalie Kingston; Neil Walker; John R Bradley; Sofie Ashford; Christopher J Penkett; Kathleen Freson; Kathleen E Stirrups; F Lucy Raymond; Willem H Ouwehand
Journal:  Nature       Date:  2020-06-24       Impact factor: 49.962

View more
  4 in total

Review 1.  Precision medicine for genetic epilepsy on the horizon: Recent advances, present challenges, and suggestions for continued progress.

Authors:  Juliet K Knowles; Ingo Helbig; Cameron S Metcalf; Laura S Lubbers; Lori L Isom; Scott Demarest; Ethan M Goldberg; Alfred L George; Holger Lerche; Sarah Weckhuysen; Vicky Whittemore; Samuel F Berkovic; Daniel H Lowenstein
Journal:  Epilepsia       Date:  2022-07-17       Impact factor: 6.740

Review 2.  Computational analysis of neurodevelopmental phenotypes: Harmonization empowers clinical discovery.

Authors:  David Lewis-Smith; Shridhar Parthasarathy; Julie Xian; Michael C Kaufman; Shiva Ganesan; Peter D Galer; Rhys H Thomas; Ingo Helbig
Journal:  Hum Mutat       Date:  2022-05-22       Impact factor: 4.700

Review 3.  Clinical and therapeutic significance of genetic variation in the GRIN gene family encoding NMDARs.

Authors:  Tim A Benke; Kristen Park; Ilona Krey; Chad R Camp; Rui Song; Amy J Ramsey; Hongjie Yuan; Stephen F Traynelis; Johannes Lemke
Journal:  Neuropharmacology       Date:  2021-09-22       Impact factor: 5.250

4.  The clinical and biochemical hallmarks generally associated with GLUT1DS may be caused by defects in genes other than SLC2A1.

Authors:  Obdulia Sánchez-Lijarcio; Delia Yubero; Fátima Leal; María L Couce; Luis González Gutiérrez-Solana; Eduardo López-Laso; Àngels García-Cazorla; Leticia Pías-Peleteiro; Begoña de Azua Brea; Salvador Ibáñez-Micó; Gonzalo Mateo-Martínez; Monica Troncoso-Schifferli; Scarlet Witting-Enriquez; Magdalena Ugarte; Rafael Artuch; Belén Pérez
Journal:  Clin Genet       Date:  2022-04-15       Impact factor: 4.296

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.