| Literature DB >> 34031551 |
David Lewis-Smith1,2,3,4, Shiva Ganesan5,6,7, Peter D Galer5,6,7, Katherine L Helbig5,6,7, Sarah E McKeown5,7, Margaret O'Brien6,7, Pouya Khankhanian8, Michael C Kaufman5,6,7, Alexander K Gonzalez5,6, Alex S Felmeister6, Roland Krause9, Colin A Ellis5,6,8, Ingo Helbig10,11,12,13.
Abstract
While genetic studies of epilepsies can be performed in thousands of individuals, phenotyping remains a manual, non-scalable task. A particular challenge is capturing the evolution of complex phenotypes with age. Here, we present a novel approach, applying phenotypic similarity analysis to a total of 3251 patient-years of longitudinal electronic medical record data from a previously reported cohort of 658 individuals with genetic epilepsies. After mapping clinical data to the Human Phenotype Ontology, we determined the phenotypic similarity of individuals sharing each genetic etiology within each 3-month age interval from birth up to a maximum age of 25 years. 140 of 600 (23%) of all 27 genes and 3-month age intervals with sufficient data for calculation of phenotypic similarity were significantly higher than expect by chance. 11 of 27 genetic etiologies had significant overall phenotypic similarity trajectories. These do not simply reflect strong statistical associations with single phenotypic features but appear to emerge from complex clinical constellations of features that may not be strongly associated individually. As an attempt to reconstruct the cognitive framework of syndrome recognition in clinical practice, longitudinal phenotypic similarity analysis extends the traditional phenotyping approach by utilizing data from electronic medical records at a scale that is far beyond the capabilities of manual phenotyping. Delineation of how the phenotypic homogeneity of genetic epilepsies varies with age could improve the phenotypic classification of these disorders, the accuracy of prognostic counseling, and by providing historical control data, the design and interpretation of precision clinical trials in rare diseases.Entities:
Mesh:
Year: 2021 PMID: 34031551 PMCID: PMC8560769 DOI: 10.1038/s41431-021-00908-8
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Fig. 1Genetic etiologies of epilepsy demonstrate distinct time-dependent distributions of encounters in the electronic medical records.
For each of the 36 genetic etiologies identified in two or more individuals in this cohort, the number of individuals contributing data to each 3-month interval between birth and 25 years is shown. The total number of individuals with a particular genetic etiology is given in brackets after the gene symbol.
The evidence for distinctive trajectories of phenotypic homogeneity and EMR usage.
| Genetic etiology | Number of individuals with the etiology | Cumulative PhenSim score from birth to age 25 years | Raw | Holm’s adjusted |
|---|---|---|---|---|
| 8 | 62.1 | 1.00E−06 | 3.60E−05 | |
| 29 | 43.3 | 1.00E−06 | 3.60E−05 | |
| 12 | 33.4 | 1.00E−06 | 3.60E−05 | |
| 22 | 38.8 | 1.00E−06 | 3.60E−05 | |
| 6 | 26.3 | 1.05E−04 | 0.00336 | |
| 5 | 36.4 | 1.22E−04 | 0.00378 | |
| 4 | 41.1 | 1.94E−04 | 0.00582 | |
| 6 | 22.2 | 2.30E−04 | 0.00667 | |
| 3 | 43.5 | 3.76E−04 | 0.0105 | |
| 5 | 26.4 | 4.63E−04 | 0.0125 | |
| 4 | 32.1 | 6.70E−04 | 0.0174 | |
| 6 | 9.89 | 0.00223 | 0.0556 | |
| 3 | 15.4 | 0.00883 | 0.212 | |
| 4 | 9.25 | 0.0273 | 0.627 | |
| 4 | 6.64 | 0.0511 | 1.00 | |
| 2 | 3.39 | 0.216 | 1.00 | |
| 2 | 3.04 | 0.234 | 1.00 | |
| 2 | 2.22 | 0.284 | 1.00 | |
| 2 | 2.17 | 0.288 | 1.00 | |
| 2 | 1.96 | 0.302 | 1.00 | |
| 2 | 1.82 | 0.313 | 1.00 | |
| 2 | 1.60 | 0.331 | 1.00 | |
| 4 | 0.858 | 0.360 | 1.00 | |
| 2 | 0.749 | 0.413 | 1.00 | |
| 2 | 0.577 | 0.431 | 1.00 | |
| 2 | 0.325 | 0.459 | 1.00 | |
| 2 | 0.00 | 1.00 | 1.00 |
The empirical p value derived from 1,000,000 permutations indicating the probability of observing a cumulative PhenSim score at least as great as that of each etiology due to chance is provided before and after Holm’s adjustment for 36 genetic etiologies.
Fig. 2Genetic epilepsies show phenotypic similarities that vary over time.
PhenSim scores are shown for the 27 etiologies for which PhenSim could be calculated in this cohort. The height of each ridge indicates PhenSim score at for the corresponding etiology at that age. PhenSim scores significant after Holm’s adjustment for 3,600 hypotheses are shown in color, and nonsignificant PhenSim scores in gray.
Fig. 3The relationship between PhenSim and the number of individuals with EMR usage at the corresponding age.
The effect of the number of (A) gene-positive and (B) total individuals with EMR usage on PhenSim scores at the corresponding age. The number of etiologies with PhenSim scores of zero at each number of gene-positive individuals where this was observed is shown above the x-axis in (A). The median PhenSim score for each value on the x-axes is shown by the black lines.
Fig. 4The relationship between strong associations with single phenotypic terms and phenotypic similarity according to age for SCN1A.
Associations are shown only for terms reaching a strong association at any given age. Associations are plotted as solid lines at ages with a p-value < 0.01 and as a dotted line at ages where p-value > 0.01. PhenSim scores are shown as solid lines where significant after Holm’s adjustment and dashed lines where not.