| Literature DB >> 29873705 |
Charles Tapley Hoyt1,2, Daniel Domingo-Fernández1,2, Nora Balzer2, Anka Güldenpfennig1, Martin Hofmann-Apitius1,2.
Abstract
Cross-sectional epidemiological studies have shown that the incidence of several nervous system diseases is more frequent in epilepsy patients than in the general population. Some comorbidities [e.g. Alzheimer's disease (AD) and Parkinson's disease] are also risk factors for the development of seizures; suggesting they may share pathophysiological mechanisms with epilepsy. A literature-based approach was used to identify gene overlap between epilepsy and its comorbidities as a proxy for a shared genetic basis for disease, or genetic pleiotropy, as a first effort to identify shared mechanisms. While the results identified neurological disorders as the group of diseases with the highest gene overlap, this analysis was insufficient for identifying putative common mechanisms shared across epilepsy and its comorbidities. This motivated the use of a dedicated literature mining and knowledge assembly approach in which a cause-and-effect model of epilepsy was captured with Biological Expression Language. After enriching the knowledge assembly with information surrounding epilepsy, its risk factors, its comorbidities, and anti-epileptic drugs, a novel comparative mechanism enrichment approach was used to propose several downstream effectors (including the GABA receptor, GABAergic pathways, etc.) that could explain the therapeutic effects carbamazepine in both the contexts of epilepsy and AD. We have made the Epilepsy Knowledge Assembly available at https://www.scai.fraunhofer.de/content/dam/scai/de/downloads/bioinformatik/epilepsy.bel and queryable through NeuroMMSig at http://neurommsig.scai.fraunhofer.de. The source code used for analysis and tutorials for reproduction are available on GitHub at https://github.com/cthoyt/epicom.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29873705 PMCID: PMC6007221 DOI: 10.1093/database/bay050
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.A graphical abstract of the methodology, analysis, and results presented in this work. The two upper boxes represent the methodology while the two lower boxes represent the analysis and results. The upper-left box outlines the quantification of gene overlap between epilepsy and its well-known comorbidities described in Keezer et al. (23; e.g. AD, PD, etc.) using literature based methods. The upper-right box outlines the assembly of knowledge from epilepsy literature with chemical and pathway enrichment as described in the ‘Preparation for Mechanism Enrichment’ and ‘Relation Extraction’ Sections. The lower-right box represents the comparative mechanism enrichment that was used to generate comorbidity insights (lower-left box) after literature-based methods proved insufficient.
Results of the Epilepsy comorbidity analysis using SCAIView
| Disease (MeSH ID) | Associated documents | Disease associated genes | Comorbidity associated genes | Epilepsy pleiotropy rate (%) |
|---|---|---|---|---|
| Epilepsy (D004827) | 192 245 | 2901 | –– | –– |
| Stroke (D020521) | 210 846 | 4533 | 633 | 17.78 |
| AD (D000544) | 109 495 | 4968 | 396 | 13.65 |
| Migraine (D008881) | 30 928 | 1230 | 306 | 10.54 |
| PD (D010300) | 79 103 | 3646 | 258 | 8.89 |
| Hypertension (D006973) | 391 190 | 5574 | 252 | 8.68 |
| Dementia (D003704) | 183 802 | 5833 | 220 | 7.58 |
| Diabetes mellitus (D003920) | 394 411 | 6661 | 184 | 6.34 |
| Intestinal diseases (D007410) | 629 691 | 9093 | 166 | 5.72 |
| Thyroid diseases (D013959) | 153 025 | 4366 | 133 | 4.58 |
| Anxiety (D001007) | 84 138 | 1782 | 124 | 4.27 |
| Arthritis (D001168) | 259 327 | 5367 | 122 | 4.2 |
| Cataract (D002386) | 52 150 | 2238 | 119 | 4.1 |
| Asthma (D001249) | 147 697 | 3761 | 86 | 2.96 |
| Glaucoma (D005901) | 56 679 | 2303 | 48 | 1.65 |
| Depressive disorder, major (D003865) | 15 706 | 1249 | 46 | 1.58 |
| Urinary incontinence (D014549) | 34 170 | 720 | 24 | 0.82 |
| Peptic ulcer (D010437) | 68 234 | 1445 | 21 | 0.72 |
| Back pain (D001416) | 48 516 | 1191 | 17 | 0.58 |
| Pulmonary disease, chronic obstructive (D029424) | 35 627 | 2244 | 15 | 0.51 |
| Fibromyalgia (D005356) | 9021 | 468 | 10 | 0.34 |
| Emphysema (D004646) | 25 511 | 1261 | 9 | 0.31 |
| Bronchitis, chronic (D029481) | 9085 | 580 | 2 | 0.06 |
Notes: The number of (column 2) retrieved from SCAIView for each disease is shown given a reference query using corresponding the MeSH term from column 1. The (column 3) contain the number of genes relevant to the corpus retrieved from a disease-specific query. The (column 4) contain the number of genes relevant to the comorbidity query between the target disease and epilepsy. Lastly, the (column 5) describes the ratio of the count of genes reported in column 4 with the total number of epilepsy-associated genes (2901).
Statistics over the Epilepsy Knowledge Assembly generated by PyBEL, grouped by sub-graph
| Sub-graph name | Biological entities | Relationships | Connected Components | Citations |
|---|---|---|---|---|
| Adaptive immune system sub-graph | 12 | 12 | 4 | 5 |
| Adenosine signaling sub-graph | 76 | 154 | 3 | 15 |
| Apoptosis signaling sub-graph | 228 | 503 | 5 | 115 |
| Brain-derived neurotrophic factor signaling sub-graph | 75 | 142 | 1 | 29 |
| Calcium dependent sub-graph | 302 | 793 | 8 | 73 |
| Chromatin organization sub-graph | 8 | 10 | 2 | 2 |
| Energy metabolic sub-graph | 91 | 177 | 4 | 24 |
| Estradiol metabolism | 7 | 8 | 2 | 1 |
| G-protein-mediated signaling | 78 | 140 | 5 | 25 |
| Gaba sub-graph | 262 | 632 | 2 | 56 |
| Glutamatergic sub-graph | 121 | 246 | 5 | 32 |
| Hormone signaling sub-graph | 126 | 256 | 9 | 16 |
| Inflammatory response sub-graph | 42 | 49 | 4 | 21 |
| Innate immune system sub-graph | 41 | 63 | 8 | 18 |
| Interleukin signaling sub-graph | 46 | 111 | 3 | 17 |
| Long term synaptic depression | 64 | 129 | 2 | 21 |
| Long term synaptic potentiation | 125 | 252 | 2 | 46 |
| Mapk-erk sub-graph | 313 | 706 | 5 | 68 |
| Metabolism | 340 | 600 | 14 | 126 |
| Mirna sub-graph | 5 | 4 | 1 | 3 |
| Mossy fiber sub-graph | 38 | 66 | 2 | 14 |
| Mtor signaling sub-graph | 166 | 336 | 3 | 42 |
| Neurotransmitter release sub-graph | 552 | 1667 | 5 | 131 |
| Notch signaling sub-graph | 105 | 205 | 3 | 20 |
| Protein kinase signaling sub-graph | 377 | 850 | 6 | 87 |
| Protein metabolism | 129 | 185 | 8 | 44 |
| Reelin signaling sub-graph | 117 | 253 | 2 | 21 |
| Regulation of actin cytoskeleton sub-graph | 5 | 3 | 2 | 2 |
| Serotonergic sub-graph | 148 | 478 | 2 | 18 |
| Thyroid hormone signaling sub-graph | 106 | 228 | 2 | 9 |
| Transport related sub-graph | 49 | 60 | 7 | 25 |
| Wnt signaling sub-graph | 27 | 38 | 3 | 15 |
Notes: The first column, corresponds to the number of genes, chemicals, proteins, biological process, etc. in each sub-graph. The second column, (i.e. edges), corresponds to the number of connections between each sub-graphs’ biological entities. The third column, corresponds to the number of ‘connected’ groups of nodes within each sub-graph. The final column, corresponds to the total number of articles from which information was extracted to build each sub-graph. A more detailed summary is included in the Supplementary Material.
Figure 2.A schematic representation of the knowledge surrounding carbamazepine retrieved by querying its targets with NeuroMMSig. The relevant portions of the most significantly enriched graphs in the context of epilepsy and AD, the adenosine signaling and the GABA sub-graphs, were merged and displayed in order to highlight a potential explanatory mechanism for the therapeutic effects of carbamazepine (Supplementary Text S2). It is rendered with a hierarchical layout to mirror the flow from molecular entities to proteins, biological processes and pathologies.