| Literature DB >> 27392431 |
Anandhi Iyappan1,2, Shweta Bagewadi Kawalia3,4, Tamara Raschka1,5, Martin Hofmann-Apitius1,2, Philipp Senger1.
Abstract
BACKGROUND: Neurodegenerative diseases are incurable and debilitating indications with huge social and economic impact, where much is still to be learnt about the underlying molecular events. Mechanistic disease models could offer a knowledge framework to help decipher the complex interactions that occur at molecular and cellular levels. This motivates the need for the development of an approach integrating highly curated and heterogeneous data into a disease model of different regulatory data layers. Although several disease models exist, they often do not consider the quality of underlying data. Moreover, even with the current advancements in semantic web technology, we still do not have cure for complex diseases like Alzheimer's disease. One of the key reasons accountable for this could be the increasing gap between generated data and the derived knowledge.Entities:
Keywords: Alzheimer's disease; Data curation; Data harmonization; Data integration; Disease modeling; Neurodegenerative diseases; RDF; Semantic web
Mesh:
Substances:
Year: 2016 PMID: 27392431 PMCID: PMC4939021 DOI: 10.1186/s13326-016-0079-8
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1Overall workflow of NeuroRDF. The workflow illustrates the collection of data from various resources such as databases, and literature, followed by steps taken to pre-process and prune the collected data. These high-quality data are represented semantically as RDF models and stored in a triplestore. The stored knowledge can later be queried for biologically interesting questions
Fig. 2Schematic representation of the Diseased PPIs in RDF. The figure describes AD specific PPI interactions along with supporting evidence mined from literature
Fig. 3Schematic representation of MiRNA-target interactions in RDF. The figure encapsulates miRNA mentions along with their corresponding gene identifier from literature
Fig. 4Schematic representation of Healthy PPIs in RDF. The figure represents PPIs of healthy subjects extracted from literature and PPI specific databases. The schema also contains meta-information about these PPIs
Fig. 5Schematic representation of Gene Expression Data in RDF. This figure represents gene expression data obtained from public resources such as GEO and ArrayExpress
Statistics of generated RDF models stored in Virtuoso endpoint
| Models | No. of triples | No. of entries | No. of properties | Size (mb) |
|---|---|---|---|---|
| Alzheimer’s disease PPI | 8353 | 19900 | 11 | 0.894 |
| Healthy State PPI | 1204194 | 78852 | 11 | 99.102 |
| MTI | 667 | 300 | 5 | 0.095 |
| Microarray | 20454 | 9477 | 16 | 303.5 |
Fig. 6Example SPARQL query for information retrieval from NeuroRDF. SPARQL query as seen in the figure retrieves the miRNAs for a given gene
Prioritized AD candidate genes
| Intersected genes between healthy and AD PPI | MiRNAs | Differentially expressed neighbors | Number of literature articles for intersected genes | |
|---|---|---|---|---|
| Healthy PPI | AD PPI | |||
| APP | MIR101-1, | ADAM10, | TGFB1, | |
| MIR106A, | MAPT, | BACE1, | ||
| MIR106B, | MIF, | LRP1 | ||
| MIR124-1, | BACE1, | 24550 | ||
| MIR137, | LRP1 | |||
| MIR153-1, | ||||
| MIR181-C, | ||||
| MIR29A, | ||||
| MIR520C, | ||||
| MIR19-1 | ||||
| BACE1 | MIR107, | |||
| MIR124-1, | APP, | |||
| MIR145, | APP | LRP1 | 1883 | |
| MIR298, | ||||
| MIR29A, | ||||
| MIR29B1, | ||||
| MIR328, | ||||
| MIR9-1 | ||||
| ADAM10 | MIR451, | |||
| MIR144, | ||||
| MIR1306, | APP | - | 231 | |
| MIR107, | ||||
| MIR103 | ||||
| IL1B | MIR146A, | |||
| MIR155 | PTGS2 | - | 1099 | |
| MAPK3 | MIR15A, | - | STMN2, | 276 |
| MIR155 | JUN | |||
| MAPT | MIR16-1, | APP | TUBA4A | 3367 |
| MIR132 | ||||
| APLP2 | MIR153-1 | - | - | 134 |
| DLG4 | MIR485 | - | LRP1 | 151 |
| IL6 | MIR27B | - | - | 748 |
| JUN | MIR144 | - | STAT4, | 142 |
| MAPK3 | ||||
| LRP1 | MIR205 | APP | DLG4, | 305 |
| APP, | ||||
| BACE1 | ||||
| PTGS2 | MIR146A | IL1B | - | 474 |
| TGFB1 | MIR155 | - | APP | 276 |
This table summarizes the literature based evidences of intersected genes between healthy and AD PPI and their corresponding miRNAs and differentially expressed genes
Fig. 7Extracted sub-networks from AD PPIs network. This figure symbolizes the diseased sub-graphs that were generated using prioritized candidates and their differentially expressed neighbors
Fig. 8Extracted sub-networks from healthy PPIs network. This figure symbolizes the healthy sub-graphs that were generated using prioritized candidates and their differentially expressed neighbors
Fig. 9Statistics of the literature evidence for emerging candidate genes in the speculated sub-networks