| Literature DB >> 31196182 |
Lucy Lu Wang1, G Thomas Hayman2, Jennifer R Smith2, Monika Tutaj2, Mary E Shimoyama2, John H Gennari3.
Abstract
BACKGROUND: To improve the outcomes of biological pathway analysis, a better way of integrating pathway data is needed. Ontologies can be used to organize data from disparate sources, and we leverage the Pathway Ontology as a unifying ontology for organizing pathway data. We aim to associate pathway instances from different databases to the appropriate class in the Pathway Ontology.Entities:
Keywords: Ontology mapping; Ontology-based data integration; Pathway data interoperability; Pathway ontology; Semi-automated ontology curation
Mesh:
Year: 2019 PMID: 31196182 PMCID: PMC6567466 DOI: 10.1186/s13326-019-0202-8
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1Semi-automated curation pipeline
Training data by source
| Source | No. positive | No. negative |
|---|---|---|
| PW mappings to KEGG, NCI-PID, and SMPDB | 860 | 7116 |
| GO/MeSH mappings | 732 | 325 |
| Bootstrapped PW/Reactome mappings | 730 | 720 |
| Total | 2322 | 8161 |
Fig. 2Bootstrapping procedure. The initial training data is derived from existing PW mappings and UMLS mappings between MeSH and GO. A simple logistic regression model is trained on this data and used to bootstrap training samples from Reactome. The best matches between Reactome pathways and PW classes are added to the training data set over 10 iterations to generate a final training data set
Fig. 3Architecture of neural network model. The neural network computes similarity between a pathway definition and a PW class definition. A bidirectional LSTM is used to encode the definition texts. This example shows the definition for Reactome pathway R-HSA-109606 and PW class PW:0000104 being encoded and compared in the neural network
Top ranked predicted mappings for Reactome pathway R-HSA-109581, “Apoptosis”
| PW ID | PW class name | Beginning of definition text | |
|---|---|---|---|
| 1 | PW_0000104 | intrinsic apoptotic pathway | The apoptotic pathway involving organelles, primarily the mitochon... |
| 2 | PW_0000009 | apoptotic cell death pathway | Apoptosis is a programmed cell death pathway that is characterized by... |
| 3 | PW_0000106 | extrinsic apoptotic pathway | The apoptotic pathway involving the death receptors mediated route of... |
| 4 | PW_0000718 | p53 signaling pathway | p53 transcription factor is a tumor suppressor frequently mutated in... |
| 5 | PW_0000124 | cellular detoxification pathway | A pathway triggered by exogenous or endogenous elements, compounds... |
| 6 | PW_0000823 | humoral immunity pathway | Humoral immunity is mediated by antibodies secreted by the B cell... |
| 7 | PW_0000824 | cell-mediated immunity pathway | Cell-mediated immune response pathways are carried out by T cell... |
| 8 | PW_0000499 | nuclear factor kappa B signaling pathway | NF-kB signaling plays an essential role in the mammalian immune... |
| 9 | PW_0000680 | altered extrinsic apoptotic pathway |
|
| 10 | PW_0000233 | tumor necrosis factor mediated signaling pathway | Tumor necrosis factor (Tnf) signaling plays pivotal roles in immunity... |
Inter-rater agreement for mapping labeling task
| Rater #1 | ||||
|---|---|---|---|---|
| Rater #2 | y(es) | r(elated) | n(o) | Totals |
| y(es) | 24 | 8 | 0 | 32 |
| r(elated) | 0 | 69 | 4 | 73 |
| n(o) | 0 | 46 | 60 | 106 |
| Totals | 24 | 123 | 64 | 211 |
Comparison of BOW and NN model predictions
| Model | Precision ( | Recall ( | Yield |
|---|---|---|---|
| BOW | 0.49 | 0.42 | 0.50 |
| NN | 0.39 | 0.78 | 0.80 |
Precision and recall are calculated from a 5% sample of Reactome pathways; yield is calculated over all Reactome pathways
Top pathways predicted to map to PW:0000029 ("fatty acid biosynthetic pathway")
| HumanCyc | Reactome | WikiPathways |
|---|---|---|
| PWY-5966: fatty acid biosynthesis initiation II | R-HSA-77288: mitochondrial fatty acid beta-oxidation of unsaturated fatty acids | WP357: Fatty Acid Biosynthesis |
| PWY-5143: fatty acid activation | R-HSA-77289: Mitochondrial Fatty Acid Beta-Oxidation | |
| R-HSA-390247: Beta-oxidation of very long chain fatty acids | ||
| R-HSA-75105: Fatty acyl-CoA biosynthesis | ||
| R-HSA-500753: Pyrimidine biosynthesis | ||
| R-HSA-8978868: Fatty acid metabolism |