| Literature DB >> 31017923 |
Abstract
BACKGROUND: In the literature-based discovery, considerable research has been done based on the ABC model developed by Swanson. ABC model hypothesizes that there is a meaningful relation between entity A extracted from document set 1 and entity C extracted from document set 2 through B entities that appear commonly in both document sets. The results of ABC model are relations among entity A, B, and C, which is referred as paths. A path allows for hypothesizing the relationship between entity A and entity C, or helps discover entity B as a new evidence for the relationship between entity A and entity C. The co-occurrence based approach of ABC model is a well-known approach to automatic hypothesis generation by creating various paths. However, the co-occurrence based ABC model has a limitation, in that biological context is not considered. It focuses only on matching of B entity which commonly appears in relation between two entities. Therefore, the paths extracted by the co-occurrence based ABC model tend to include a lot of irrelevant paths, meaning that expert verification is essential.Entities:
Mesh:
Year: 2019 PMID: 31017923 PMCID: PMC6481912 DOI: 10.1371/journal.pone.0215313
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Outline of methodology.
Fig 2Example of tree parsing.
Example of entity relations extracted from an abstract using the biological context extraction module.
| PMID | Sentence_ID | Entity 1 | Entity 1 | Entity 2 | Entity 2 |
|---|---|---|---|---|---|
| 23829269 | 1 | hsf-1 | nr5a1 | nphs1 | nphs1 |
| 23829269 | 8 | wt1 | wt1 | hsf-1 | nr5a1 |
| 23829269 | 1 | POSITIVE | ACTIVE | affect | AFFECTS |
| 23829269 | 8 | POSITIVE | ACTIVE | unaffected | AFFECTS |
| 23829269 | 1 | NoData | NoData | amyloidosis | transgenic |
| 23829269 | 8 | NoData | NoData | NoData | NoData |
Example of entity relations extracted by Context-assignment-based ABC model.
| PMID | Sentence_ID | Entity 1 | Entity 1 | Entity 2 | Entity 2 |
|---|---|---|---|---|---|
| 23829269 | 1 | hsf-1 | nr5a1 | nphs1 | nphs1 |
| 23829269 | 8 | wt1 | wt1 | hsf-1 | nr5a1 |
| 23829269 | 1 | POSITIVE | ACTIVE | affect | AFFECTS |
| 23829269 | 8 | POSITIVE | ACTIVE | unaffected | AFFECTS |
| 23829269 | 1 | NoData | NoData | amyloidosis | transgenic |
| 23829269 | 8 | NoData | NoData | amyloidosis | transgenic |
Fig 3Example of MeSH hierarchical structure of Alzheimer’s disease.
Fig 4Example of method to connect entity relations.
The results of relation extraction.
| Co-occurrence based ABC model | Context based ABC model | Context-assignment based ABC model |
|---|---|---|
| 53,850 | 13,640 | 33,448 |
An example of entity relations in context-based ABC model.
| PMID | Sentence_ID | Entity 1 | Entity 1 | Entity 2 | Entity 2 |
|---|---|---|---|---|---|
| 23829269 | 1 | hsf-1 | nr5a1 | nphs1 | nphs1 |
| 23829269 | 1 | hsf-1 | nr5a1 | nphs2 | nphs2 |
| 23829269 | 1 | hsf-1 | nr5a1 | wt1 | wt1 |
| 23829269 | 1 | POSITIVE | ACTIVE | Affect | AFFECTS |
| 23829269 | 1 | POSITIVE | ACTIVE | Affect | AFFECTS |
| 23829269 | 1 | POSITIVE | ACTIVE | Affect | AFFECTS |
| 23829269 | 1 | NoData | NoData | Amyloidosis | transgenic |
| 23829269 | 1 | NoData | NoData | Amyloidosis | transgenic |
| 23829269 | 1 | NoData | NoData | Amyloidosis | transgenic |
An example of entity relations in co-occurrence-based ABC model.
| PMID | Sentence_ID | Entity 1 | Entity 1 | Entity 2 | Entity 2 |
|---|---|---|---|---|---|
| 23829269 | 1 | hsf-1 | nr5a1 | nphs1 | nphs1 |
| 23829269 | 1 | hsf-1 | nr5a1 | nphs2 | nphs2 |
| 23829269 | 1 | hsf-1 | nr5a1 | wt1 | wt1 |
| 23829269 | 1 | nphs1 | nphs1 | nphs2 | nphs2 |
| 23829269 | 1 | nphs1 | nphs1 | wt1 | wt1 |
| 23829269 | 1 | nphs2 | nphs2 | wt1 | wt1 |
The example of results from context-based ABC model (APOE–MAPT).
| Path | A-B Context | B-C Context | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Entity A | Entity B | Entity C | Cell | Drug | Disease | Organ-ism | Cell | Drug | Disease | Organ-ism |
| Apoe | ins | Mapt | · | Glucose | · | · | · | Glucose | · | · |
| Apoe | c9orf72 | Mapt | · | · | frontotemporal dementia | · | · | · | frontotemporal dementia | · |
| Apoe | snca | Mapt | · | · | parkinson disease | · | · | · | parkinson disease | · |
| Apoe | snca | Mapt | · | · | parkinson disease | · | · | · | parkinson disease | · |
| Apoe | phgdh | Mapt | · | · | Dementia | · | · | · | Dementia | · |
| Apoe | snca | Mapt | · | · | supranuclear palsy, progressive | · | · | · | supranuclear palsy, progressive | · |
| Apoe | c9orf72 | Mapt | · | · | frontotemporal dementia | · | · | · | frontotemporal dementia | · |
| Apoe | snca | Mapt | · | · | parkinson disease | · | · | · | parkinson disease | · |
| Apoe | bche | Mapt | · | glucose | · | · | · | glucose | · | · |
| Apoe | c9orf72 | Mapt | · | · | frontotemporal dementia | · | · | · | frontotemporal dementia | · |
| Apoe | src | Mapt | · | l-tyrosine | · | · | · | l-tyrosine | · | · |
| Apoe | phgdh | Mapt | · | · | dementia | · | · | · | dementia | · |
| Apoe | snca | Mapt | · | · | multiple system atrophy | · | · | · | multiple system atrophy | · |
| Apoe | src | Mapt | · | l-tyrosine | · | · | · | l-tyrosine | · | · |
| Apoe | ins | Mapt | · | glucose | · | · | · | glucose | · | · |
| Apoe | ins | Mapt | · | glucose | · | · | · | glucose | · | · |
| Apoe | src | Mapt | · | l-tyrosine | · | · | · | l-tyrosine | · | · |
| Apoe | bche | Mapt | · | Glucose | · | · | · | Glucose | · | · |
| Apoe | mcidas | Mapt | · | · | Dementia | · | · | · | dementia | · |
Verification of B entities using the context-based ABC model (APOE—MAPT).
| B entity | Verifi- | Evidence | |
|---|---|---|---|
| 1 | snca | X | 1) SNCA as well as ApoE has been associated with cognitive decline in neurodegenerative disease [ |
| 2 | phgdh | O | 1) ApoE and phgdh have interaction [ |
| 3 | ins | O | 1) ApoE4 reduces brain insulin (ins) [ |
| 4 | bche | O | 1) ApoE and bche functions as modulators of cerebral amyloid deposition [ |
| 5 | c9orf72 | O | 1) c9orf72 is a stronger determinant than Apoe of cognitive impairment in ALS [ |
| 6 | mcidas | X | 1) ApoE is required for cell cycle regulation (mcidas plays a role in mitotic cell cycle progression by promoting cell cycle exit) [ |
| 7 | src | O | 1) ApoE binding stimulates intracellular activation of Src[ |
Fig 5Change of precision according to biological context similarity threshold.
Verification of B entities from the context-assignment-based ABC model (APOE-MAPT).
| B entity | Verifi- | Evidence | |
|---|---|---|---|
| 1 | app | O | 1) ApoE elevates the transcription of APP. |
| 2 | bace1 | O | 1) APOE and bace1 levels show relation |
| 3 | bche | O | 1) ApoE and bche functions as modulators of cerebral amyloid deposition [ |
| 4 | c9orf72 | O | 1) c9orf72 is a stronger determinant than Apoe of cognitive impairment in ALS [ |
| 5 | clu | X | ApoE, clu, and mapt have no direct relation. |
| 6 | cr1 | O | CR1 interacts with APOE and affects mapt. |
| 7 | ctsd | O | 1) CTSD and APOE have been associated with cognitive ability |
| 8 | cyp46a1 | O | CYP46A1 may interact with APOE to influence phospho-mapt protein |
| 9 | gfap | O | GFAP-apoE is associated with increased phosphorylation of mapt |
| 10 | grn | X | APOE and grn are in independent pathway having no relation |
| 11 | hfe | O | 1) HFE mutations correlate with APOE |
| 12 | ins | O | 1) ApoE4 reduces brain insulin (ins) [ |
| 13 | mcidas | X | 1) ApoE is required for cell cycle regulation (mcidas plays a role in mitotic cell cycle progression by promoting cell cycle exit) [ |
| 14 | phgdh | O | 1) ApoE and phgdh have interaction [ |
| 15 | picalm | O | 1) PICALM and APOE is associated |
| 16 | prnp | X | Pnrp and fus are in independent path ways not affecting each other |
| 17 | rcan1 | O | ApoE genotype shows higher levels of RCAN1 and phospho-ta |
| 18 | sars2 | X | APOE and sars2 are in independent pathway having no relation |
| 19 | snca | X | 1) SNCA as well as ApoE has been associated with cognitive decline in neurodegenerative disease [ |
| 20 | tardbp | O | 1) APOE formed complex with TARDBP |
Fig 6Comparison of Context-based, context-assignment based and co-occurrence based ABC model (APOE—MAPT).
Verification of B entities from context-based ABC model (FUS-TARDBP).
| B entity | Verifi- | Evidence | |
|---|---|---|---|
| 1 | sod1 | X | Sod1 acts independently of fus and tardbp [ |
| 2 | gli3 | O | 1) Fus is associated with ALS while, gli3 is associated with ALS through the shh pathway [ |
| 3 | c9orf72 | O | 1) Fus is associated with endosomal trafficking [ |
| 4 | vapb | O | 1) Fus disrupts the vapb interactions to other signaling proteins [ |
| 5 | mapt | O | 1) Fus alternatively splices mapt [ |
| 6 | optn | O | ALS-linked cellular aggregates, include FUS, TDP-43(TARDBP), and OPTN [ |
| 7 | ang | O | 1) Angiogenin promotes tumoral growth and angiogenesis, fus inhibitons repress angiogenesis [ |
| 8 | grn | O | 1) Grn affects tau phosphorylation [ |
| 9 | taf15 | O | TDP-43, FUS and TAF15 is associated with ALS and ALS-associated mutations identified in these genes are found in their C-terminal Gly-rich domains [ |
Verification of B entities from context-assignment based ABC model(FUS-TARDBP).
| B entity | Verifi- | Evidence | |
|---|---|---|---|
| 1 | sod1 | X | sod1 acts independently of fus and tardbp [ |
| 2 | gli3 | O | 1) Fus is associated with ALS while, gli3 is associated with ALS through the shh pathway [ |
| 3 | c9orf72 | O | 1) Fus is associated with endosomal trafficking [ |
| 4 | vapb | O | 1) Fus disrupts the vapb interactions to other signaling proteins [ |
| 5 | mapt | O | 1) Fus alternatively splices mapt [ |
| 6 | optn | O | ALS-linked cellular aggregates, include FUS, TDP-43(TARDBP), and OPTN [ |
| 7 | grn | O | 1) Grn affects tau phosphorylation [ |
| 8 | taf15 | O | TDP-43, FUS and TAF15 is associated with ALS and ALS-associated mutations identified in these genes are found in their C-terminal Gly-rich domains [ |
Example of context in relation APP-PSEN1.
| Relation | Context | |
|---|---|---|
| Name | Type | |
| APP—PSEN1 | myoclonus | DISEASE |
| Lewy body dementia | DISEASE | |
| Dementia | DISEASE | |
| transgenic mouse | ORGANISM | |
| Tumor | DISEASE | |
| Alzheimer disease | DISEASE | |
| Melatonin | DRUG | |
| t-cell | CELL | |
| amyloid plaque | DISEASE | |
Fig 7Precision, recall and F-measure of Context-based ABC model (APOE–MAPT).