Ali Z Ijaz, Min Song, Doheon Lee.
Abstract
BACKGROUND: Since Swanson proposed the Undiscovered Public Knowledge (UPK) model, there have been many approaches to uncover UPK by mining the biomedical literature. These earlier works, however, required substantial manual intervention to reduce the number of possible connections and were applied mainly to disease-effect relations. With advances in biomedical science, it has become imperative to extract and combine information from multiple disjoint studies and articles to infer new hypotheses and expand knowledge.
Year: 2010 PMID: 20406501 PMCID: PMC3165192 DOI: 10.1186/1471-2105-11-S2-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. Swanson’s UPK model – the connection of fish oils and Raynaud disease
Figure 2. Data flow of MKEM
Figure 3. SEPDB information model
Effect list and types
| Effect | Type |
|---|---|
| Induce | Increase |
| Contribute | Increase |
| Reduce | Reduction |
| Increase | Increase |
| Resistant | Reduction |
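
The effect list above amounts to a lookup from an effect verb to a normalized effect type. A minimal sketch, assuming a plain dictionary and case-insensitive matching; the names `EFFECT_TYPES` and `effect_type` are illustrative and do not come from the paper:

```python
# Effect verbs and their types, copied from the table above.
EFFECT_TYPES = {
    "induce": "Increase",
    "contribute": "Increase",
    "reduce": "Reduction",
    "increase": "Increase",
    "resistant": "Reduction",
}


def effect_type(effect):
    """Return the type for an effect verb, or None if it is not listed.

    Case-insensitive matching is an assumption made for this sketch.
    """
    return EFFECT_TYPES.get(effect.strip().lower())


if __name__ == "__main__":
    for verb in ("Induce", "Resistant", "bind"):
        print(verb, "->", effect_type(verb))
```

Returning `None` for an unlisted verb leaves the decision about unknown effects to the caller instead of guessing a type.
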
Figure 4. Rule creation
Similarity measure comparative values
| MetaMap Type | Structural Similarity | Atomic Count | XLogP |
|---|---|---|---|
| 0: Not Similar | 0: Not Similar | 0: Not Similar | 0: Not Similar |
| 1: Similar | 0.5: Substructure | 1: Similar | 0.5: Somewhat similar |
Calculated similarity measure for two substances
| | Cordycepin | Fludarabine | Similarity |
|---|---|---|---|
| MetaMap Type | Pharmacologic Substance | Pharmacologic Substance | 1 |
| Structural Similarity | 0.9 | | 1 |
| Atomic Count | C10H13N5O3 | C10H12FN5O4 | 1 |
| XLogP | -1.25 | -1.38 | 1 |
Extracted entities count
| Entity Type | # of extracted entities |
|---|---|
| Substances | 410 |
| Processes | 357 |
| Diseases | 44 |
| Body Parts | 82 |
Figure 5. Formulae
System performance analysis
| Accuracy | Precision | Recall |
|---|---|---|
| 56% | 75% | 56% |
Sample dataset with raw sentences and extracted information
| PubMed ID | Substance | Effect Type | Process | Disease | Body Part |
|---|---|---|---|---|---|
| 19264955 | fisetin | increase | apoptosis | N/A | HCT-116 Cells |
| 19262372 | Docetaxel | increase | apoptosis | N/A | N/A |
| 18070986 | Wogonin | increase | Apoptosis | N/A | malignant T Cells |
| 19258429 | Tolfenamic Acid | increase | Sp protein degradation | N/A | Cancer cell lines |
Example relationships
| Substance | Effect Type | Process | Disease | Body Part |
|---|---|---|---|---|
| Wogonin | Increase | Apoptosis | N/A | Malignant T Cells |
| Fisetin | Increase | Apoptosis | N/A | HCT-116 Cells |
Similarity measure
| | Wogonin | Fisetin | Similarity |
|---|---|---|---|
| MetaMap Type | Organic Chemical | Organic Chemical | 1 |
| Structural Similarity | 0.75 | | 1 |
| Atomic Count | C16H12O5 | C15H10O6 | 1 |
| XLogP | 2.74 | 2.77 | 1 |
| Total | | | 4 |
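
The total in the table above is consistent with simply adding the four per-measure similarity scores. A minimal sketch under that assumption; the function name `total_similarity` and the dictionary layout are illustrative, and the code does not show how each per-measure score is derived from the raw values:

```python
# Per-measure similarity scores for Wogonin vs. Fisetin, taken from the
# "Similarity" column of the table above.
WOGONIN_FISETIN = {
    "MetaMap Type": 1,
    "Structural Similarity": 1,
    "Atomic Count": 1,
    "XLogP": 1,
}


def total_similarity(measure_scores):
    """Sum the per-measure scores (assumed aggregation: a plain sum)."""
    return sum(measure_scores.values())


if __name__ == "__main__":
    print(total_similarity(WOGONIN_FISETIN))
```

Since each measure contributes at most 1, a pair that matches on all four measures reaches the maximum total of 4.
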
Newly formed relationships
| Substance | Effect Type | Process | Disease | Body Part | Score |
|---|---|---|---|---|---|
| Wogonin | Increase | Apoptosis | N/A | HCT-116 Cells | 4 |
| Fisetin | Increase | Apoptosis | N/A | Malignant T Cells | 4 |
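
One reading of the tables above is that two relationships sharing an effect type and a process exchange their body parts, and each new relationship carries the similarity total of the two substances as its score. The sketch below implements only that reading; the `Relationship` class, the `cross_relationships` function, and the matching rule are assumptions for illustration, not the paper's stated algorithm:

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Relationship:
    substance: str
    effect_type: str
    process: str
    disease: str
    body_part: str


def cross_relationships(a, b, score):
    """Exchange body parts between two relationships that share an effect
    type and a process (assumed rule). Returns (relationship, score) pairs,
    or an empty list when the two relationships do not match."""
    same_effect = a.effect_type.lower() == b.effect_type.lower()
    same_process = a.process.lower() == b.process.lower()
    if not (same_effect and same_process):
        return []
    return [
        (replace(a, body_part=b.body_part), score),
        (replace(b, body_part=a.body_part), score),
    ]


if __name__ == "__main__":
    wogonin = Relationship("Wogonin", "Increase", "Apoptosis", "N/A", "Malignant T Cells")
    fisetin = Relationship("Fisetin", "Increase", "Apoptosis", "N/A", "HCT-116 Cells")
    for rel, score in cross_relationships(wogonin, fisetin, 4):
        print(rel.substance, rel.body_part, score)
```

With the Wogonin and Fisetin example relationships and a score of 4, this yields the two rows listed in the table above.
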
Sample of newly formed relationships and associated scores
| Substance | Effect Type | Process | Disease | Body Part | Score |
|---|---|---|---|---|---|
| Wogonin | Increase | Apoptosis | N/A | HCT-116 Cells | 4 |
| Fisetin | Increase | Apoptosis | N/A | Malignant T Cells | 4 |
| Docetaxel | Increase | mRNA expression of IL-1 | N/A | N/A | 3.5 |
| Genistein | Increase | Apoptosis | N/A | HCT-116 Cells | 2 |
| Fisetin | Increase | Apoptosis | N/A | Tumor Cells | 2 |