Seung Han Baek1, Dahee Lee2, Minjoo Kim3, Jong Ho Lee3, Min Song2.
Abstract
BACKGROUND: Most earlier studies in the field of literature-based discovery have adopted Swanson's ABC model, which links pieces of knowledge entailed in disjoint literatures. However, their practicability remains an open issue, since most of them did not deal with the context surrounding the discovered associations and were usually not accompanied by clinical confirmation. In this study, we propose a method that expands and elaborates an existing hypothesis using advanced text mining techniques for capturing context. We extend the ABC model to allow for multiple B terms of various biological types.
Year: 2017 PMID: 28678852 PMCID: PMC5498031 DOI: 10.1371/journal.pone.0180539
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Open (left) and closed (right) discovery process defined by Weeber et al. [16].
Fig 2. Extension of Swanson’s ABC model.
Resources and statistics of the dictionaries for named entity recognition.
| Entity Type | Resources | Number of | Number of Entries |
|---|---|---|---|
| Gene / Protein | EntrezGene [ | 78,432 | 289,210 |
| Disease | KEGG Disease [ | 10,734 | 24,605 |
| Metabolite | HMDB [ | 80,838 | 452,273 |
| Body / Organ | Medical Subject Headings (MeSH) | 3,616 | 4,643 |
| Pathway | Gene Ontology [ | 27,895 | 27,934 |
| Total | 11 | 201,515 | 798,665 |
Fig 3. Overview of our proposed approach.
Fig 4. Visualization of a portion of the directed network generated by literature mining.
Top ranked candidates of the developed hypothesis.
| Rank | A term | Relation | 1st B term | Relation | 2nd B term | Relation | 3rd B term | Relation | C term | Average Semantic Relatedness |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | LacCer | Co-occur | MGAM | Transmit | Equol | Co-occur | Daidzein | Co-occur | Arterial Stiffness | 0.209499 |
| 2 | LacCer | Contain | Inulobiose | Co-occur | 4-aminohippuric acid | Co-occur | - | - | Arterial Stiffness | 0.143843667 |
| 3 | LacCer | Co-occur | MGAM | Co-occur | Rosuvastatin | Co-occur | 3-nitrotyrosine | Co-occur | Arterial Stiffness | 0.104826 |
| 4 | LacCer | Method | FLT3LG | Method | - | - | - | - | Arterial Stiffness | 0.060555 |
| 5 | LacCer | Contain | Inulobiose | Co-occur | - | - | - | - | Arterial Stiffness | 0.059905 |
| 6 | LacCer | Co-occur | MGAM | Co-occur | Doxazosin | Report | - | - | Arterial Stiffness | 0.044014333 |
| 7 | LacCer | Co-occur | MGAM | Co-occur | Lipoamide | Co-occur | - | - | Arterial Stiffness | 0.041227333 |
| 8 | LacCer | Report | Breast cancer | Co-occur | ATP8A2 | Co-occur | - | - | Arterial Stiffness | 0.039473333 |
| 9 | LacCer | Co-occur | MGAM | Report | Inulobiose | Co-occur | - | - | Arterial Stiffness | 0.039037333 |
| 10 | LacCer | Co-occur | FBF1 | Co-occur | Atrial fibrillation | Co-occur | KIAA0101 | Modify | Arterial Stiffness | 0.035331 |
| 11 | LacCer | Report | Tay-sachs disease | Report | - | - | - | - | Arterial Stiffness | 0.03115 |
| 12 | LacCer | Co-occur | ENG | Report | Malondialdehyde | Co-occur | - | - | Arterial Stiffness | 0.027345333 |
| 13 | LacCer | Co-occur | SEPT5 | Plain | POMT1 | Co-occur | SLC26A3 | Increase | Arterial Stiffness | 0.027308 |
| 14 | LacCer | Co-occur | Nitric Oxide | Co-occur | Malondialdehyde | Co-occur | - | - | Arterial Stiffness | 0.026963333 |
| 15 | LacCer | Co-occur | CAMK4 | Co-occur | FN1 | Co-occur | - | - | Arterial Stiffness | 0.026768 |
| 16 | LacCer | Co-occur | MGAM | Co-occur | ATP8A2 | Co-occur | - | - | Arterial Stiffness | 0.025134333 |
| 17 | LacCer | Co-occur | Propyl Gallate | Co-occur | ATP8A2 | Co-occur | - | - | Arterial Stiffness | 0.025118667 |
| 18 | LacCer | Co-occur | Acrylamide | Report | CAMK4 | Co-occur | FN1 | Co-occur | Arterial Stiffness | 0.0242185 |
| 19 | LacCer | Plain | ATN1 | Co-occur | CISH | Report | - | - | Arterial Stiffness | 0.021235 |
| 20 | LacCer | Co-occur | ABCB1 | Increase | ELOVL6 | Co-occur | - | - | Arterial Stiffness | 0.021224 |
| 21 | LacCer | Increase | fut4[GE] | Co-occur | HMGB1 | Modify | - | - | Arterial Stiffness | 0.019730333 |
| 22 | LacCer | Increase | Phosphate | Co-occur | Folic acid | Decrease | - | - | Arterial Stiffness | 0.019715 |
| 23 | LacCer | Co-occur | FBF1 | Modify | ETV3 | Co-occur | ATP8A2 | Co-occur | Arterial Stiffness | 0.01826025 |
| 24 | LacCer | Increase | LPA | Co-occur | - | - | - | - | Arterial Stiffness | 0.0169255 |
| 25 | LacCer | Increase | DYM | Modify | - | - | - | - | Arterial Stiffness | 0.015764 |
In the Entity columns, square brackets contain the biological type information of the corresponding entity. [MB]: Metabolite, [GE]: Gene/Protein, [BP]: Biological Process/Pathway, [DS]: Disease, and [BD]: Body/Organ. LacCer is the acronym for lactosylceramide.
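The ranking in the table above can be illustrated with a minimal sketch. It assumes that the "Average" column is the mean of per-edge semantic relatedness scores along a directed path from the A term to the C term, which is consistent with the listed values but is not stated in this excerpt. The toy network, its node names, its weights, and the helper names `build_adjacency` and `find_paths` are all hypothetical.

```python
from statistics import mean

# Hypothetical toy network: (source, target) -> (relation label, relatedness score).
# Node names and scores are illustrative only; they are not values from the study.
EDGES = {
    ("A", "B1"): ("Co-occur", 0.30),
    ("B1", "B2"): ("Increase", 0.10),
    ("B2", "C"): ("Co-occur", 0.20),
    ("A", "B3"): ("Report", 0.05),
    ("B3", "C"): ("Co-occur", 0.15),
    ("B1", "C"): ("Co-occur", 0.02),
}


def build_adjacency(edges):
    """Turn the edge dictionary into source -> [(target, relation, score), ...]."""
    adjacency = {}
    for (src, dst), (relation, score) in edges.items():
        adjacency.setdefault(src, []).append((dst, relation, score))
    return adjacency


def find_paths(adjacency, start, goal, max_b_terms=3):
    """Enumerate simple directed paths start -> B... -> goal with 1..max_b_terms
    intermediate terms, and rank them by the mean score of their edges."""
    results = []

    def walk(node, visited, scores):
        # visited holds the start node followed by the B terms chosen so far.
        for nxt, _relation, score in adjacency.get(node, []):
            if nxt in visited:
                continue  # keep paths simple (no repeated nodes)
            new_scores = scores + [score]
            if nxt == goal:
                if len(visited) - 1 >= 1:  # require at least one B term
                    results.append((visited[1:], mean(new_scores)))
            elif len(visited) - 1 < max_b_terms:
                walk(nxt, visited + [nxt], new_scores)

    walk(start, [start], [])
    return sorted(results, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for b_terms, avg in find_paths(build_adjacency(EDGES), "A", "C"):
        print(" -> ".join(["A", *b_terms, "C"]), round(avg, 4))
```

Capping the number of intermediate terms at three mirrors the three B-term columns of the table; a different cap only changes the `max_b_terms` argument.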
Top ranked candidates with multiple B terms of metabolites.
| Rank | A term | Relation | 1st B term | Relation | 2nd B term | Relation | 3rd B term | Relation | C term | Average Semantic Relatedness |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | LacCer | Contain | Inulobiose | Co-occur | 4-Aminohippuric acid | Co-occur | - | - | Arterial Stiffness | 0.1438 |
| 2 | LacCer | Co-occur | Nitric Oxide | Co-occur | Malondialdehyde | Increase | - | - | Arterial Stiffness | 0.0270 |
| 3 | LacCer | Increase | Phosphate | Co-occur | Folic acid | Decrease | - | - | Arterial Stiffness | 0.0197 |
| 4 | LacCer | Increase | Phosphate | Co-occur | Hydrogen carbonate | Contain | Sucrose | Co-occur | Arterial Stiffness | 0.0036 |
| 5 | LacCer | Plain | Silicon | Plain | Hydrogen carbonate | Contain | Sucrose | Co-occur | Arterial Stiffness | -0.0021 |
| 6 | LacCer | Increase | LacCer sulfate | Plain | Phospholipid | Report | - | - | Arterial Stiffness | -0.0034 |
In the Entity columns, square brackets contain the biological type information of the corresponding entity. [MB]: Metabolite, [GE]: Gene/Protein, [BP]: Biological Process/Pathway, [DS]: Disease, and [BD]: Body/Organ. LacCer is the acronym for lactosylceramide. ‘Plain’ describes relations for which the verb extracted between the two entities expresses no causality and cannot be classified under any of our categories. ‘Co-occur’ indicates that no verb exists to describe the relation between the two entities.
Clinical and biochemical characteristics in male subjects under 50 yrs.
| | Smoker ( | Non-smoker ( | Total ( | P-value |
|---|---|---|---|---|
| Age (year) | 39.7±0.92 | 41.4±0.90 | 40.6±0.65 | 0.284 |
| Body mass index (kg/m²) | 23.7±0.51 | 23.9±0.47 | 23.8±0.34 | 0.982 |
| Systolic BP (mmHg) | 121.0±2.22 | 120.6±2.23 | 120.8±1.56 | 0.982 |
| Diastolic BP (mmHg) | 75.2±1.83 | 76.5±1.96 | 75.9±1.33 | 0.676 |
| Triglyceride (mg/dL) | 127.6±9.11 | 116.0±14.8 | 121.5±8.83 | 0.113 |
| Total-cholesterol (mg/dL) | 179.6±7.04 | 193.3±7.45 | 186.7±5.19 | 0.132 |
| HDL-cholesterol (mg/dL) | 50.3±1.87 | 54.5±3.35 | 52.5±1.97 | 0.454 |
| LDL-cholesterol (mg/dL) | 103.8±6.75 | 115.6±6.51 | 110.0±4.72 | 0.111 |
| Glucose (mg/dL) | 89.8±2.40 | 91.8±2.01 | 90.9±1.54 | 0.366 |
| Insulin (μIU/dL) | 8.31±0.69 | 9.35±0.84 | 8.85±0.55 | 0.317 |
| Malondialdehyde (nmol/mL) | 9.98±0.65 | 8.86±0.39 | 9.39±0.38 | 0.253 |
| Nitric oxide (μmol/L) | 34.1±3.81 | 37.8±3.48 | 36.1±2.56 | 0.410 |
| ba-PWV (cm/s) | 1316.2±34.0 | 1280.8±26.7 | 1297.7±21.3 | 0.575 |
| Lactosylceramide (d18:1/12:0) | 3156288±246573 | 2437072±290662 | 2781045±197400 | 0.086 |
Mean ± SE.
∮ Tested after logarithmic transformation. P-values were derived from an independent t-test between smokers and non-smokers.
Fig 5. Statistical relations of lactosylceramide, nitric oxide, malondialdehyde, and ba-PWV.
Relationship of lactosylceramide (d18:1/12:0), nitric oxide, malondialdehyde, and ba-PWV in male subjects under 50 yrs. Tested by Pearson correlation after log transformation. Each panel lists three r values, in the order smoker, non-smoker, total. (A) r = -0.739, P<0.001; r = -0.388, P = 0.061; r = -0.551, P<0.001. (B) r = -0.751, P<0.001; r = -0.400, P = 0.053; r = -0.612, P<0.001. (C) r = 0.526, P = 0.012; r = 0.628, P = 0.001; r = 0.570, P<0.001. (D) r = 0.527, P = 0.012; r = 0.414, P = 0.044; r = 0.470, P = 0.001.
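The correlation test described in this legend can be sketched as follows. This is a minimal illustration that assumes both variables are strictly positive and that both are log-transformed before the Pearson coefficient is computed; the legend does not say which variables were transformed. The function names and the sample values are hypothetical and are not data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def log_pearson_r(xs, ys):
    """Pearson r after natural-log transforming both variables (assumption)."""
    return pearson_r([math.log(x) for x in xs], [math.log(y) for y in ys])


if __name__ == "__main__":
    # Hypothetical measurements, chosen only to exercise the functions.
    marker = [1.0, 10.0, 100.0]
    outcome = [8.0, 4.0, 2.0]
    print(round(log_pearson_r(marker, outcome), 6))
```

Computing the coefficient separately for each subgroup and for the pooled sample would give the three r values reported per panel.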
Fig 6. Overall view of nitric oxide, malondialdehyde, and ba-PWV with lactosylceramide.
Fig 7. Network relations of lactosylceramide, nitric oxide, malondialdehyde, and arterial stiffness.
Fig 8. Scatterplot of database-based versus semantic relatedness score (both normalized).