| Literature DB >> 30096807 |
Sunyong Yoo1,2, Kwansoo Kim3,4, Hojung Nam5, Doheon Lee6,7.
Abstract
Identifying the health benefits of phytochemicals is an essential step in drug and functional food development. While many in vitro screening methods have been developed to identify the health effects of phytochemicals, there is still room for improvement because of high cost and low productivity. Therefore, researchers have alternatively proposed in silico methods, primarily based on three types of approaches; utilizing molecular, chemical or ethnopharmacological information. Although each approach has its own strength in analyzing the characteristics of phytochemicals, previous studies have not considered them all together. Here, we apply an integrated in silico analysis to identify the potential health benefits of phytochemicals based on molecular analysis and chemical properties as well as ethnopharmacological evidence. From the molecular analysis, we found an average of 415.6 health effects for 591 phytochemicals. We further investigated ethnopharmacological evidence of phytochemicals and found that on average 129.1 (31%) of the predicted health effects had ethnopharmacological evidence. Lastly, we investigated chemical properties to confirm whether they are orally bio-available, drug available or effective on certain tissues. The evaluation results indicate that the health effects can be predicted more accurately by cooperatively considering the molecular analysis, chemical properties and ethnopharmacological evidence.Entities:
Keywords: chemical property; ethnopharmacology; health benefits; herbal medicine; molecular analysis; network medicine; phytochemical
Mesh:
Substances:
Year: 2018 PMID: 30096807 PMCID: PMC6115900 DOI: 10.3390/nu10081042
Source DB: PubMed Journal: Nutrients ISSN: 2072-6643 Impact factor: 5.717
Health effects-related UMLS semantic types. Among 135 semantic types, the following 20 semantic types were selected as related to health effects.
| Abbreviation | Semantic Type |
|---|---|
| acab | Acquired Abnormality |
| anab | Anatomical Abnormality |
| biof | Biologic Function |
| cgab | Congenital Abnormality |
| comd | Cell or Molecular Dysfunction |
| dsyn | Disease or Syndrome |
| emod | Experimental Model of Disease |
| fndg | Finding |
| inpo | Injury or Poisoning |
| lbtr | Laboratory or Test Result |
| menp | Mental Process |
| mobd | Mental or Behavioral Dysfunction |
| neop | Neoplastic Process |
| patf | Pathologic Function |
| phsf | Physiologic Function |
| sosy | Sign or Symptom |
| clna | Clinical attribute |
| hops | Hazardous or Poisonous Substance |
| bpoc | Body Part, Organ, or Organ Component |
| tisu | Tissue |
Figure 1A systematic pipeline for the prediction of the health effects of phytochemicals. (a) Phenotype values of a phytochemical were obtained by calculating the propagated effects on the molecular network. In the molecular network, the random walk with restart (RWR) algorithm was performed based on direct targets (star) and indirect targets (triangle) of a phytochemical, in which the RWR results are shown as colored nodes. Based on gene-phenotype associations, sums of gene values are mapped to phenotypes. (b) For all phytochemicals, chemical properties, including physicochemical properties and physiological effects, were calculated. (c) Plants containing the phytochemical were extracted. For each extracted plant, we calculated the semantic similarity between the predicted health effect of the phytochemical and the ethnopharmacological effects of the plant. To do this, we constructed phenotypic network and calculated the shortest path length between phenotype pairs and depth of the phenotypes. Plants with the similarity score larger than the user-defined threshold were selected.
Figure 2An overview of the findings of the ethnopharmacological use of phytochemicals. (a) From public databases, we collected ethnopharmacological evidence of medicinal plants. We then extracted phenotype-associated terms from the narrative text of the collected information by applying the MetaMap tool. (b) For a queried phytochemical, plants containing the phytochemical were extracted. (c) For each extracted plant, we mapped its ethnopharmacological effects to the phenotypic network (blue circle). Then, we calculated semantic similarities between all possible pairs of predicted health effects of phytochemicals and ethnopharmacological effects of the plant. In this example, the semantic similarity between stroke and nephrosis is 0.57, based on the semantic similarity formula, because the depth of lcs is 2, the shortest path length between nephrosis and lcs is 1 and the shortest path length between stroke and lcs is 2. Plants with a similarity score larger than 0.8 were selected.
Figure 3The distribution of the number of predicted health effects. The distribution of the number of predicted health effects by molecular network analysis (red violin plot). The mean of predicted health effects is 415.6 ± 27.3. Next, we investigated the intersection between predicted health effects of the phytochemicals and ethnopharmacological use of the plant containing the phytochemicals. The distribution of the number of predicted health effects by molecular network analysis and ethnopharmacological use evidence (blue violin plot). The mean of predicted health effects is 129.1 ± 11.4.
The number of phytochemicals which satisfy RO5, HIA, Caco-2 and BBB. We also investigated the number of phytochemicals which satisfy two physiological effects.
| RO5 | HIA | Caco-2 | BBB | |
|---|---|---|---|---|
| RO5 | 446 | 401 | 280 | 365 |
| HIA | 482 | 330 | 407 | |
| Caco-2 | 335 | 303 | ||
| BBB | 428 |
Precision and recall performance of molecular network analysis in predicting therapeutic effects, side effects and potential candidate effects.
| Skewness | Therapeutic Effects | Side Effects | Inferred Candidates | |
|---|---|---|---|---|
| Precision | 1:1 | 0.921 ± 0.032 | 0.922 ± 0.021 | 0.942 ± 0.005 |
| 1:10 | 0.518 ± 0.059 | 0.432 ± 0.040 | 0.706 ± 0.013 | |
| All | 0.006 ± 0.001 | 0.049 ± 0.010 | 0.522 ± 0.022 | |
| Recall | All | 0.738 ± 0.062 | 0.576 ± 0.061 | 0.909 ± 0.011 |
Precision performance of the method, which uses molecular network analysis and ethnopharmacological use evidence in predicting therapeutic effects, side effects and potential candidate effects.
| Skewness | Therapeutic Effects | Side Effects | Inferred Candidates |
|---|---|---|---|
| 1:1 | 0.941 ± 0.035 | 0.761 ± 0.033 | 0.948 ± 0.014 |
| 1:10 | 0.541 ± 0.069 | 0.319 ± 0.055 | 0.732 ± 0.037 |
| All | 0.014 ± 0.003 | 0.025 ± 0.005 | 0.563 ± 0.059 |
Literature validation was performed by comparing co-occurrence, the Jaccard index and Fisher’s exact test values between predicted and random association sets. Statistical significance was calculated by the p-value of the Mann-Whitney U test.
| Co-Occurrence | Jaccard Index | |||
|---|---|---|---|---|
| Predicted association set | 1.25 | 1.8 × 10−4 | 2984 | 1341 |
| Random association set | 0.09 | 9.5 × 10−6 | 612 | 274 |
| Mann-Whitney | <0.001 | <0.001 | <0.001 | <0.001 |
1 The number of phytochemical-health effects associations which satisfy the p-value of Fisher’s exact test is lower than 0.001. 2 The number of phytochemical-health effects associations which satisfy q-value of FDR test is lower than 0.05.
Summary of evidence indicating potential health effects of exemplary phytochemicals: isoquercitrin, niacin and choline.
| Phytochemical | Potential Health Effects | Rank |
| Clinical Trials (Phase) | Exemplary Studies (PMID) |
|---|---|---|---|---|---|
| Choline | Neurological disorder | 2 | 48 | - | - |
| Alzheimer’s disease | 81 | 31 | NCT02648906 | 12787861, 15647594 | |
| Cognitive impairment | 419 | 3 | NCT01363648 | 21195433, 19304299 | |
| Pain | 6 | 44 | NCT00720343 | 15780465, 19372354, | |
| Isoquercitrin | Hypertension | 2 | 64 | NCT01691404 | 20134098, 25460361 |
| Thrombosis | 10 | 56 | NCT02195232 | 12854360, 15234778 | |
| Niacin | Neurological Disease | 2 | 128 | - | - |
| Parkinson’s Disease | 59 | 65 | NCT03462680 | 26273459, 25455298 | |
| Heart condition | 7 | 117 | NCT00120289 | 28057839, 23916935 | |
| Vascular disease | 9 | 113 | - | - | |
| Cardiovascular disease | 20 | 105 | NCT00715273 | 3295315, 19159436 |