| Literature DB >> 30076354 |
Sunyong Yoo1,2, Hojung Nam3, Doheon Lee4,5.
Abstract
Although natural compounds have provided a wealth of leads and clues in drug development, the process of identifying their pharmacological effects is still a challenging task. Over the last decade, many in vitro screening methods have been developed to identify the pharmacological effects of natural compounds, but they are still costly processes with low productivity. Therefore, in silico methods, primarily based on molecular information, have been proposed. However, large-scale analysis is rarely considered, since many natural compounds do not have molecular structure and target protein information. Empirical knowledge of medicinal plants can be used as a key resource to solve the problem, but this information is not fully exploited and is used only as a preliminary tool for selecting plants for specific diseases. Here, we introduce a novel method to identify pharmacological effects of natural compounds from herbal medicine based on phenotype-oriented network analysis. In this study, medicinal plants with similar efficacy were clustered by investigating hierarchical relationships between the known efficacy of plants and 5,021 phenotypes in the phenotypic network. We then discovered significantly enriched natural compounds in each plant cluster and mapped the averaged pharmacological effects of the plant cluster to the natural compounds. This approach allows us to predict unexpected effects of natural compounds that have not been found by molecular analysis. When applied to verified medicinal compounds, our method successfully identified their pharmacological effects with high specificity and sensitivity.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30076354 PMCID: PMC6076245 DOI: 10.1038/s41598-018-30138-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1A systematic overview of the phenotype-oriented network analysis. (a) Phenotype values of a plant were obtained by calculating the quantified relationship between phenotypes on the phenotypic network. In the phenotypic network, the RWR algorithm was performed based on the known efficacy of the plant (star), and the RWR results are shown as colored nodes. The phenotype vector of a plant was constructed based on the RWR results. (b) Plants with similar pharmacological effects were grouped by applying hierarchical clustering analysis to the matrix of phenotype vectors. Hierarchical clustering was performed by using the pvclust. The approximately unbiased p-values (bracketed values) calculated for each branch in the clustering represent the support in the data for the observed subtree clustering. Clusters with p-value over 0.95 (red box) are strongly supported by the data. (c) All natural compounds contained in the plant cluster were extracted. For each natural compound (c), Fisher’s exact test was performed to check whether the natural compound was significantly enriched in the cluster. Finally, natural compounds with p-values (p) of Fisher’s exact test lower than a threshold value (λ) were selected. (d) The pharmacological effects of an enriched natural compound were obtained by mapping the averaged phenotype vectors of the plant cluster enriched this specific natural compound.
Figure 2Quantifying the pharmacological effects of medicinal plants in the phenotypic network. (a) A phenotypic network was constructed based on the UMLS hierarchical relationships. (b) A semantic similarity between two phenotype concepts was calculated by considering the depth of the phenotypes and the distance between phenotypes. (c) In the phenotypic network, the RWR algorithm was performed based on the known efficacy of the plant (circle), and the RWR results are shown as colored nodes. (d) A transition matrix (W) is generated by the column normalization of the adjacency matrix based on the edge weights.
Figure 3Performance evaluation of identifying pharmacological effects of natural compounds. (a,b) Average AUC scores of ROC and PR for our method (blue), our method without considering hierarchical relationships (light blue) and the target-closeness method (gray) to evaluate the performance of the prediction of pharmacological effects of natural compounds. The known therapeutic and potential candidate effects were used as gold and silver standard positive sets, respectively. Average AUC scores were calculated based on (a) natural compounds and (b) phenotypes.
Literature validation was performed by comparing the co-occurrence, Jaccard index and Fisher’s exact test values among high-scored, low-scored and random sets. Statistical significance was calculated by the p-value of the Mann-Whitney U test.
|
|
|
| ||
|---|---|---|---|---|
| High-scored (H) | 1.41 | 2.2 × 10−4 | 3,281 | |
| Low-scored (L) | 0.11 | 1.1 × 10−5 | 746 | |
| Random (R) | 0.37 | 7.8 × 10−5 | 1,136 | |
| Mann-Whitney U test, | H vs L | <0.001 | <0.001 | <0.001 |
| H vs R | <0.001 | <0.001 | <0.001 | |
| L vs R | 0.028 | 0.73 | 0.052 | |
ap-value threshold of Fisher’s exact test is 0.001.
Literature evidence on the predicted pharmacological effects of natural compounds.
| Compound | Phenotype | Score | Literature evidence |
|---|---|---|---|
| Puerarin | Stroke | 0.773 | PMID: 28072733 |
| Berberine | Insomnia | 0.819 | PMID: 28579756 |
| Quercitrin | Amenorrhea | 0.804 | PMID: 22212502 |
| Spermidine | Hemorrhage | 0.765 | PMID: 14913342 |
| Choline | Anaemia | 0.818 | PMID: 15571243 |
| Genistein | Stroke | 0.773 | PMID: 29063799 |
| Eugenol | Retention of urine | 0.853 | PMID: 28733207 |
| Daidzein | Stroke | 0.773 | PMID: 26558782 |
| Amentoflavone | Asthma | 0.755 | PMID: 27916586 |
| Ononin | Chronic disease | 0.726 | PMID: 19103273 |