| Literature DB >> 30867892 |
Ning Wang1, Peng Li2, Xiaochen Hu1, Kuo Yang1, Yonghong Peng3, Qiang Zhu4, Runshun Zhang5, Zhuye Gao6, Hao Xu6, Baoyan Liu7, Jianxin Chen8, Xuezhong Zhou1,7.
Abstract
Traditional Chinese Medicine (TCM) has received increasing attention as a complementary approach or alternative to modern medicine. However, experimental methods for identifying novel targets of TCM herbs heavily relied on the current available herb-compound-target relationships. In this work, we present an Herb-Target Interaction Network (HTINet) approach, a novel network integration pipeline for herb-target prediction mainly relying on the symptom related associations. HTINet focuses on capturing the low-dimensional feature vectors for both herbs and proteins by network embedding, which incorporate the topological properties of nodes across multi-layered heterogeneous network, and then performs supervised learning based on these low-dimensional feature representations. HTINet obtains performance improvement over a well-established random walk based herb-target prediction method. Furthermore, we have manually validated several predicted herb-target interactions from independent literatures. These results indicate that HTINet can be used to integrate heterogeneous information to predict novel herb-target interactions.Entities:
Keywords: Herb target prediction; Network embedding; Network medicine; Symptoms
Year: 2019 PMID: 30867892 PMCID: PMC6396098 DOI: 10.1016/j.csbj.2019.02.002
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1The overview of HTINet method. (A) HTINet firstly builds a heterogeneous network by integrating 5 types of nodes and corresponding 11 types of edges from diverse datasets (see details in Materials and Methods). (B) Secondly, the latent representations of herb and protein nodes were extracted in the heterogeneous network using an embedding learning algorithm node2vec. And the feature matrix of herb-protein interactions was constructed by Hadamard product using the feature vectors of all herb and protein nodes. (C) Finally, the classification models were constructed to predict the herb-target interactions.
Fig. 2The performance of PRINCE and HTINet for herb-target interaction prediction and parameters adjustment of HTINet. A 10-fold cross-validation procedure was used to evaluate the performance of PRINCE, DT-HTINet, RF-HTINet, LR-HTINet, SVM-HTINet, GBDT-HTINet and KNN-HTINet. The area under the ROC curve (AUROC) and area under the precision-recall curve (AUPR) were used to evaluate the performance of the model. (A) The AUROC distribution of the models. (B) The AUPR distribution of the models. (C) The AUROC distribution under different number of walks and dimensions. (D) The AUPR distribution under different number of walks and dimensions. (E) The AUROC distribution under different parameter k. (F) The AUPR distribution under different parameter k.
Fig. 3Prediction of novel herb-target interaction. The distributions of the prediction probability of all unknown herb-target pairs (A), Polygonum bistorta(B), Tussilago farfara(C) and Rhododendron dauricum(D). (E) The herb-target interaction network of Polygonum bistorta, Tussilago farfara and Rhododendron dauricum. Herb-target interactions were collected from TCMID or literature. The herb nodes were shown in red circles, and the validated targets were marked by blue circles, while the non-validated targets were shown by green circles. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)