| Literature DB >> 31979314 |
Hsiang-Yuan Yeh1, Chia-Ter Chao2,3,4, Yi-Pei Lai1, Huei-Wen Chen4.
Abstract
Natural products are the most important and commonly used in Traditional Chinese Medicine (TCM) for healthcare and disease prevention in East-Asia. Although the Meridian system of TCM was established several thousand years ago, the rationale of Meridian classification based on the ingredient compounds remains poorly understood. A core challenge for the traditional machine learning approaches for chemical activity prediction is to encode molecules into fixed length vectors but ignore the structural information of the chemical compound. Therefore, we apply a cost-sensitive graph convolutional neural network model to learn local and global topological features of chemical compounds, and discover the associations between TCM and their Meridians. In the experiments, we find that the performance of our approach with the area under the receiver operating characteristic curve (ROC-AUC) of 0.82 which is better than the traditional machine learning algorithm and also obtains 8%-13% improvement comparing with the state-of-the-art methods. We investigate the powerful ability of deep learning approach to learn the proper molecular descriptors for Meridian prediction and to provide novel insights into the complementary and alternative medicine of TCM.Entities:
Keywords: Meridian classification; Traditional Chinese Medicine; graph convolutional neural network
Year: 2020 PMID: 31979314 PMCID: PMC7036907 DOI: 10.3390/ijerph17030740
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1The entire workflow of our study.
Figure 2The simple illustration of the graph convolutional neural network (GCN) model.
Figure 3Cosine similarity among Meridians.
The performance among different features and methods.
| Features | Methods | ROC-AUC |
|---|---|---|
| ECFP4 | Logistic regression | 0.66 |
| ECFP4 | Random forest | 0.67 |
| ECFP4 | AdaBoost | 0.65 |
| ECFP4 | NN | 0.68 |
| Neural fingerprint | Cost-sensitive GCN | 0.82 |
The performance of ROC-AUC in training, validation, and test datasets among 12 Meridians.
| Meridian | Train | Validate | Test |
|---|---|---|---|
| Bladder | 0.89 | 0.86 | 0.85 |
| Cardiovascular | 0.97 | 0.94 | 0.94 |
| Gallbladder | 0.91 | 0.87 | 0.87 |
| Heart | 0.82 | 0.80 | 0.79 |
| Kidney | 0.81 | 0.78 | 0.78 |
| Large intestine | 0.88 | 0.84 | 0.84 |
| Liver | 0.79 | 0.75 | 0.77 |
| Lung | 0.79 | 0.77 | 0.75 |
| Small intestine | 0.97 | 0.94 | 0.93 |
| Spleen | 0.80 | 0.78 | 0.78 |
| Stomach | 0.78 | 0.74 | 0.75 |
| Three end | 0.99 | 0.94 | 0.95 |
The ROC-AUC performance of the split methods.
| Split Methods | ROC-AUC |
|---|---|
| Index | 0.60 |
| Random | 0.67 |
| Scaffold | 0.63 |
| Random stratified | 0.82 |
Figure 4(a) ROC-AUC performance among different numbers of hidden layers; (b) ROC-AUC performance among different numbers of hidden neurons.
The ROC-AUC performance of the model compared with state-of-the-art methods.
| Features | Methods | ROC-AUC |
|---|---|---|
| Average fingerprint | Random forest [ | 0.65 |
| Neural fingerprint | GCN [ | 0.70 |
| Neural fingerprint | Cost-sensitive GCN | 0.78 |
Figure 5The major substructure of the heart Meridian exists in the component of the herb Lily Bulb, Colchicine.
Figure 6(a) Vascular smooth muscle cells were subjected to control medium (left upper) and high inorganic phosphate containing osteogenic medium without (middle upper) and with 0.1 (right upper), 1 (left lower), 5 (middle lower), and 10 (right lower) microM ATX. ATX, astaxanthin; Ctrl, control Pi, inorganic phosphate; ATX, astaxanthin; (b) the barplot of the relative alizarin red (AR) stain density.