| Literature DB >> 29234434 |
Lidong Wang1, Yin Zhang2, Yun Zhang3, Xiaodong Xu4, Shihua Cao1.
Abstract
Determining a prescription's function is one of the challenging problems in Traditional Chinese Medicine (TCM). In past decades, TCM has been widely researched through various methods in computer science, but none concentrates on the prediction method for a new prescription's function. In this study, two methods are presented concerning this issue. The first method is based on a novel supervised topic model named Label-Prescription-Herb (LPH), which incorporates herb-herb compatibility rules into learning process. The second method is based on multilabel classifiers built by TFIDF features and herbal attribute features. Experiments undertaken reveal that both methods perform well, but the multilabel classifiers slightly outperform LPH-based method. The prediction results can provide valuable information for new prescription discovery before clinical test.Entities:
Year: 2017 PMID: 29234434 PMCID: PMC5662811 DOI: 10.1155/2017/8279109
Source DB: PubMed Journal: Evid Based Complement Alternat Med ISSN: 1741-427X Impact factor: 2.629
Figure 1The framework of our methods.
Figure 2Graphical model of improved Labeled LDA.
Dosage standardization for “Ma Huang Tang” (g).
| Ma Huang Tang |
|
|
|
|
|---|---|---|---|---|
|
| 9 | 2 | 9 | 0.82 |
|
| 6 | 3 | 9 | 0.50 |
|
| 6 | 4.5 | 9 | 0.44 |
|
| 3 | 1.5 | 9 | 0.29 |
An example of a formula.
| Formula | Ma Huang Tang |
| Herbs |
|
| Function | Relieving exterior syndrome |
The detailed information about “Ephedrae Herba.”
| Herb |
|
| Efficiency | Inducing perspiration, relieving superficies by cooling, opening the inhibited lung-energy, relieving asthma, clearing dam, subsidence of a swelling |
| Nature & flavor | Spicy, slightly bitter, warm |
| Channel tropism | Lungs, bladder |
| Usual dosage | 2 g ~ 9 g |
Experimental results of compatibility rule mining.
| Number of returned herb pairs | Precision@ | Number of returned herb pairs | Precision@ |
|---|---|---|---|
| 100 | 100/100 | 1000 | 913/1000 |
| 200 | 200/200 | 1100 | 974/1100 |
| 300 | 294/300 | 1200 | 1026/1200 |
| 400 | 383/400 | 1300 | 1078/1300 |
| 500 | 472/500 | 1400 | 1135/1400 |
| 600 | 550/600 | 1500 | 1166/1500 |
| 700 | 630/700 | 1600 | 1171/1600 |
| 800 | 711/800 | 1700 | 1173/1700 |
| 900 | 809/900 | 1800 | 1174/1800 |
Figure 3Detected 1500 pairs of herbs.
Topics discovered by LPH model.
| Cleaning heat | Probability | Relieving uneasiness of mind | Probability |
|---|---|---|---|
|
| 0.05953 |
| 0.04842 |
|
| 0.05431 |
| 0.03805 |
|
| 0.03238 |
| 0.03574 |
|
| 0.02507 |
| 0.03259 |
|
| 0.02403 |
| 0.02017 |
|
| 0.02403 |
| 0.01960 |
|
| 0.02298 |
| 0.01615 |
|
| 0.02194 |
| 0.01615 |
|
| 0.01881 |
| 0.01384 |
|
| 0.01776 |
| 0.01384 |
|
| 0.01672 |
| 0.01038 |
|
| 0.01567 |
| 0.01038 |
|
| 0.01463 |
| 0.01038 |
|
| 0.01254 |
| 0.01038 |
|
| 0.01254 |
| 0.01038 |
|
| 0.01254 |
| 0.01038 |
|
| 0.01254 |
| 0.00923 |
|
| 0.01150 |
| 0.00923 |
|
| 0.01150 |
| 0.00923 |
|
| 0.00856 |
| 0.00923 |
Topics discovered by LPH model.
| Replenishing and restoring | Probability | Dispelling internal cold | Probability |
|---|---|---|---|
|
| 0.05533 |
| 0.04842 |
|
| |||
|
| 0.05297 |
| 0.03805 |
|
| |||
|
| 0.03708 |
| 0.03574 |
|
| |||
|
| 0.03120 |
| 0.03459 |
|
| |||
|
| 0.02767 |
| 0.02421 |
|
| |||
|
| 0.02649 |
| 0.01960 |
|
| |||
|
| 0.02531 |
| 0.01615 |
|
| |||
|
| 0.02096 |
| 0.01499 |
|
| |||
|
| 0.01325 |
| 0.01384 |
|
| |||
|
| 0.01325 |
| 0.01384 |
|
| |||
|
| 0.00943 |
| 0.01384 |
|
| |||
|
| 0.00943 |
| 0.01269 |
|
| |||
|
| 0.00943 | | 0.01154 |
|
| |||
|
| 0.00943 |
| 0.01038 |
|
| |||
|
| 0.00943 |
| 0.01038 |
|
| |||
|
| 0.00825 |
| 0.01038 |
|
| |||
|
| 0.00825 |
| 0.00923 |
|
| |||
|
| 0.00825 |
| 0.00923 |
|
| |||
|
| 0.00825 |
| 0.00923 |
|
| |||
|
| 0.00707 | | 0.00820 |
Topics discovered by Labeled LDA model.
| Cleaning heat | Probability | Relieving uneasiness of mind | Probability |
|---|---|---|---|
|
| 0.03172 |
| 0.04112 |
|
| 0.02984 |
| 0.03945 |
|
| 0.02773 |
| 0.03712 |
|
| 0.02678 |
| 0.03226 |
|
| 0.02421 |
| 0.03226 |
|
| 0.01933 |
| 0.02110 |
|
| 0.01933 |
| 0.02110 |
|
| 0.01847 |
| 0.02110 |
|
| 0.01847 |
| 0.01958 |
|
| 0.01847 |
| 0.01646 |
|
| 0.01811 | | 0.01617 |
| | 0.01652 |
| 0.01617 |
|
| 0.01584 |
| 0.01617 |
| Cinnamomi Ramulus | 0.01437 |
| 0.01025 |
|
| 0.01437 |
| 0.00943 |
| | 0.01394 |
| 0.00943 |
|
| 0.01394 |
| 0.00943 |
|
| 0.01386 |
| 0.00872 |
|
| 0.00945 |
| 0.00845 |
|
| 0.00835 | | 0.00845 |
Average performance of topic model-based method.
| Threshold | Labeled LDA | LPH | ||||
|---|---|---|---|---|---|---|
| Precision | Recall | Micro- | Precision | Recall | Micro- | |
| 1 | 0.6102 | 0.1187 | 0.1987 | 0.8124 | 0.1025 | 0.1820 |
| 1 | 0.7317 | 0.2658 | 0.3899 | 0.6075 | 0.2031 | 0.3044 |
| 1 | 0.6567 | 0.3278 | 0.4373 | 0.6874 | 0.3295 | 0.4455 |
| 1 | 0.5927 | 0.4076 | 0.4830 |
|
| 0.5300 |
| 1 | 0.5365 | 0.4127 | 0.4665 | 0.6267 | 0.4203 | 0.5031 |
Average performance of multilabel classifiers.
| Classifier | Feature space | Precision | Recall | Micro- |
|---|---|---|---|---|
| SVM | TFIDF | 0.6202 | 0.3945 | 0.4822 |
| Attributes | 0.6510 | 0.4102 | 0.5033 | |
| TFIDF + attributes |
| 0.4823 |
| |
|
| ||||
| Adaboost | TFIDF | 0.5729 | 0.3102 | 0.4025 |
| Attributes | 0.6856 | 0.3358 | 0.4508 | |
| TFIDF + attributes | 0.6894 | 0.3475 | 0.4621 | |
|
| ||||
| Bayes Network | TFIDF | 0.5126 | 0.4325 | 0.4691 |
| Attributes | 0.6179 | 0.4218 | 0.5013 | |
| TFIDF + attributes | 0.6397 | 0.5124 | 0.5690 | |