| Literature DB >> 21876710 |
Huihui Zhao1, Jianxin Chen, Na Hou, Peng Zhang, Yong Wang, Jing Han, Qin Hou, Qige Qi, Wei Wang.
Abstract
Coronary heart disease (CHD) is still the leading cause of death for adults worldwide. Traditional Chinese medicine (TCM) has a history of 1000 years fighting against the disease and provides a complementary and alternative treatment to it. Syndrome is the core of TCM diagnosis and it is traditionally diagnosed based on macroscopic symptoms as well as tongue and pulse recognitions of patients. Establishment of the diagnosis method in the microcosmic level is an urgent and major problem in TCM. The aim of this study was to establish characteristic diagnosis pattern for CHD with Qi deficiency syndrome (QDS). Thirty-four biological parameters were detected in 52 patients having unstable angina (UA) with or without QDS. Then, we presented a novel data mining method, t-test-based Adaboost algorithm, to establish highest prediction accuracy with the least number of biological parameters for UA with QDS. We gained a pattern composed of five biological parameters that distinguishes UA with QDS patients from non-QDS patients. The diagnosis accuracy of the patterns could reach 84.5% based on a 3-fold cross validation technique. Moreover, we included 85 UA cases collected from hospitals located in the north and south of China to further verify the association between the pattern and QDS. The classification accuracy is 83.5%, which keeps consistent with the accuracy obtained by the cross-validation technique. The association between a symptom and the five biological parameters was established by the data mining method and it reached an accuracy of ∼80%. These results showed that the t-test-based Adaboost algorithm might be a powerful technique for diagnosing syndrome in TCM in the context of CHD.Entities:
Year: 2011 PMID: 21876710 PMCID: PMC3163006 DOI: 10.1155/2011/408650
Source DB: PubMed Journal: Evid Based Complement Alternat Med ISSN: 1741-427X Impact factor: 2.629
Figure 1The flowchart of data mining scheme.
TP, TN, FP and FN of the pattern.
| Feature selection method | TP | FN | FP | TN | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|---|---|---|
|
| 35 | 4 | 5 | 8 | 89.7 | 61.5 | 84.5 |
Figure 2The illustration of a 3-fold cross validation technique.
Figure 4The t-test-based algorithm calculated different biological parameters combinations by nine times. The optimal number of biological parameters is five.
Figure 3The absolute value of 10 biological parameters is given in a descending way.
Using 52 samples, classification models reach an accuracy of 100%.
| Samples | TP | FN | FP | TN | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|---|---|---|
| Full samples | 39 | 0 | 0 | 13 | 100 | 100 | 100 |
The five indexes and their P value in the pattern included.
| Biological parameter |
|
|---|---|
| MCH | 0.023899 |
| CHO | 0.027355 |
| PDW | 0.072665 |
| LDL | 0.083111 |
| TCO2 | 0.11374 |
The clinically further validation of the association established: it is found that the results are in accordance with the cross-validation counterpart.
| Region | Total | QDS | Non-QDS | TP | FN | FP | TN | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| NeiMeng region | 42 | 29 | 13 | 25 | 4 | 3 | 10 | 86.2 | 76.9 | 83.3 |
| Hubei region | 43 | 31 | 12 | 26 | 5 | 2 | 10 | 83.9 | 83.3 | 83.7 |
| Total | 85 | 60 | 25 | 51 | 9 | 5 | 20 | 85 | 80 | 83.5 |
The association between symptom and the five biological parameters was established by the algorithm.
| Samples | TP | FN | FP | TN | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|---|---|---|
| Full samples | 79 | 11 | 17 | 30 | 87.7 | 63.8 | 79.6 |
The classification accuracy is ∼80%.