Literature DB >> 22475180

Inquiry diagnosis of coronary heart disease in Chinese medicine based on symptom-syndrome interactions.

Guo-Zheng Li1, Sheng Sun, Mingyu You, Ya-Lei Wang, Guo-Ping Liu.   

Abstract

BACKGROUND: There is a long history of coronary heart disease (CHD) diagnosis and treatment in Chinese medicine (CM), but a formalized description of CM knowledge is still unavailable. This study aims to analyze a set of CM clinical data, which is important and urgent.
METHODS: Relative associated density (RAD) was used to analyze the one-way links between the symptoms or syndromes or both. RAD results were further used in symptom selection.
RESULTS: Analysis of a dataset of clinical CHD diagnosis revealed some significant relationships, not only between syndromes but also between symptoms and syndromes. Using RAD to select symptoms based on different classifiers improved the accuracy of syndrome prediction. Compared with other traditional symptom selection methods, RAD provided a higher interpretability of the CM data.
CONCLUSION: The RAD method is effective for CM clinical data analysis, particular for analysis of relationships between symptoms in diagnosis and generation of compact and comprehensible symptom feature subsets.

Entities:  

Year:  2012        PMID: 22475180      PMCID: PMC3341182          DOI: 10.1186/1749-8546-7-9

Source DB:  PubMed          Journal:  Chin Med        ISSN: 1749-8546            Impact factor:   5.455


Background

Western medicine classifies coronary heart disease (CHD) as a kind of myocardial dysfunction and organic lesion, occasionally accompanied by coronary artery stenosis and vertebrobasilar insufficiency [1]. In contrast, Chinese medicine (CM) classifies CHD as a type of chest paralysis and heart pain, for which effective diagnosis and treatment are available [2]. CM treatment is based primarily on syndrome differentiation and physiology and pathology of Zang-fu organs and meridians. In CM, a symptom represents an observable indicator of abnormality, while a syndrome is the disease state manifested by symptoms. The connections between symptoms and syndromes in CM are not clearly defined. Therefore, it is necessary to delineate different relationships between symptoms and syndromes and explain the diagnosis results in comprehensible terms [3]. Machine learning builds empirical models on data for analysis and forecasting, which has recently been used for CM data analysis. Huang and Gao [4] reviewed several classifiers of data mining in CM. Li and Huang [5] used fuzzy neural network for analysis of CM ingredients. Wang et al. [6] used a decision tree method to generate prediction models for CM hepatitis data and liver cirrhosis data. Zhang et al. [7] combined factor and cluster analysis in the classification of CM syndromes related to post-hepatitic cirrhosis. Zhang et al. [8] used latent tree models to aid CM diagnosis. Knowledge discovery in database (KDD) [9], rough set [10], and expert system [11], have also been applied to CM. Most CM machine learning works does not consider the medical meaning and links among features. However, CM data contain a large quantity of symptoms or syndromes which have specific medical meaning. Therefore, seeking the links between features including symptoms and syndromes in CM data analysis is also important. Conventional methods usually use only one numerical value to describe the relationship of two symptoms. In this study, we use a pair of characteristic values to describe a relative link between the symptoms as a relative associated density (RAD). By analysing the characteristic value pairs, we searched significant one-way links between symptoms and confirmed the links according to CM theory [12,13]. The RAD method was also used to find one-way links among multiple syndromes in the clinical data. Among a large number of symptoms in CM diagnosis data sets for a certain disease, some symptoms may be redundant. Therefore, selecting major or relevant symptoms is crucial to the performance of machine learning. Wang et al. [14] used support vector machine (SVM) to generalize symptom weights in CHD predictions. Liu et al. [15] used symptom frequency analysis to enhance modelling results in learning. Zhou et al. [16] developed a clinical reference information model (RIM) and a physical data model to manage various entities and relationships in CM clinical data. Principal component analysis (PCA) [17], partial least squares (PLS) [18], maximum relevance and minimum redundancy (MRMR) [19] have been used to perform symptom selection to improve prediction accuracy. The results from conventional primary symptom selection or reduction methods are difficult to be interpreted in CM. For instance, PCA reduces symptom dimensionality at the expense of loss of medical meaning [20]. Although MRMR can predict fairly using only a few major symptoms [21], the results are often inconsistent with basic CM theory [12,13]. This study aims to use RAD to perform symptom selection, and evaluate whether the results can be better explained by CM theory [12,13].

Methods

Data set of CHD in CM

A total of 555 clinical cases were collected from the cardiology departments of Longhua Hospital, Shuguang Hospital, Shanghai Renji Hospital, and Shanghai Hospital of CM form March 2007 to May 2008 to compile the CHD data set used in this study. It could be obtained from the address http://levis.tongji.edu.cn/gzli/publication.htm[15]. Out of the 555 cases, 265 patients (47.7%) were male, age (mean ± standard deviation): 65.15 ± 13.17 and 290 patients (52.3%) are female, age: 65.24 ± 13.82. The symptoms collected from inquiry diagnosis include 125 symptoms in eight dimensions (cold or warm, sweating, head, body, chest and abdomen, urine and stool, appetite, sleeping, mood, and gynecology). The differentiation diagnosis includes 15 syndromes, as described in Liu et al. [15]. For unification of the results, specific types and feeling information of some symptoms were combined and some symptoms unique to females were deleted. The variables analyzed in this study include 63 symptoms and 10 syndromes. The 63 included symptoms were listed in Table 1. The 10 included syndromes were (I) heart-qi deficiency syndrome; (II) heart-yang deficiency syndrome; (III) heart-yin deficiency syndrome; (IV) heart-blood deficiency syndrome; (V) turbid phlegm syndrome; (VI) blood stasis syndrome; (VII) qi stagnation syndrome; (VIII) heart-fire hyperactivity syndrome; (IX) heart-kidney yang deficiency syndrome; (X) cardiopulmonary-qi deficiency syndrome.
Table 1

The 63 symptoms in the data set

No.Symptom
1Chills

2Cold limbs

3Dampness-heat

4Feverish palms and soles

5Spontaneous sweating

6Night sweat

7Palpitation

8Chest distress

9Chest pain

10Short breath/dyspnea/suffocation

11Edema

12Hypodynamia

13Dysphoria

14Paroxysmal night dyspnea

16Amnesia

16Dizziness

17Tinnitus

18Mouth and tongue sore

19Cough

20Cough with sputum

21Hiccup

22Acid regurgitation

23Gastric stuffiness

24Gastralgia

25Epigastric upset

26Nausea and vomiting

27Heavy breathing

28Lateral thorax distending pain

29Abdomen distending pain

30Soreness and weakness of waist and knees

31Numbness of hands and feet

32Body soreness

33Thirsty and dry pharynx

34Absence of thirst and no desire for water drink

35Intake of fluid failing resolve thirst

36Like cold drink

37Like hot drink

38Poor appetite and less amount of food

39Always hungry

40Hunger without desire to eat

41Bitter taste

42Mucosity in mouth

43Tastelessness in mouth

44Loose stool

45Water like stool

46Diarrhea with undigested food

47Diarrhea in the morning

48Stool sometimes sloppy and sometimes bound

49Constipation

50Dry stool like sheep feces

51Non-smooth defecation or tenesmus

52Clear urine in large amounts

53Dark urine

54Frequent micturition

55Deficient urine

56Stranguria

57Urinating burning heat

58Dribble of urine

59The frequent and increased urination at night

60Aggravating gloom

61Sleepiness

62Impetuosity and susceptibility to rage

63Easily frightened and scared
The 63 symptoms in the data set

The RAD method

Probability and statistics

In the medical diagnosis of CHD, frequency of symptom occurrence may be different. For instance, the chest tightness symptom and the dizziness symptom are frequent symptoms, while the sleepiness symptom and the diarrhea with undigested food symptom are rare symptoms. In the data analysis, the first step is to distinguish between the frequent and the rare symptoms. In probability of symptoms, Pfstands for the appearance probability of the ith symptom across all cases, which is defined as where F= 1 if the ith symptom appears in the mth case, or else F= 0. N denotes the number of the cases. Similarly, Plstands for the appearance probability of the ith syndrome across all cases, which is defined as where the ith syndrome appears in the mth sample, L= 1, or else L= 0.

Building the symptom-symptom interaction network

Equations (1) and (2) calculate the appearance probability of all symptoms and syndromes. But these values cannot reveal their potential connections. Symptom-symptom interaction (SSI) network in the same manner as used for human social networks was used to find the connections [21,22]. When two different symptoms occur simultaneously in the same case, sign G= 1 indicating that symptom Fand symptom Fappear at the same time in the mth case, or else G= 0. Fstands for the number of simultaneous occurrences of Fand F. Then for N cases, which contains two types of information: the frequency of features and the relevancy of two features.

Relative associated density

Equation (3) is largely concerned with the frequency of symptoms. In other words, frequent relationships between symptoms are obvious, while less frequent relationships are hard to be detected. The difference is even more than 300 folds. Therefore, this study used RAD, which uses conditional probability to measure the relationships of symptoms and syndromes. The term C(Fi, Fj) represents the RAD values of symptom Fassociated with Fand use C(Fj, Fi) represents the RAD values of symptom Fassociated with F. According,

Symptom selection with RAD

In the mth case, if symptom Fappears with syndrome L= 1; otherwise, H= 0. Then for all N cases, RAD estimates the influence of the appearance probability on the interaction between a symptom and a syndrome. Equation (6) calculates the RAD value between symptoms and syndromes, This kind of association could be recognized as the contribution of one symptom to the syndrome. Each syndrome was considered a single label; thus we selected corresponding symptoms regardless of their RAD values. For each single label prediction, the symptoms with low RAD values were removed one by one, and the predictions were calculated with SVM and KNN. The symptoms that lead to the highest prediction were recorded as the result of symptom selection. MRMR symptom selection was used for a comparison [19]. The idea of MRMR is to search the optimal subset by maximizing relevance while minimizing redundancy based on mutual information. To maintain consistency with the RAD method, we used SVM [23] and KNN [24] for classification. To evaluate the prediction results, we calculated the true positive rate (TPR), and true negative rate (TNR) criteria: TPR = TP/(TP + FN), TNR = TN/(FP + TN), where TP is the number of true positives, TN is that of true negatives, FP is that of false positives, and FN is that of false negatives. The G-means criterion was used to describe the equilibrium of the positive and negative classes of the prediction results, where G-means = (TPR * TNR)1/2.

Results and discussion

RAD performed better than MRMR in feature selection for machine learning to discover CM relationships among the symptoms, syndromes, and even between the symptoms and syndromes in a CHD data set. RAD analysis found one-way connections among symptoms and the syndromes that are consistent with CM theory. RAD not only improves prediction accuracy but also enhanced interpretability.

Common and rare symptoms

We used equation (1) to determine the symptom frequency in the data set. The first 20 frequent symptoms were identified as listed in Table 2. Table 3 lists the first 10 rare symptoms in the data set.
Table 2

The most frequent symptoms and their appearance probability

OrderSymptomAppearance probability
1Chest distress78.6%

2Short breath/dyspnea/suffocation69.7%

3Hypodynamia65.4%

4Palpitation64.5%

5Soreness and weakness of waist and knees50.8%

6Chest pain48.6%

7Thirsty and dry pharynx48.6%

8Dizziness48.5%

9Aggravating gloom43.4%

10Dysphoria40.4%

11Spontaneous sweating39.1%

12Numbness of hands and feet37.1%

13Night sweat36.2%

14Tinnitus35.1%

15Chills35.0%

16Cough32.6%

17Impetuosity and susceptibility to rage32.3%

18The frequent and increased urination at night29.5%

19Like cold drink25.9%

20Cough with sputum25.4%
Table 3

The 10 rarest appeared symptoms and their frequency

OrderSymptomFrequency
1Urinating burning heat0.2%

2Sleepiness0.7%

3Diarrhea in the morning0.9%

4Hunger without desire to eat1.1%

5Non-smooth defecation or tenesmus1.3%

6Water like stool1.4%

7Diarrhea with undigested food1.4%

8Stool sometimes sloppy and sometimes bound1.6%

9Dribble of urine2%

10Always hungry2.2%
The most frequent symptoms and their appearance probability The 10 rarest appeared symptoms and their frequency SSI was calculated by equation (3). Figure 1 shows a network constructed from the SSI results, i.e., the frequency and relationship among the symptoms. Table 4 lists the important symptoms shown in Figure 1.
Figure 1

The network of SSI. The points denote the symptoms; solid lines connect the high SSI.

Table 4

Symptoms with high SSI values shown in Figure 1

SymptomSymptom
TinnitusSoreness and weakness of waist and knees

Spontaneous sweatingThirsty and dry pharynx

Impetuosity and susceptibility to rageChills

PalpitationAggravating gloom

Numbness of hands and feetNight sweat

Chest painCough

HypodynamiaLike cold drink

DizzinessCough with sputum

Short breath/dyspnea/suffocationThe frequent and increased urination at night

DysphoriaChest distress
The network of SSI. The points denote the symptoms; solid lines connect the high SSI. Symptoms with high SSI values shown in Figure 1 CHD was identified as a kind of deficiency syndromes or excess syndromes. As shown in Tables 2 and 4, CHD was associated with kidney deficiency, diet disloyalty, mental disturbance, cold pathogen invasion, and other factors. CHD occurred in the heart but was related to the liver, the kidney, and the spleen. CHD was also bound with heart-qi deficiency, heart-yang deficiency, heart-blood deficiency, and heart-yin deficiency. The imbalance of liver, kidney, and spleen was often accompanied by turbid phlegm syndrome, qi stagnation syndrome, blood stasis syndrome. From the first 20 most frequent symptoms, the symptoms of chest distress, hard breath/dyspnoea/suffocation, palpitation, and chest pain were found to be the locating syndrome of syndrome patterns of the heart, in consistency with modern clinical practice of CHD in CM. Other symptoms among the top 20 were also basic factors in CM heart system diseases diagnosis [12,13]. Table 3 lists the top 10 rare symptoms and their probabilities. The symptoms of the heart syndrome patterns were hunger without desire to eat and water-like stool symptom. This result was also consistent with CM theory [12,13].

Analysis using the RAD method

RAD analysis of the SSI networks was used to determine the connections between symptoms, and identified major symptoms in CHD. Equation (4) was used to determine the RAD values of SSI, as shown in Table 5.
Table 5

Some RAD values of SSI

Fi
ChillsCold limbsDampness-heatSpontaneous sweatingPalpitationChest distressChestpain

Fj

 Chills0.0%71.5%28.8%36.9%41.1%37.6%38.1%

 Cold limbs45.4%0.0%22.0%21.7%25.7%22.9%23.3%

 Dampness-heat8.8%10.6%0.0%15.7%11.7%11.0%10.4%

 Spontaneous sweating41.2%38.2%57.6%0.0%41.9%42.4%41.5%

 Palpitation75.8%74.8%71.2%69.1%0.0%68.6%61.5%

 Chest distress84.5%81.3%81.4%85.3%83.5%0.0%80.7%

 Chest pain53.1%51.2%47.5%51.6%46.4%50.0%0.0%
Some RAD values of SSI Pand Palways appeared as a pair. Some symptoms were obviously one-way connections. For example, only 11.4% of occurrences of the hard breath symptom were accompanied by the hot flash symptom, while 74.6% of occurrences of the hot flash symptom appeared with the hard breath symptom. This was typical one-way connection between two symptoms. Table 6 lists more connections between two symptoms. CM theory holds that chills occur with yang asthenia [12,13]. Yin asthenia occurs with hot flashes and night sweats [12,13]. The probabilities of chills appearing with hot flashes and night sweats is low, and their occurring probabilities are 0.087 and 0.061, separately.
Table 6

One-way connections between symptoms

SymptomRAD (L to R)SymptomsRAD (R to L)
28 Lateral thorax distending pain0.5291 Chills0.046

56 Stranguria0.5711 Chills0.041

47 Diarrhea in the morning0.6003 Dampness-heat0.050

28 Lateral thorax distending pain0.5295 Spontaneous sweating0.041

42 Mucosity in mouth0.7645 Spontaneous sweating0.059

43 Tastelessness in mouth0.5505 Spontaneous sweating0.050

52 Clear urine in large amounts0.6255 Spontaneous sweating0.046

53 Dark urine0.5715 Spontaneous sweating0.055

42 Mucosity in mouth0.5296 Night sweat0.044

14 Paroxysmal night dyspnea0.9337 Palpitation0.078

25 Epigastric upset0.8267 Palpitation0.053

35 Intake of fluid failing resolve thirst0.7007 Palpitation0.058
One-way connections between symptoms Table 6 also lists the RAD values of one-way connections between symptoms. For instance, the probability of chills accompanied by body coldness was 71.5%, while the probability of body coldness accompanied by chills was only 45.4%. These unequal results indicate that a patient suffering from chills would be more likely to have the body coldness symptom. By contrast, a patient suffering from body coldness would be less likely to have the chills symptom. Furthermore, the locating symptom of chest distress occurred with qualitative and locating symptoms, such as paroxysmal night dyspnoea or orthopnoea, tastelessness and tediousness, nausea and vomiting, epigastric upset, deficient urine, dark urine, feverish palms and soles, intake of fluid failing to resolve thirst, stool resembling sheep's droppings. When paroxysmal night dyspnoea or orthopnoea happened, chest distress symptoms rarely appeared at the same time. Therefore, the one-way connections between the symptoms calculated by RAD explained the clinical results in CM. For example, yang asthenia was the representation of chills, and when chills present, distending pain in the hypochondrium and urine astringent pain appeared at the same time. However, the latter two symptoms did not represent chills; thus, they would not be accompanied by the symptom of chills. For another example, spontaneous sweating was an expression of the qi asthenia symptom and possibly appeared with distending pain in the hypochondrium, a sticky slimy sensation in the mouth, dark urine, but not vice versa. From these two examples, we can see that the contribution of chills to yang asthenia was greater than that of spontaneous sweating to qi asthenia. In the meantime, we may infer that distending pain in the hypochondrium, a sticky slimy sensation in the mouth, and dark urine are not typical features of qi asthenia and yang asthenia. This association analysis of symptoms can show which symptoms are major features and identify possible relationships between symptoms and syndromes. This kind of analysis would provide an objective basis for standardization of dialectic diagnosis.

Relationships among the syndromes

Table 7 shows the frequencies of all 10 syndromes calculated using equation (2). Table 8 lists the RAD values of the syndrome.
Table 7

Frequency values of 10 syndromes

OrderSyndromeFrequency
1Blood stasis syndrome (VI)76.0%

2Heart-qi deficiency syndrome (I)60.9%

3Turbid phlegm syndrome (V)48.3%

4Heart-yin deficiency syndrome (III)38.6%

5Heart-yang deficiency syndrome (II)31.4%

6Qi stagnation syndrome (VII)20.7%

7Heart-kidney yang deficiency syndrome (IX)11.7%

8Heart-fire hyperactivity syndrome (VIII)5.4%

9Heart-blood deficiency syndrome (IV)2.9%

10Cardiopulmonary-qi deficiency syndrome (X)2.5%
Table 8

RAD values of syndromes

Li
12345678910

Lj

 10.000.010.810.690.620.640.590.600.030.71

 20.010.000.080.060.330.300.270.100.970.00

 30.510.100.000.130.440.380.310.600.090.43

 40.030.010.010.000.030.020.020.030.000.00

 50.490.500.550.440.000.550.500.430.460.79

 60.800.730.750.630.870.000.840.530.630.86

 70.200.180.170.130.220.230.000.330.110.14

 80.050.020.080.060.050.040.090.000.020.07

 90.010.360.030.000.110.100.060.030.000.00

 100.030.000.030.000.040.030.020.030.000.00
Frequency values of 10 syndromes RAD values of syndromes

High correlation of the syndromes

Relevant analysis of the relationships between syndromes found high correlations in heart-qi insufficiency, such as heart-yin deficiency, heart-blood deficiency, turbid phlegm, blood stasis, qi stagnation, heart-fire hyperactivity, and cardiopulmonary qi deficiency. For example, blood stasis was highly correlated with heart-qi insufficiency, heart-yang insufficiency, heart-yin deficiency, heart-blood deficiency, turbid phlegm, qi stagnation, heart-kidney yang deficiency, and cardiopulmonary qi deficiency. The one-way RAD values of these syndromes were 0.80, 0.73, 0.75, 0.63, 0.87, 0.84, 0.63, and 0.86, respectively. The finding of high correlation of heart-qi insufficiency with heart-blood deficiency and heart-yin deficiency is consistent with CM theory that a long period of heart-qi insufficiency would result in yin blood, causing fluid and blood deficiency and then qi yin deficiency [25]. In consistency with this theory, qi yin deficiency syndrome was common. The correlations of heart-qi insufficiency with turbid phlegm, blood stasis, qi stagnation, heart-fire hyperactivity, and cardiopulmonary qi deficiency were high, and consistent with the feature of deficiency syndrome or excess syndrome of CHD [12,13]. According to CM theory [12,13], turbid phlegm, qi stagnation, and blood stasis are symptoms, while qi deficiency is the radical that causes heart vessel stagnation and then CHD. The high RAD values of turbid phlegm and cardiopulmonary qi deficiency would explain that cardiopulmonary qi deficiency causes retention of water and dampness, and then sputum and more turbid phlegm [12,13]. The high degree of correlation of blood stasis with heart-qi insufficiency, heart-yang insufficiency, heart-yin deficiency, heart-blood deficiency, turbid phlegm, qi stagnation, heart-kidney yang deficiency, and cardiopulmonary qi deficiency indicates that blood stasis appeared in these syndromes. According to CM theory [12,13], heart controlling the blood vessel, yang asthenia, and qi asthenia may cause degradation of driving blood ability, and then blood stasis. Heart-fire hyperactivity and heat scorching blood viscous may cause blood stasis [12,13]. Qi stagnation and poor blood flow may also cause blood stasis [12,13]. Blood stasis may be the basic pathogenesis of CHD [26].

One-way connection of the syndromes

Table 8 shows some syndrome pairs with obvious one-way connections. For example, the RAD value of heart-qi insufficiency to insufficiency of the heart blood was 0.69, but the reversed RAD value was only 0.03. The RAD value of heart-qi insufficiency to heart-fire hyperactivity was 0.60, while the reversed RAD was 0.05. Table 9 summarizes the one-way connections of the syndrome pairs.
Table 9

One-way connections of the syndrome pairs

SyndromeRAD (L to R)SyndromeRAD (R to L)
Heart-blood deficiency syndrome0.687Heart-qi deficiency syndrome0.032
Heart-fire hyperactivity syndrome0.600Heart-qi deficiency syndrome0.053
Cardiopulmonary-qi deficiency syndrome0.714Heart-qi deficiency syndrome0.029
Heart-fire hyperactivity syndrome0.600Heart-yin deficiency syndrome0.084
Cardiopulmonary-qi deficiency syndrome0.785Turbid phlegm syndrome0.041
Heart-blood deficiency syndrome0.625Blood stasis syndrome0.023
Qi stagnation syndrome0.834Blood stasis syndrome0.227
Heart-fire hyperactivity syndrome0.533Blood stasis syndrome0.037
Heart-kidney yang deficiency syndrome0.630Blood stasis syndrome0.097
Cardiopulmonary-qi deficiency syndrome0.857Blood stasis syndrome0.028
One-way connections of the syndrome pairs Taking heart-qi insufficiency and insufficiency of the heart blood as an example, CM theory [12,13] emphasizes the interdependence between qi and blood, and long-term qi insufficiencies will cause blood deficiency. However, insufficiency of the heart blood is not always accompanied by heart-qi insufficiency [12,13]. In elder patients, viscera function is weak, a pure sthenic syndrome is rare, and an asthenia with sthenia syndrome is more common. The RAD value of heart-qi insufficiency to heart-yin deficiency was 0.81, indicating that most CHD patients were qi asthenia together with yin asthenia. According to CM theory [12,13], heart-fire hyperactivity is not directly related to heart-qi insufficiency or insufficiency of heart-yin. High one-way connections were found for blood stasis to cardiopulmonary qi deficiency, insufficiency of the heart blood, heart-fire hyperactivity, qi stagnation, and heart-kidney yang deficiency. However, the RAD values of reversed connections were low, indicating that blood stasis was not the only reason for CHD.

Two-ways connections of the syndrome

In addition to the observations of one-way connections, two-way connections were also found. For example, the mutual RAD values of blood stasis and qi asthenia were 0.80 and 0.64, respectively, indicating that these two syndromes were highly correlated. CM theory [12,13] holds that qi asthenia and then poor blood flow would lead to blood stasis, in reverse. Long-term blood stasis may also cause qi asthenia. These two syndromes causally influence with each other.

Relationships between symptoms and syndromes

According to CM theory [12,13], a symptom is an expression of internal syndrome, and a syndrome is essential to symptom appearance. The RAD results (Table 10) calculated by equation (6) showed the one-way connections of symptoms to syndromes, whose connections could be viewed as the contributions of symptoms to syndromes.
Table 10

Some RAD values between symptoms and syndromes

Symptom
SyndromeChillsColdlimbsNightsweatPalpitationChestdistressChestpain

Heart-qi deficiency0.2600.1270.3670.6270.7900.441

Heart-yang deficiency0.5920.4370.3100.6840.7820.546

Heart-yin deficiency0.2940.1820.5090.6960.8270.453

Heart-blood deficiency0.2500.2500.2500.6250.7500.250

Turbid phlegm0.3540.2390.3730.7010.8020.522

Blood stasis0.3480.2160.3440.6520.7870.512

Qi stagnation0.3740.2350.4000.6700.7390.522
Some RAD values between symptoms and syndromes Figure 2 illustrates the data in Table 10, where the x-axis represents the 63 symptoms and the y-axis represents the 10 syndromes. Red rectangles represent high RAD values, and the blue ones represent low RAD values. From Figure 2, the correlations between symptoms and syndromes were determined. As shown in Figure 2, the symptoms of palpitation, chest distress, short breath, weakness, soreness, and weakness of waist and knees were related to most of the syndromes. At the same time, chills and some other symptoms showed strong connections to some syndromes, such as heart-kidney yang deficiency and yang asthenia. Table 11 lists the symptoms and syndromes with high and low RAD values. In Table 11, chills showed a low relation to most of the syndromes except for heart-yang insufficiency and heart-kidney yang deficiency, indicating that chills were closely related to the latter syndromes. CM theory [12,13] holds that weakness of yang and qi and lack of warmth may cause chills. The high RAD values of night sweats to insufficiency of heart-yin did confirm the CM theory that yang cannot be restricted by yin asthenia, and then deficiency fire will be an internal disturbance and cause night sweats [12,13]. Constipation and insufficiency of heart blood showed a strong connection. Inner Canon of Yellow Emperor points out that "people over 40 years old may lose half of the yin qi", and CM theory [12,13] holds that insufficiency of the heart blood causes body fluid deficiency, which in turn causes insufficient lubrication of the colon, leading to constipation. The strong connections between nocturnal frequent micturition and heart-kidney yang deficiency can be explained by the lack of yang in the heart and kidney which resulted in a decrease of the controlling and qi transformation functions, bladder retention failure, and then nocturnal frequent micturition.
Figure 2

The RAD values of symptoms to syndromes.

Table 11

Symptoms with relative high and low RAD values to syndromes

SymptomSyndrome
Strong relation

 ChillsHeart-yang deficiency syndrome, Heart-kidney yang deficiency syndrome

 Night sweatHeart-yin deficiency syndrome, Cardiopulmonary-qi deficiency syndrome

 CoughCardiopulmonary-qi deficiency syndrome

 Soreness and weakness ofwaist and kneesHeart-blood deficiency syndrome

 ConstipationHeart-blood deficiency syndrome

 The frequent and increasedurination at nightHeart-kidney yang deficiency syndrome

 EdemaCardiopulmonary-qi deficiency syndrome

 Chest painHeart-blood deficiency syndrome

Weak relation

 The frequent and increasedurination at nightHeart-blood deficiency syndrome, Cardiopulmonary-qi deficiency syndrome

 EdemaHeart-blood deficiency syndrome
The RAD values of symptoms to syndromes. Symptoms with relative high and low RAD values to syndromes The weak connections (Table 11) of chest pain and insufficiency of the heart blood, nocturnal frequent micturition and insufficiency of the heart blood, and edema and insufficiency of the heart blood were also significant and consistent with CM theory [12,13].

Symptom selection with RAD

In this study, RAD was used for symptom selection, and then SVM [23] and K-nearest neighbours (KNN) [24] were used for the prediction. Table 11 shows individual contributions of symptoms to the syndromes. The predictions were not sound as the syndromes 4, 8, 9, and 10 in this data set showed serious imbalance; therefore, we omitted these results. For syndromes 1, 2, 3, 5, 6, and 7, (Table 12), the results were much better. Table 12 indicates that the prediction results with MRMR favoured either the positive class or the negative class. In the G-means results of the syndromes, these maximum values were obtained by the RAD method, indicating that RAD achieved a good balance between the positive class and the negative class. Although for some syndromes, the prediction results of RAD and MRMR were close when the TPR, TNR, and G-means values were all considered. In general, the results obtained by RAD were more reasonable.
Table 12

Statistical Results of TPR, TNR and G-means by using SVM and KNN with RAD and MRMR or without symptom selection

Syndrome123567Average
No SymptomSelection-SVMTPR0.7080.4630.7290.4720.7990.9060.680

TNR0.4110.7700.5350.6020.5160.6670.583

G-m0.5390.5970.6250.5330.6420.7770.630

RAD-SVMTPR0.7230.5180.7860.5880.7960.7710.713

TNR0.4290.7810.5470.5360.5920.8650.609

G-m0.5570.6360.6560.5610.6860.8170.652

MRMR-SVMTPR0.9550.3370.1310.4120.9550.0200.468

TNR0.0700.8930.9700.7040.0270.9700.606

G-m0.2590.5490.3560.5390.1610.1410.334

No SymptomSelection-KNNTPR0.7570.5530.4390.4610.8260.5340.595

TNR0.3800.7950.7320.6570.6730.7840.670

G-m0.5360.6630.5670.5500.7460.6470.631

RAD-KNNTPR0.7490.6700.5090.4850.8870.5220.607

TNR0.3910.7120.7290.6630.7040.8510.706

G-m0.5410.6910.6010.5670.7900.6670.643

MRMR-KNNTPR1.0000.4010.1700.3540.9420.1610.505

TNR0.1460.9010.9810.7830.0180.8970.621

G-m0.3820.6010.4090.5260.1300.3790.405
Statistical Results of TPR, TNR and G-means by using SVM and KNN with RAD and MRMR or without symptom selection

Conclusions

The RAD method is effective for CM clinical data analysis, particular for analysis of relationships between symptoms in diagnosis and generation of compact and comprehensible symptom feature subsets.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

GZL designed the study, supervised the data analysis, and organized discussion of the results. MYY designed the experiment and write the manuscript. SS dedicated in experiment results analysis and manuscript revision. YLW implemented the analysis method and performed the experiments. GPL participated into analysis implementation, data acquisition, and result discussion. All authors read and approved the final manuscript.
  10 in total

1.  Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.

Authors:  Hanchuan Peng; Fuhui Long; Chris Ding
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-08       Impact factor: 6.226

Review 2.  Knowledge discovery in traditional Chinese medicine: state of the art and perspectives.

Authors:  Yi Feng; Zhaohui Wu; Xuezhong Zhou; Zhongmei Zhou; Weiyu Fan
Journal:  Artif Intell Med       Date:  2006-08-22       Impact factor: 5.326

3.  Data mining and predictive modeling of biomolecular network from biomedical literature databases.

Authors:  Xiaohua Hu; Daniel D Wu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2007 Apr-Jun       Impact factor: 3.710

4.  Development of traditional Chinese medicine clinical data warehouse for medical knowledge discovery and decision support.

Authors:  Xuezhong Zhou; Shibo Chen; Baoyan Liu; Runsun Zhang; Yinghui Wang; Ping Li; Yufeng Guo; Hua Zhang; Zhuye Gao; Xiufeng Yan
Journal:  Artif Intell Med       Date:  2010-02-01       Impact factor: 5.326

5.  Metabonomic study on 'Kidney-Yang Deficiency syndrome' and intervention effects of Rhizoma Drynariae extracts in rats using ultra performance liquid chromatography coupled with mass spectrometry.

Authors:  Xiumei Lu; Zhili Xiong; Jingjing Li; Shuning Zheng; Taoguang Huo; Famei Li
Journal:  Talanta       Date:  2010-09-24       Impact factor: 6.057

6.  [Combined use of factor analysis and cluster analysis in classification of traditional Chinese medical syndromes in patients with posthepatitic cirrhosis].

Authors:  Qin Zhang; Wen-Tong Zhang; Jian-Jun Wei; Xian-Bo Wang; Ping Liu
Journal:  Zhong Xi Yi Jie He Xue Bao       Date:  2005-01

7.  Latent tree models and diagnosis in traditional Chinese medicine.

Authors:  Nevin L Zhang; Shihong Yuan; Tao Chen; Yi Wang
Journal:  Artif Intell Med       Date:  2007-12-21       Impact factor: 5.326

8.  [Multicentric randomized double blinded clinical study of Yiqi Tongmai Oral Liquid against angina pectoris in patients with coronary heart disease].

Authors:  Shuo Zhang; Yan-qin Song; Wang Yue; Xing-rong Mao; Chuan-xia Ju; Meng-jiu Dong; Qiong-li Zheng; Xiao-hua Dai; Zhong-ye Li; Sha-ping Wang
Journal:  Zhong Xi Yi Jie He Xue Bao       Date:  2007-07

9.  Modelling of inquiry diagnosis for coronary heart disease in Traditional Chinese Medicine by using multi-label learning.

Authors:  Guo-Ping Liu; Guo-Zheng Li; Ya-Lei Wang; Yi-Qin Wang
Journal:  BMC Complement Altern Med       Date:  2010-07-20       Impact factor: 3.659

10.  Relative risk of cardiovascular and cancer mortality in people with severe mental illness from the United Kingdom's General Practice Rsearch Database.

Authors:  David P J Osborn; Gus Levy; Irwin Nazareth; Irene Petersen; Amir Islam; Michael B King
Journal:  Arch Gen Psychiatry       Date:  2007-02
  10 in total
  17 in total

1.  Big data is essential for further development of integrative medicine.

Authors:  Guo-zheng Li; Bao-yan Liu
Journal:  Chin J Integr Med       Date:  2015-05-03       Impact factor: 1.978

2.  Supervised redundant feature detection for tumor classification.

Authors:  Xue-Qiang Zeng; Guo-Zheng Li
Journal:  BMC Med Genomics       Date:  2014-10-22       Impact factor: 3.063

Review 3.  Scientific computation of big data in real-world clinical research.

Authors:  Guozheng Li; Xuewen Zuo; Baoyan Liu
Journal:  Front Med       Date:  2014-09-03       Impact factor: 4.592

4.  Detection of herb-symptom associations from traditional chinese medicine clinical data.

Authors:  Yu-Bing Li; Xue-Zhong Zhou; Run-Shun Zhang; Ying-Hui Wang; Yonghong Peng; Jing-Qing Hu; Qi Xie; Yan-Xing Xue; Li-Li Xu; Xiao-Fang Liu; Bao-Yan Liu
Journal:  Evid Based Complement Alternat Med       Date:  2015-01-11       Impact factor: 2.629

5.  Patient classification of hypertension in Traditional Chinese Medicine using multi-label learning techniques.

Authors:  Guo-Zheng Li; Zehui He; Feng-Feng Shao; Ai-Hua Ou; Xiao-Zhong Lin
Journal:  BMC Med Genomics       Date:  2015-09-23       Impact factor: 3.063

6.  Computerized tongue image segmentation via the double geo-vector flow.

Authors:  Miao-Jing Shi; Guo-Zheng Li; Fu-Feng Li; Chao Xu
Journal:  Chin Med       Date:  2014-02-08       Impact factor: 5.455

7.  A network-based approach to investigate the pattern of syndrome in depression.

Authors:  Jianglong Song; Xi Liu; Qingqiong Deng; Wen Dai; Yibo Gao; Lin Chen; Yunling Zhang; Jialing Wang; Miao Yu; Peng Lu; Rongjuan Guo
Journal:  Evid Based Complement Alternat Med       Date:  2015-03-02       Impact factor: 2.629

Review 8.  Advances in Patient Classification for Traditional Chinese Medicine: A Machine Learning Perspective.

Authors:  Changbo Zhao; Guo-Zheng Li; Chengjun Wang; Jinling Niu
Journal:  Evid Based Complement Alternat Med       Date:  2015-07-12       Impact factor: 2.629

9.  A new prognostic scale for the early prediction of ischemic stroke recovery mainly based on traditional Chinese medicine symptoms and NIHSS score: a retrospective cohort study.

Authors:  Ke-Gang Cao; Cai-Hong Fu; Huan-Qin Li; Xi-Yan Xin; Ying Gao
Journal:  BMC Complement Altern Med       Date:  2015-11-16       Impact factor: 3.659

10.  Qualitative and quantitative analysis for facial complexion in traditional Chinese medicine.

Authors:  Changbo Zhao; Guo-zheng Li; Fufeng Li; Zhi Wang; Chang Liu
Journal:  Biomed Res Int       Date:  2014-05-22       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.