| Literature DB >> 32080330 |
Junfeng Peng1, Chuan Chen1, Mi Zhou2, Xiaohua Xie1, Yuqi Zhou3, Ching-Hsing Luo4.
Abstract
Patients with chronic obstructive pulmonary disease (COPD) repeat acute exacerbations (AE). Global Initiative for Chronic Obstructive Lung Disease (GOLD) is only available for patients in stable phase. Currently, there is a lack of assessment and prediction methods for acute exacerbation of chronic obstructive pulmonary disease (AECOPD) patients during hospitalization. To enhance the monitoring and treatment of AECOPD patients, we develop a novel C5.0 decision tree classifier to predict the prognosis of AECOPD hospitalized patients with objective clinical indicators. The medical records of 410 hospitalized AECOPD patients are collected and 28 features including vital signs, medical history, comorbidities and various inflammatory indicators are selected. The overall accuracy of the proposed C5.0 decision tree classifier is 80.3% (65 out of 81 participants) with 95% Confidence Interval (CI):(0.6991, 0.8827) and Kappa 0.6054. In addition, the performance of the model constructed by C5.0 exceeds the C4.5, classification and regression tree (CART) model and the iterative dichotomiser 3 (ID3) model. The C5.0 decision tree classifier helps respiratory physicians to assess the severity of the patient early, thereby guiding the treatment strategy and improving the prognosis of patients.Entities:
Mesh:
Year: 2020 PMID: 32080330 PMCID: PMC7033165 DOI: 10.1038/s41598-020-60042-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Evaluation of the C5.0 classifier voted by the 40 independent decision trees.
| Precision | Recall | F-Measure | |
|---|---|---|---|
| Mild | 85.7% | 73.2% | 0.7895 |
| Severe | 76.1% | 87.5% | 0.8140 |
| The overall accuracy is 80.3% | |||
Figure 1The receiver operating characteristic of the proposed C5.0 classifier compared with boosting C45, CART and ID3.
Summary of the C5.0 classifier constructed by 40 independent decision trees.
| Number of decision trees | Average tree size | Average misclassification number | Average misclassification rate |
|---|---|---|---|
| 40 | 29.8 | 54.8 | 16.7% |
Figure 2The variables importance of the proposed C5.0 classifier. Note that SMK, AGE, NOH, TEMP, PULSE, RES, SP, DP, PHD, BRCH, HTN, DM, CHD, CKD, MT, CD, HBV and CRHS denote smoking history, age, number of hospitalizations, temperature, pulse rate, respiratory rate, systolic pressure, diastolic pressure, pulmonary heart disease, bronchiectasis, yypertension, diabetes mellitus, coronary heart disease, chronic kidney disease, malignant tumor, cerebrovascular disease, viral hepatitis and Cirrhosis, respectively. While CRP, EO, ESR, HCRP,LC, LYM, MONO, NEUT and PCT are C-reactive protein, absolute value of eosinophils, erythrocyte sedimentation rate, high sensitivity c-reactive protein, leukocyte count, lymphocyte absolute value, nuclear cell absolute value, absolute neutrophils and detection of procalcitonin, respectively.
Sensitivity analysis on different choices of training/testing samples (Accuracy).
| Method | Sample division ratio between training set (%) and testing set (%) | |||||
|---|---|---|---|---|---|---|
| 90/10 | 80/20 | 70/30 | 60/40 | 50/50 | Mean ± STD | |
|
|
|
|
|
|
|
|
| ABC4.5 | 0.65 | 0.704 | 0.713 | 0.699 | 0.629 | 0.679 ± 0.037 |
| ABCART | 0.675 | 0.667 | 0.639 | 0.712 | 0.654 | 0.669 ± 0.027 |
| ABID3 | 0.575 | 0.728 | 0.631 | 0.644 | 0.624 | 0.64 ± 0.0555 |
Distribution of mild and severe groups in patients with AECOPD Values are expressed as mean ± standard deviation.
| Mild Group(low risk) | Severe Group(high risk) | ||
|---|---|---|---|
| Number of cases | 208 (50.7%) | 202 (49.3%) | |
| Sex | Male | 163 (78.4%) | 178 (88.1%) |
| Female | 39 (18.8%) | 30 (14.9%) | |
| Smoking history (SMK) | 136 (65.4%) | 163 (80.7%) | |
| Age(year) | 79 ± 9 | 82 ± 9 | |
| Number of hospitalizations (NOH) | 3.8 ± 2.8 | 6.3 ± 7.3 | |
| Temperature (TEMP) | 36.8 ± 0.5 | 36.7 ± 0.6 | |
| Pulse rate (PULSE) | 92.5 ± 15.6 | 98.1 ± 17.6 | |
| Respiratory rate (RES) | 21.6 ± 3.1 | 24.5 ± 6 | |
| Systolic pressure (SP) | 133.2 ± 19.3 | 132.1 ± 24.7 | |
| Diastolic pressure (DP) | 76.1 ± 12.2 | 74.5 ± 13.9 | |
| Pulmonary heart disease (PHD) | With | 36 (17.3%) | 65 (32.2%) |
| Without | 172 (82.7%) | 137 (67.8%) | |
| Bronchiectasis (BRCH) | With | 14 (7%) | 8 (4%) |
| Without | 194(93%) | 194 (96%) | |
| Hypertension (HTN) | With | 86 (41.3%) | 80 (39.6%) |
| Without | 122 (58.7%) | 122 (60.4%) | |
| Diabetes mellitus (DM) | With | 17 (8.2%) | 41 (20.3%) |
| Without | 191(91.8%) | 161 (79.7%) | |
| Coronary heart disease (CHD) | With | 21 (10.1%) | 36 (17.8%) |
| Without | 187 (89.9%) | 166 (82.2%) | |
| Chronic kidney disease (CKD) | With | 4 (2%) | 36 (2%) |
| Without | 204 (98%) | 166 (98%) | |
| Malignant tumor (MT) | With | 16 (8%) | 20 (10%) |
| Without | 192 (92%) | 182 (90%) | |
| cerebrovascular disease (CD) | With | 9(4%) | 8 (4%) |
| Without | 199 (96%) | 194 (96%) | |
| Viral hepatitis (HBV) | With | 3 (1%) | 4 (2%) |
| Without | 205 (99%) | 198 (98%) | |
| Cirrhosis (CRHS) | With | 1 (0.5%) | 1 (0.5%) |
| Without | 207 (99.5%) | 201 (99.5%) |
Figure 3The result of medical laboratory of serious and mild COPD Patients. Note that CRP, EO, ESR, HCRP,LC, LYM, MONO, NEUT and PCT are C-reactive protein, absolute value of eosinophils, erythrocyte sedimentation rate, high sensitivity c-reactive protein, leukocyte count, lymphocyte absolute value, nuclear cell absolute value, absolute neutrophils and detection of procalcitonin. Unknown, Normal and Abnormal represents no test results, the normal results and abnormal results, respectively.