| Literature DB >> 34906123 |
Junfeng Peng1, Mi Zhou2, Kaiqiang Zou3, Xiongyong Zhu3, Jun Xu3, Yi Teng3, Feifei Zhang3, Guoming Chen3.
Abstract
BACKGROUND: Due to the complexity and high heterogeneity of the acute exacerbation of chronic obstructive pulmonary disease (AECOPD), the guidelines (global initiative for chronic obstructive, GOLD) is unable to fully guide the treatment of AECOPD.Entities:
Keywords: Chronic obstructive pulmonary disease; Machine learning; Medical decision support system; Real-world data
Mesh:
Year: 2021 PMID: 34906123 PMCID: PMC8670199 DOI: 10.1186/s12911-021-01708-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Framework of multi-stage feature fusion
Basic information of study participants (values are expressed as mean ± standard deviation)
| Mild group | Serious group | |
|---|---|---|
| Number of cases | 220 (53.9%) | 188 (46.1%) |
| Smoking | 173 (78.6%) | 132 (70.2%) |
| Age | 78.8 ± 9.1 | 81.5 ± 9.2 |
| Sex (male) | 188 (85.5%) | 157 (83.5%) |
| Sex (female) | 32 (14.5%) | 31 (16.5%) |
| Number of hospitalization | 3.6 ± 2.6 | 6.5 ± 7.3 |
| Temperate | 36.8 ± 0.5 | 36.7 ± 0.6 |
| Respiratory rate | 21.7 ± 3.2 | 24.7 ± 6.4 |
| Systolic pressure | 133.1 ± 19.1 | 131.6 ± 24.7 |
| Diastolic pressure | 75.8 ± 12.3 | 74.4 ± 13.4 |
| Cor pulmonary | 43 (19.55%) | 64 (34.04%) |
| Bronchiectasis | 13 (5.91%) | 8 (4.26%) |
| Hypertension | 96 (43.64%) | 73 (38.83%) |
| Diabetes | 21 (9.55%) | 34 (18.09%) |
| Coronary disease | 24 (10.91%) | 29 (15.43%) |
| Chronic kidney diseases | 6 (2.73%) | 4 (2.13%) |
| Malignant tumour | 18 (8.18%) | 16 (8.51%) |
| Cerebrovascular disease1 | 9 (4.09%) | 8 (4.26%) |
| Viral hepatitis | 3 (1.36%) | 2 (1.06%) |
| Liver cirrhosis | 1 (0.45%) | 1 (0.53%) |
Feature selected in the phased data sets of AECOPD patients
| Phase | Data snippet | Description | Feature sets |
|---|---|---|---|
| k = 1 | Seg_1 | Basics | Gender, age, et al. |
| k = 2 | Seg_2 | Comorbidities | Pulmonary heart disease,et al. |
| k = 3 | Seg_3 | Inflammations | C-reactive protein, et al. |
| k = 4 | Seg_4 | Metabolism | Oxygen saturation, et al. |
Fig. 2Evaluation of the framework of multi-stage feature fusion with multiple classifiers. a Performance evaluation of the MSFF on the stage data D1–D4 generated by cross-validation method. While b represents the experimental results by independent test (non-cross-validation method). The x-axis represents the phased data set. The y-axis denotes the prediction accuracy of the proposed framework
Performance evaluation of the MSFF on the stage data D1–D4 generated by cross-validation method
| Phased data set | Model | acc | sn | sp | mcc | Curve |
|---|---|---|---|---|---|---|
| D1 | RF | 0.750 | 0.773 | 0.722 | 0.495 | 0.748 |
| SVM | 0.725 | 0.864 | 0.556 | 0.445 | 0.710 | |
| KNN | 0.675 | 1.000 | 0.278 | 0.418 | 0.639 | |
| PLS | 0.775 | 0.773 | 0.778 | 0.548 | 0.775 | |
| D2 | RF | 0.750 | 0.773 | 0.722 | 0.495 | 0.748 |
| SVM | 0.700 | 0.864 | 0.500 | 0.395 | 0.682 | |
| KNN | 0.600 | 0.773 | 0.389 | 0.175 | 0.581 | |
| PLS | 0.750 | 0.818 | 0.667 | 0.492 | 0.742 | |
| D3 | RF | 0.800 | 0.909 | 0.667 | 0.601 | 0.788 |
| SVM | 0.775 | 0.818 | 0.722 | 0.544 | 0.770 | |
| KNN | 0.625 | 0.773 | 0.444 | 0.231 | 0.609 | |
| PLS | 0.725 | 0.818 | 0.611 | 0.441 | 0.715 | |
| D4 | RF | 0.825 | 0.864 | 0.778 | 0.646 | 0.821 |
| SVM | 0.800 | 0.864 | 0.722 | 0.595 | 0.793 | |
| KNN | 0.700 | 0.909 | 0.444 | 0.406 | 0.677 | |
| PLS | 0.850 | 0.864 | 0.833 | 0.697 | 0.849 |
D1, D2, D3 and D4 represent the AECOPD patient data of stage 1 to 4. At each stage, there are 408 samples
Performance evaluation of the MSFF on the stage data D1–D4 generated by independent test (non-cross-validation method)
| Phased data set | Model | acc | sn | sp | mcc | Curve |
|---|---|---|---|---|---|---|
| D1 | RF | 0.750 | 0.773 | 0.722 | 0.495 | 0.748 |
| SVM | 0.775 | 0.955 | 0.556 | 0.568 | 0.755 | |
| KNN | 0.575 | 0.773 | 0.333 | 0.118 | 0.553 | |
| PLS | 0.775 | 0.864 | 0.667 | 0.545 | 0.765 | |
| D2 | RF | 0.750 | 0.773 | 0.722 | 0.495 | 0.748 |
| SVM | 0.675 | 0.818 | 0.500 | 0.338 | 0.659 | |
| KNN | 0.600 | 0.818 | 0.333 | 0.174 | 0.576 | |
| PLS | 0.775 | 0.864 | 0.667 | 0.545 | 0.765 | |
| D3 | RF | 0.700 | 0.727 | 0.667 | 0.394 | 0.697 |
| SVM | 0.800 | 0.864 | 0.722 | 0.595 | 0.793 | |
| KNN | 0.600 | 0.818 | 0.333 | 0.174 | 0.576 | |
| PLS | 0.775 | 0.864 | 0.667 | 0.545 | 0.765 | |
| D4 | RF | 0.775 | 0.818 | 0.722 | 0.544 | 0.770 |
| SVM | 0.800 | 0.955 | 0.611 | 0.614 | 0.783 | |
| KNN | 0.600 | 0.818 | 0.333 | 0.174 | 0.576 | |
| PLS | 0.775 | 0.864 | 0.667 | 0.545 | 0.765 |
D1, D2, D3 and D4 represent the AECOPD patient data of stage 1 to 4. At each stage, there are 408 samples
Diagnostic performance comparison of the junior physicians and MSFF with RF classifier on the new data set
| Overall accuracy | Class | F1 score | |
|---|---|---|---|
| Internal medicine | 0.667 | Serious | 0.644 |
| Mild | 0.691 | ||
| Surgery | 0.632 | Serious | 0.635 |
| Mild | 0.625 | ||
| Emergency | 0.644 | Serious | 0.617 |
| Mild | 0.663 | ||
| ICU | 0.701 | Serious | 0.676 |
| Mild | 0.722 | ||
| MSFF with RF classifier | 0.800 | Serious | 0.765 |
| Mild | 0.826 |
The feature of the new data is equivalent to the D4