| Literature DB >> 36071824 |
Kai Wang1, Yian Tian1, Shanshan Liu1, Zhongyuan Zhang1, Leilei Shen1, Deqian Meng1, Ju Li1.
Abstract
Background: Rapidly progressive interstitial lung disease (RP-ILD) is a significant complication that determines the prognosis of dermatomyositis (DM). Early RP-ILD diagnosis can improve screening and diagnostic efficiency and provide meaningful guidance to carry out early and aggressive treatment.Entities:
Keywords: dermatomyositis; extreme gradient boosting; interstitial lung disease; least absolute shrinkage and selection operator; logistic regression; predictive model; random forest; risk factor
Year: 2022 PMID: 36071824 PMCID: PMC9444234 DOI: 10.2147/PGPM.S369556
Source DB: PubMed Journal: Pharmgenomics Pers Med ISSN: 1178-7066
Clinical Characteristic of DM with RP-ILD and DM Without RP-ILD
| DM with RP-ILD (N = 41) | DM without RP-ILD (N = 216) | ||
|---|---|---|---|
| Age (years) | 55.10 ± 9.60 | 51.92 ± 13.51 | 0.172 |
| Female (%) | 29 (70.73) | 157(72.69) | 0.798 |
| Disease duration (months) | 1.98 ± 1.40 | 12.27 ± 31.57 | |
| Muscle weakness (%) | 38 (92.68) | 180(83.33) | 0.126 |
| Gottron’s sign (%) | 29(70.73) | 124(57.41) | 0.111 |
| Heliotrope rash (%) | 18 (43.90) | 104 (48.15) | 0.618 |
| V rush (%) | 14(34.15) | 71(32.87) | 0.874 |
| Shawl sign (%) | 11(26.83) | 40(18.52) | 0.221 |
| Periungueal erythema (%) | 9(21.95) | 35(16.20) | 0.370 |
| Arthritis (%) | 13(31.71) | 67(31.02) | 0.930 |
| Mechanic’s hand (%) | 11(26.83) | 45(20.83) | 0.394 |
| Skin ulcers (%) | 5(12.20) | 26 (12.04) | 0.977 |
| ALT (U/L) | 75.51 ± 115.70 | 78.11 ± 93.87 | 0.329 |
| AST (U/L) | 99.98 ± 164.60 | 94.73 ± 126.70 | 0.611 |
| LDH (U/L) | 481.66 ± 321.15 | 386.82 ± 252.44 | |
| CK (U/L) | 125.98 ± 168.04 | 474.93 ± 1706.02 | 0.785 |
| ESR (mm/h) | 47.85 ± 21.72 | 36.56 ± 21.47 | |
| CRP (mg/L) | 18.51 ± 28.29 | 10.03 ± 19.57 | |
| Serum ferritin (ng/mL) | 960.12 ± 515.81 | 790.99 ± 1223.41 | |
| ANA positive (%) | 25(60.98) | 113(52.31) | 0.308 |
| Anti-Ro-52 antibody positive (%) | 31(75.61) | 80(37.04) | |
| Anti-ARS antibody positive (%) | 3(7.32) | 25(11.57) | 0.423 |
| Anti-MDA5 antibody positive (%) | 33(80.49) | 67(31.02) |
Notes: Bold values are statistically significant (P < 0.05).
Abbreviations: DM, dermatomyositis; RP-ILD, rapidly progressive interstitial lung disease; ALT, glutamate transaminase; AST, aspartate transaminase; LDH, lactate dehydrogenase; CK, creatine kinase; ESR, erythrocyte sedimentation rate; CRP, C-reactive protein; SF, serum ferritin; ANA, antinuclear antibody; anti-ARS antibodies, anti-aminoacyl-tRNA synthetase antibodies; anti-MDA5 antibody, anti-melanoma differentiation-associated gene 5 antibody.
Univariate Logistic Regression Analysis of Clinical and Laboratory Characteristics
| Variables | OR | 2.5% CI | 97.5% CI | |
|---|---|---|---|---|
| Sex | 1.10 | 0.53 | 2.30 | 0.798 |
| Age at onset | 1.02 | 0.99 | 1.05 | 0.152 |
| Disease duration | 0.78 | 0.66 | 0.92 | 0.004 |
| Muscle weakness | 2.53 | 0.74 | 8.66 | 0.138 |
| Gottron’s sign | 1.79 | 0.87 | 3.70 | 0.114 |
| Heliotrope rash | 0.84 | 0.43 | 1.65 | 0.618 |
| V rush | 1.06 | 0.52 | 2.14 | 0.874 |
| Shawl sign | 1.61 | 0.75 | 3.49 | 0.224 |
| Periungueal erythema | 1.45 | 0.64 | 3.31 | 0.372 |
| Arthritis | 1.03 | 0.50 | 2.12 | 0.93 |
| Mechanic’s hand | 1.39 | 0.65 | 2.99 | 0.395 |
| Skin ulcers | 1.01 | 0.37 | 2.82 | 0.977 |
| ALT | 1.00 | 1.00 | 1.00 | 0.878 |
| AST | 1.00 | 1.00 | 1.00 | 0.815 |
| LDH | 1.00 | 1.00 | 1.00 | |
| CK | 1.00 | 1.00 | 1.00 | 0.261 |
| ESR | 1.02 | 1.01 | 1.04 | |
| CRP | 1.01 | 1.00 | 1.03 | |
| Serum ferritin | 1.00 | 1.00 | 1.00 | 0.406 |
| ANA | 1.42 | 0.72 | 2.82 | 0.309 |
| Anti-Ro-52 antibody | 5.27 | 2.45 | 11.32 | |
| Anti-ARS antibody | 0.60 | 0.17 | 2.10 | 0.427 |
| Anti-MDA5 antibody | 9.17 | 4.02 | 20.92 |
Notes: Bold values are statistically significant (P < 0.05).
Abbreviations: OR, odds ratio; CI, confidence interval; ALT, glutamate transaminase; AST, aspartate transaminase; LDH, lactate dehydrogenase; CK, creatine kinase; ESR, erythrocyte sedimentation rate; CRP, C-reactive protein; SF, serum ferritin; ANA, antinuclear antibody; anti-ARS antibodies, anti-aminoacyl-tRNA synthetase antibodies; anti-MDA5 antibody, anti-melanoma differentiation-associated gene 5 antibody.
Figure 1Different machine learning methods on 23 variables. (A) ROC curve of LASSO on the test set. (B) ROC curve of RF on the test set. (C) ROC curve of XGBoost on the test set. (D) Important variables screening of LASSO. (E) Important variable screening of RF. (F) Important variable screening of XGBoost. (G) Venn diagram of machine learning methods.
Performance Summary of Different Machine Learning Models Based on 23 Variables
| Accuracy | Recall | NPV | PPV | F1-Score | |
|---|---|---|---|---|---|
| LASSO on test set | 0.842 | 0.417 | 0.894 | 0.500 | 0.455 |
| RF on test set | 0.855 | 0.083 | 0.853 | 1.000 | 0.154 |
| XGBoost on test set | 0.829 | 0.881 | 0.333 | 0.922 | 0.901 |
Abbreviations: LASSO, least absolute shrinkage and selection operator; RF, random; XGBoost, extreme gradient boosting; NPV, negative predictive value; PPV, positive predictive value.
Figure 2ROC curve of prediction models based on 5 variables. (A) ROC curve of LASSO on the test set. (B) ROC curve of RF on the test set. (C) ROC curve of XGBoost on the test set. (D) ROC curve of LASSO on the validation set. (E) ROC curve of RF on the validation set. (F) ROC curve of XGBoost on the validation set.
Performance Summary of Different Machine Learning Models Based on 5 Variables
| Accuracy | Recall | NPV | PPV | F1-Score | |
|---|---|---|---|---|---|
| LASSO on test set | 0.828 | 0.417 | 0.892 | 0.455 | 0.435 |
| LASSO on validation set | 0.741 | 0.800 | 0.941 | 0.400 | 0.533 |
| RF on test set | 0.895 | 0.333 | 0.889 | 1.000 | 0.500 |
| RF on validation set | 0.704 | 0.600 | 0.889 | 0.333 | 0.429 |
| XGBoost on test set | 0.894 | 0.900 | 0.417 | 0.984 | 0.940 |
| XGBoost on validation set | 0.852 | 1.000 | 1.000 | 0.555 | 0.714 |
Abbreviations: LASSO, least absolute shrinkage and selection operator; RF, random; XGBoost, extreme gradient boosting; NPV, negative predictive value; PPV, positive predictive value.