| Literature DB >> 34081620 |
Fangran Xin1, Lingyu Fu1,2, Bowen Yang2, Haina Liu3, Tingting Wei1, Cunlu Zou4, Bingqing Bai1.
Abstract
We developed and validated a nomogram to predict the risk of stroke in patients with rheumatoid arthritis (RA) in northern China. Out of six machine learning algorithms studied to improve diagnostic and prognostic accuracy of the prediction model, the logistic regression algorithm showed high performance in terms of calibration and decision curve analysis. The nomogram included stratifications of sex, age, systolic blood pressure, C-reactive protein, erythrocyte sedimentation rate, total cholesterol, and low-density lipoprotein cholesterol along with the history of traditional risk factors such as hypertensive, diabetes, atrial fibrillation, and coronary heart disease. The nomogram exhibited a high Hosmer-Lemeshow goodness-for-fit and good calibration (P > 0.05). The analysis, including the area under the receiver operating characteristic curve, the net reclassification index, the integrated discrimination improvement, and clinical use, showed that our prediction model was more accurate than the Framingham risk model in predicting stroke risk in RA patients. In conclusion, the nomogram can be used for individualized preoperative prediction of stroke risk in RA patients.Entities:
Keywords: development and validation nomogram; inflammatory markers; lipids; rheumatoid arthritis; stroke
Year: 2021 PMID: 34081620 PMCID: PMC8221354 DOI: 10.18632/aging.203071
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Participants’ characteristics in primary and validation cohorts.
| RA with stroke | 218 (16.10) | 95 (16.35) | 0.019 | 0.891 |
| Sex, female | 1021 (75.41) | 428 (73.67) | 0.655 | 0.419 |
| Age, year | ||||
| 18–65 | 757 (55.91) | 308 (53.01) | 1.386 | 0.500 |
| 66–79 | 426 (31.46) | 194 (33.39) | ||
| ≥80 | 171 (12.63) | 79 (13.60) | ||
| SBP, mm Hg | ||||
| <120 | 422 (31.17) | 194 (33.39) | 7.511 | 0.111 |
| 120–139 | 592 (43.72) | 240 (41.31) | ||
| 140–159 | 287 (21.20) | 114 (19.62) | ||
| 160–179 | 46 (3.40) | 24 (4.13) | ||
| ≥180 | 7 (0.52) | 9 (1.55) | ||
| Smoking | 176 (13.00) | 84 (14.46) | 0.744 | 0.388 |
| Diabetes | 190 (14.03) | 97 (16.70) | 2.282 | 0.131 |
| CHD | 208 (15.36) | 104 (17.90) | 1.937 | 0.164 |
| AF | 40 (2.95) | 17 (2.93) | 0.001 | 0.973 |
| LVH* | 1 (0.07) | 1 (0.17) | - | 0.510 |
| CVD | 436 (32.20) | 198 (34.08) | 0.651 | 0.420 |
| Hypertension | 240 (17.73) | 153 (26.33) | 18.615 | <0.001 |
| Bio-med | 22 (1.62) | 12 (2.07) | 0.457 | 0.499 |
| CCP+ | 827 (61.08) | 337 (58.00) | 1.604 | 0.205 |
| RF+ | 908 (67.06) | 381 (65.58) | 0.403 | 0.526 |
| CRP, mg/L | ||||
| <10 | 319 (23.56) | 157 (27.02) | 3.769 | 0.152 |
| ≥9.06nd<64.32 | 670 (49.48) | 287 (49.40) | ||
| ≥64.32 | 365 (26.96) | 137 (23.58) | ||
| ESR, mm/H | ||||
| <29 | 351 (25.92) | 137 (23.58) | 2.368 | 0.306 |
| ≥29nd<84.80 | 662 (48.89) | 280 (48.19) | ||
| ≥84.8 | 341 (25.18) | 164 (28.23) | ||
| C3, g/L | ||||
| <0.95 | 345 (25.48) | 155 (26.68) | 0.890 | 0.641 |
| ≥0.95nd<1.34 | 673 (49.7) | 293 (50.43) | ||
| ≥1.34 | 336 (24.82) | 133 (22.89) | ||
| C4, g/L | ||||
| <0.18 | 378 (27.92) | 146 (25.13) | 5.450 | 0.066 |
| ≥0.18nd<0.28 | 656 (48.45) | 315 (54.22) | ||
| ≥0.28 | 320 (23.63) | 120 (20.65) | ||
| FBG, mmol/L | ||||
| <4.84 | 336 (24.82) | 154 (26.51) | 2.052 | 0.358 |
| ≥4.84nd<6.33 | 670 (49.48) | 295 (50.77) | ||
| ≥6.33 | 348 (25.70) | 132 (22.72) | ||
| TC, mmol/L | ||||
| <5.2 | 1131 (83.53) | 479 (82.44) | 6.259 | 0.044 |
| ≥5.2nd<66.2 | 163 (12.04) | 61 (10.50) | ||
| ≥6.2 | 60 (4.43) | 41 (7.06) | ||
| LDL, mmol/L | ||||
| <3.4 | 1106 (81.68) | 463 (79.69) | 6.278 | 0.043 |
| ≥3.4nd<4.1 | 182 (13.44) | 73 (12.56) | ||
| ≥4.1 | 66 (4.87) | 45 (7.75) | ||
| HDL, mmol/L | ||||
| ≥1.55 | 125 (9.23) | 46 (7.92) | 4.119 | 0.128 |
| ≥1.04nd<1.55 | 570 (42.10) | 273 (46.99) | ||
| <1.04 | 659 (48.67) | 262 (45.09) | ||
| TG, mmol/L | ||||
| <1.7 | 1,113 (82.20) | 460 (79.17) | 2.697 | 0.260 |
| ≥1.7nd<2.3 | 139 (10.27) | 73 (12.56) | ||
| ≥2.3 | 102 (7.53) | 48 (8.26) | ||
Data are represented as numbers and proportions. Statistics were calculated using the chi-square test. *Statistics were calculated by Fisher’s exact test. Abbreviations: RA: rheumatoid arthritis; SBP: systolic blood pressure; CHD: coronary heart disease; AF: atrial fibrillation; LVH: left ventricular hypertrophy; CVD: cardiovascular disease; Bio-med: biologic disease-modifying anti-rheumatic drugs; CCP+: positive anti-cyclic citrullinated peptide antibody; RF+: positive rheumatoid factor; CRP: C-reactive protein; ESR: erythrocyte sedimentation rate; C3: complement 3; C4: complement 4; FBG: fasting blood glucose; TC: total cholesterol; LDL: low-density lipoprotein; HDL: high-density lipoprotein; TG: triglycerides.
Univariate logistic regression analysis of RA patients developing stroke in the primary cohort.
| Sex, female vs. male | 873 (76.85) | 148 (67.89) | 0.64 (0.46–0.88) | 0.005 |
| Age, year | <0.001 | |||
| 66–79 vs. 18–65 | 336 (29.58) | 90 (41.28) | 2.90 (2.05–4.10) | |
| ≥80 vs. 18–65 | 107 (9.42) | 64 (29.36) | 6.48 (4.33–9.68) | |
| SBP, mmHg | <0.001 | |||
| 120–139 vs.<120 | 503 (44.28) | 89 (40.83) | 1.45 (0.99–2.12) | |
| 140–159 vs.<120 | 224 (19.72) | 63 (28.90) | 2.30 (1.52–3.48) | |
| 160–179 vs.<120 | 29 (2.55) | 17 (7.80) | 4.79 (2.45–9.39) | |
| ≥180 vs.<120 | 4 (0.35) | 3 (1.38) | 6.13 (1.33–28.25) | |
| Smoking | 145 (12.76) | 31 (14.22) | 1.13 (0.75–1.72) | 0.558 |
| Diabetes | 142 (12.50) | 48 (22.02) | 1.98 (1.37–2.85) | <0.001 |
| CVD* | 218 (19.19) | 218 (100) | – | <0.001 |
| CHD | 130 (11.44) | 78 (35.78) | 4.31 (3.09–6.01) | <0.001 |
| AF | 22 (1.94) | 18 (8.26) | 4.56 (2.40–8.65) | <0.001 |
| LVH* | 0 (0) | 1 (0.46) | - | 0.161 |
| Hypertension | 164 (14.44) | 76 (34.86) | 3.17 (2.29–4.39) | <0.001 |
| Bio-med | 21 (1.85) | 1 (0.46) | 0.25 (0.03–1.83) | 0.236 |
| CCP+ | 698 (61.44) | 129 (59.17) | 0.91 (0.68–1.22) | 0.529 |
| RF+ | 766 (67.43) | 142 (65.14) | 0.90 (0.67–1.24) | 0.510 |
| CRP, mg/L | 0.007 | |||
| ≥9.06 and <64.32 vs. <10 | 570 (50.18) | 100 (45.87) | 1.19 (0.81, 1.76) | |
| ≥64.32 vs. <10 | 288 (25.35) | 77 (35.32) | 1.81 (1.20–2.74) | |
| ESR, mm/H | 0.037 | |||
| ≥29 and <84.80 vs. <29 | 553 (48.68) | 109 (50) | 0.58 (0.38–0.88) | |
| ≥84.8 vs. <29 | 275 (24.21) | 66 (30.28) | 0.82 (0.59–1.15) | |
| C3, g/L | 0.516 | |||
| ≥0.95 and <1.34 vs. <0.95 | 567 (49.91) | 106 (48.62) | 0.85 (0.61–1.20) | |
| ≥1.34 vs. <0.95 | 286 (25.18) | 50 (22.94) | 0.80 (0.53–1.20) | |
| C4, g/L | 0.786 | |||
| ≥0.18 and <0.28 vs. <0.18 | 550 (48.42) | 106 (48.62) | 0.95 (0.67–1.33) | |
| ≥0.28 vs. <0.18 | 272 (23.94) | 48 (22.02) | 0.87 (0.58–1.30) | |
| FBG, mmol/L | 0.652 | |||
| ≥4.84 and <6.33 vs. <4.84 | 556 (48.94) | 114 (52.29) | 1.12 (0.78–1.60) | |
| ≥6.33 vs. <4.84 | 296 (26.06) | 52 (23.85) | 0.96 (0.63–1.46) | |
| TC, mmol/L | 0.038 | |||
| ≥5.2 and <66.2 vs. <5.2 | 143 (12.59) | 20 (9.17) | 0.73 (0.45–1.20) | |
| ≥6.2 vs. <5.2 | 44 (3.87) | 16 (7.34) | 1.90 (1.05–3.43) | |
| LDL, mmol/L | 0.004 | |||
| ≥3.4 and<4.1 vs. <3.4 | 138 (12.15) | 44 (20.18) | 1.87 (1.28–2.73) | |
| ≥4.1 vs. <3.4 | 53 (4.67) | 13 (5.96) | 1.44 (0.77–2.70) | |
| HDL, mmol/L | 0.774 | |||
| ≥1.04 and <1.55 vs. ≥1.55 | 483 (42.52) | 87 (39.91) | 0.89 (0.53–1.50) | |
| <1.04 vs. ≥1.55 | 549 (48.33) | 110 (50.46) | 0.99 (0.60–1.66) | |
| TG, mmol/L | 0.186 | |||
| ≥1.7 and <2.3 vs. <1.7 | 114 (10.04) | 25 (11.47) | 1.11 (0.70–1.77) | |
| ≥2.3 vs. <1.7 | 92 (8.10) | 10 (4.59) | 0.55 (0.28–1.08) |
Data are represented as numbers and proportions. Statistics were conducted using univariate logistic regression. Abbreviations: RA: rheumatoid arthritis; OR (95% CI), odds ratio, 95% confidence interval; SBP: systolic blood pressure; CHD: coronary heart disease; AF: atrial fibrillation; LVH: left ventricular hypertrophy; CVD: cardiovascular disease; Bio-med: biologic disease-modifying anti-rheumatic drugs; CCP+: positive anti-cyclic citrullinated peptide antibody; RF+: positive rheumatoid factor; CRP: C-reactive protein; ESR: erythrocyte sedimentation rate; C3: complement 3; C4: complement 4; FBG: fasting blood glucose; TC: total cholesterol; LDL: low-density lipoprotein; HDL: high-density lipoprotein; TG: triglycerides.
Figure 1Multivariate logistic regression analysis of data from RA patients developing stroke in the primary cohort ( Abbreviations: SBP: systolic blood pressure; CHD: coronary heart disease; AF: atrial fibrillation; CRP: C-reactive protein; ESR: erythrocyte sedimentation rate; TC: total cholesterol; LDL: low-density lipoprotein; OR (95% CI): odds ratio, 95% confidence interval.
Figure 2Model evaluation (F1-score) results based on the number of features across six models. (A) primary cohort, N = 1,354 patients; (B) validation cohort, N = 581 patients). Abbreviations: GBDT: gradient boosting decision tree; KNN: k-nearest neighbors; LR: logistic regression; RF: random forest; XGB: XGBoost; SVM: Support Vector Machine.
Figure 3A developed stroke nomogram in the primary cohort ( Abbreviations: SBP: systolic blood pressure; CHD: coronary heart disease; AF: atrial fibrillation; CRP: C-reactive protein; ESR: erythrocyte sedimentation rate; TC: total cholesterol; LDL: low-density lipoprotein. For example, a 70-year-old (47 points), male (30 points) RA patient with an AF (55 points) and CHD (62 points) history of 60 mm/H ESR (27 points), and 5 mmol/L TC (65 points) arrived at a total point value of 286, with a probability of 46% of developing a stroke.
Figure 4Calibration curves of (A) complex model in the primary cohort (N = 1,354), (B) simple model in the primary cohort (N = 1,354), (C) complex model in the validation cohort (N = 581), and (D) simple model in the validation cohort (N = 581). Calibration curves depicted the calibration of each model in an agreement between the predicted risks of stroke and observed outcomes of stroke. The y-axis represents the actual stroke. The x-axis represents the predicted stroke risk. The diagonal gray line represents the perfect prediction by an ideal model. The dotted line represents the performance of the nonparametric nomogram, of which a closer fit to the diagonal gray line represents a better prediction.
Performance and internal validation of stroke nomogram.
| Hosmer–Lemeshow test | |||
| NA | 8.517 | 13.456 | |
| NA | 0.385 | 0.097 | |
| AUC (95% CI) | 0.808 (0.778, 0.839)*& | 0.747 (0.711, 0.784)*# | 0.784 (0.750, 0.818)#& |
| NRI (95% CI) | ref. | 11.59 (2.90, 20.29) | 20.30 (12.54, 28.05) |
| IDI (95% CI) | ref. | 1.71 (−0.77, 4.18) | 5.65 (3.41, 7.88) |
*P = 0.0013, significant associations between Framingham risk and simple models; #P = 0.0016, significant associations between complex and simple models; &P = 0.0631, significant associations between complex and Framingham risk models. Abbreviations: AUC: area under the receiver operating characteristic curve; NRI: net reclassification index; IDI: integrated discrimination improvement; NA: not available; ref.: reference level.