| Literature DB >> 34429335 |
Lindsay N Helget1,2, David J Dillon1, Bethany Wolf3, Laura P Parks1, Sally E Self4, Evelyn T Bruner4, Evan E Oates5, Jim C Oates6,7.
Abstract
OBJECTIVE: Lupus nephritis (LN) is an immune complex-mediated glomerular and tubulointerstitial disease in patients with SLE. Prediction of outcomes at the onset of LN diagnosis can guide decisions regarding intensity of monitoring and therapy for treatment success. Currently, no machine learning model of outcomes exists. Several outcomes modelling works have used univariate or linear modelling but were limited by the disease heterogeneity. We hypothesised that a combination of renal pathology results and routine clinical laboratory data could be used to develop and to cross-validate a clinically meaningful machine learning early decision support tool that predicts LN outcomes at approximately 1 year.Entities:
Keywords: health care; lupus erythematosus; lupus nephritis; outcome assessment; systemic
Mesh:
Year: 2021 PMID: 34429335 PMCID: PMC8386213 DOI: 10.1136/lupus-2021-000489
Source DB: PubMed Journal: Lupus Sci Med ISSN: 2053-8790
Baseline characteristics by response status at approximately 1 year
| Non-responder (n=42) | Responder (n=41) | P value | |
| Age at biopsy, years, mean (SD) | 32 (11) | 31 (9.6) | 0.455 |
| Sex, female | 39 (91) | 39 (89) | 1.000 |
| Race, black | 41 (95) | 38 (86) | 0.266 |
| Urine protein-to-creatinine ratio (g/g) | 2.7 (4.8) | 1.31 (3.7) | 0.018 |
| eGFR | 92 (76) | 106 (77) | 0.534 |
| Serum C3 (mg/dL), mean (SD) | 65 (30) | 71 (29) | 0.554 |
| C4, mean (SD) | 15 (8.6) | 16 (7.3) | 0.761 |
| Anti-dsDNA antibodies | 176 (162) | 119 (189) | 0.256 |
| Serum creatinine | 1.0 (1.3) | 0.90 (0.70) | 0.269 |
| White blood cell count | 5.8 (4.4) | 6.5 (6.1) | 0.504 |
| Platelets | 251 (123) | 247 (113) | 0.745 |
| Haemoglobin, mean (SD) | 10 (1.4) | 10 (1.4) | 0.656 |
| Albumin, mean (SD) | 2.4 (0.8) | 2.6 (0.6) | 0.149 |
| Activity score | 38, 5 (5) | 35, 3 (6) | 0.091 |
| Chronicity score | 36, 4 (5) | 34, 2 (2) | 0.004 |
| Interstitial fibrosis | 34, 2 (1.) | 24, 1 (1) | 0.001 |
| Interstitial inflammation | 30, 1 (1) | 29, 1 (1) | 0.978 |
| Glomeruli | 42, 18 (13) | 41, 19 (11) | 0.539 |
| Crescents | 15, 0 (2) | 20, 0 (2) | 0.382 |
| Crescent:Glomeruli ratio | 15, 0 (0.2) | 20, 0 (0.08) | 0.261 |
| Necrosis | 1 (1) | 2 (1) | 0.703 |
| Proliferative disease, yes | 37 (86) | 32 (73) | 0.125 |
| Membranous disease, yes | 23 (53) | 20 (46) | 0.454 |
| Mesangial disease, yes | 3 (7) | 6 (14) | 0.484 |
| Thrombotic microangiopathy (#, %) | 2 (5) | 1 (2) | 1.000 |
| Prednisone, yes | 36 (86) | 38 (93) | 0.307 |
| Hydroxychloroquine, yes | 34 (81) | 34 (83) | 0.815 |
| Mycophenolate/Mycophenolic acid, yes | 28 (67) | 32 (78) | 0.367 |
| Cyclophosphamide, yes | 14 (33) | 9 (22) | 0.361 |
| Rituximab, yes | 6 (14) | 1 (2.4) | 0.109 |
| Azathioprine, yes | 7 (17) | 6 (15) | 0.799 |
|
| |||
| 0 | 0 (0) | 0 (0) | |
| 1–2 | 12 (29) | 10 (24) | 0.403 |
| ≥3 | 30 (71) | 31 (76) | |
dsDNA, double stranded DNA; eGFR, estimated glomerular filtration rate.
Univariate cvAUC for the subset of seven predictors selected for inclusion in the models
| Variable | cvAUC |
| Activity score | 0.591 |
| Chronicity score | 0.660 |
| Interstitial fibrosis | 0.643 |
| Interstitial inflammation | 0.482 |
| Urine protein-to-creatinine ratio | 0.503 |
| White blood cell | 0.511 |
| Haemoglobin | 0.524 |
AUC, area under the curve; cvAUC, cross-validated AUC.
Prediction performance and variables selected for each model for the five models with a cvAUC >0.75
| Model | cvAUC | Activity score | Chronicity | Interstitial fibrosis | Interstitial inflammation | Urine protein-to-creatinine ratio | WBC | Hgb | # of predictors |
| LR | 0.780 | X | X | X | X | X | 5 | ||
| RF | 0.800 | X | X | X | X | 4 | |||
| SVML | 0.783 | X | X | X | X | 4 | |||
| SVMR | 0.790 | X | X | X | X | X | X | 6 | |
| ANN | 0.775 | X | X | X | X | X | 5 |
ANN, artificial neural networks; AUC, area under the curve; cvAUC, cross-validated AUC; Hgb, haemoglobin; LR, logistic regression; RF, random forest; WBC, white blood cell.
Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for different cut-offs for probability of non-response at 1 year based on the predicted probability of non-response for each model and the average predicted probability across the five models
| Model | Cut-off | Sensitivity | Specificity | PPV | NPV |
| 0.3 | 0.86 | 0.48 | 0.62 | 0.80 | |
| LR | 0.5 | 0.65 | 0.62 | 0.67 | 0.67 |
| 0.7 | 0.37 | 0.91 | 0.80 | 0.60 | |
| 0.3 | 0.91 | 0.36 | 0.58 | 0.80 | |
| RF | 0.5 | 0.74 | 0.61 | 0.65 | 0.71 |
| 0.7 | 0.42 | 0.86 | 0.75 | 0.60 | |
| 0.3 | 0.93 | 0.27 | 0.56 | 0.80 | |
| SVML | 0.5 | 0.67 | 0.68 | 0.67 | 0.68 |
| 0.7 | 0.19 | 0.93 | 0.73 | 0.54 | |
| 0.3 | 0.93 | 0.30 | 0.56 | 0.81 | |
| SVMR | 0.5 | 0.70 | 0.61 | 0.64 | 0.68 |
| 0.7 | 0.37 | 0.95 | 0.89 | 0.61 | |
| 0.3 | 0.81 | 0.48 | 0.60 | 0.72 | |
| ANN | 0.5 | 0.65 | 0.68 | 0.67 | 0.67 |
| 0.7 | 0.33 | 0.84 | 0.67 | 0.56 | |
| Average | 0.3 | 0.93 | 0.30 | 0.56 | 0.85 |
| prediction across | 0.5 | 0.70 | 0.73 | 0.71 | 0.71 |
| the five models | 0.7 | 0.30 | 0.95 | 0.87 | 0.58 |
ANN, artificial neural networks; LR, logistic regression; RF, random forest.
Figure 1Cross-validation area under the curve (cvAUCs) for each of the final machine-learning models. A summary of probability scores from all models in responders and non-responders (A); cvAUCs depicted for logistic regression (B), random forest, (C) SVM linear, (D) SVM Gaussian (E) and artificial neural network (F) models.
Figure 2Mean model sensitivity and specificity based on chosen prediction threshold. The mean of all model predictions was used to determine the performance of the model at select thresholds. The sensitivity (black line) and specificity (grey line) are depicted on the y-axis for each threshold (reported on the x-axis).