| Literature DB >> 30863010 |
Edsel B Ing1, Neil R Miller2, Angeline Nguyen2, Wanhua Su3, Lulu L C D Bursztyn4, Meredith Poole5, Vinay Kansal6, Andrew Toren7, Dana Albreki8, Jack G Mouhanna9, Alla Muladzanov10, Mikaël Bernier11, Mark Gans10, Dongho Lee12, Colten Wendel13, Claire Sheldon13, Marc Shields14, Lorne Bellan15, Matthew Lee-Wing15, Yasaman Mohadjer16, Navdeep Nijhawan1, Felix Tyndel17, Arun N E Sundaram17, Martin W Ten Hove18, John J Chen19, Amadeo R Rodriguez20, Angela Hu21, Nader Khalidi21, Royce Ing22, Samuel W K Wong23, Nurhan Torun24.
Abstract
PURPOSE: To develop and validate neural network (NN) vs logistic regression (LR) diagnostic prediction models in patients with suspected giant cell arteritis (GCA). Design: Multicenter retrospective chart review.Entities:
Keywords: giant cell arteritis; logistic regression; neural network; ophthalmology; prediction models; rheumatology; temporal artery biopsy
Year: 2019 PMID: 30863010 PMCID: PMC6388759 DOI: 10.2147/OPTH.S193460
Source DB: PubMed Journal: Clin Ophthalmol ISSN: 1177-5467
Figure 1Neural network design.
Notes: This neural network had ten input variables: age, sex, headache, clinical temporal artery abnormality, jaw claudication, vision loss, diplopia, ESR, C-reactive protein, and platelets. There was one hidden layer with four nodes (H1_1 to H1_4), each of which used the hyperbolic tangent activation function. The output was the biopsy result (bx_result). The equations and weights used for the neural network risk score are shown on the right.
Abbreviations: CRP_ULN, C-reactive protein divided by upper limit of normal of each lab; ESR, erythrocyte sedimentation rate; TAabn, temporal artery tenderness, pulselessness, or nodularity.
Characteristics of subjects with positive vs negative temporal artery biopsy
| Factor | Negative biopsy | Positive biopsy | Value range | |
|---|---|---|---|---|
| n | 1,368 | 465 | ||
| Age, years, µ (SD) | 72.8 (10.4) | 77.2 (8.2) | <0.001 | 38, 98 |
| Female | 933 (68.7%) | 329 (71.2%) | 0.31 | |
| Headache, new onset | 957 (73.3%) | 313 (74.5%) | 0.61 | |
| TAabn | 441 (34.3%) | 193 (46.6%) | <0.001 | |
| Jaw claudication | 257 (19.9%) | 215 (49.8%) | <0.001 | |
| Vision loss | 235 (18.1%) | 126 (29.5%) | <0.001 | |
| Diplopia | 105 (8.1%) | 47 (11.0%) | 0.071 | |
| ESR, µ (SD) | 41.2 (30.1) | 55.2 (30.1) | <0.001 | 0.01, 224 |
| CRP, µ (SD) | 5.7 (12.1) | 11.6 (14.4) | <0.001 | 0.01, 212 |
| Platelets, ×109/L, µ (SD) | 282.6 (104.9) | 371.8 (142.9) | <0.001 | 27, 1,199 |
| Biopsy length, cm, µ (SD) | 2.3 (1.0) | 2.3 (.90) | 0.24 | 0.3, 7.5 |
Abbreviations: CRP, C-reactive protein divided by upper limit of normal for each lab; ESR, erythrocyte sedimentation rate; µ, mean; n, number of subjects; TAabn, tenderness or decreased pulsation of temporal artery.
Documented causes of vision loss
| Details | Negative TABx | Positive TABx |
|---|---|---|
| n | 112 | 59 |
| AION | 64 (57%) | 49 (83%) |
| PION | 10 (9%) | 3 (5%) |
| CRAO/BRAO | 26 (23%) | 6 (10%) |
| Stroke | 10 (9%) | |
| CRVO | 1 (~1%) | 1 (~2%) |
| CAR | 1 (~1%) |
Abbreviations: AION, anterior ischemic optic neuropathy; BRAO, branch retinal artery occlusion; CAR, cancer-associated retinopathy; CRAO, central retinal artery occlusion; CRVO, central retinal vein occlusion; n, number of subjects; PION, posterior ischemic optic neuropathy; TABx, temporal artery biopsy.
Multivariable logistic regression for the outcome of a positive temporal artery biopsy with complete-case analysis
| Variables | OR | 95% CI, OR | |
|---|---|---|---|
| Age | 1.060 | <0.001 | 1.036, 1.085 |
| Female | 0.923 | 0.686 | 0.627, 1.359 |
| Headache | 1.540 | 0.035 | 1.030, 2.301 |
| TAabn | 1.466 | 0.019 | 1.064, 2.017 |
| Jaw claud | 3.398 | <0.001 | 2.314, 4.991 |
| Vision loss | 2.611 | 0.005 | 1.327, 5.138 |
| Diplopia | 1.127 | 0.606 | 0.714, 1.780 |
| log(ESR) | 1.200 | 0.043 | 1.005, 1.433 |
| log(CRP/ULN) | 1.370 | <0.001 | 1.246, 1.507 |
| Platelets | 1.005 | <0.001 | 1.003, 1.006 |
| Constant | 0.000 |
Notes: n=1,201; McFaddens R2=0.243, log pseudolikelihood=510.985.
Abbreviations: CRP/ULN, C-reactive protein divided by upper limit normal of each lab; ESR, erythrocyte sedimentation rate; Jaw Claud, jaw claudication; log, natural logarithm; R2, pseudo R square; TAabn, clinical temporal artery abnormality.
Comparison of model performance: logistic regression vs neural network with CCA and MDA on the test (Holdout) set
| Model | Logistic regression (CCA) | Neural network (CCA) | Logistic regression (MDA) | Neural network (MDA) |
|---|---|---|---|---|
| Sensitivity | 0.525 | 0.695 | 0.531 | 0.602 |
| Specificity | 0.951 | 0.891 | 0.904 | 0.838 |
| PLR | 10.610 | 6.380 | 5.500 | 3.710 |
| NLR | 0.500 | 0.340 | 0.520 | 0.480 |
| PPV | 0.861 | 0.789 | 0.732 | 0.648 |
| NPV | 0.774 | 0.833 | 0.794 | 0.809 |
| Accuracy | 0.794 | 0.819 | 0.780 | 0.760 |
| MCR | 0.206 | 0.181 | 0.220 | 0.241 |
| FNR | 0.475 | 0.305 | 0.469 | 0.398 |
| Calibration H–L | 0.119 | 0.805 | 0.420 | 0.987 |
| Calibration plot |
|
|
|
|
| Discrimination ( | 0.867 | 0.860 | 0.827 | 0.809 |
| Brier score | 0.148 | 0.143 | 0.153 | 0.162 |
| Generalized | 0.446 | 0.458 | 0.373 | 0.337 |
| Decision curve analysis |
|
|
|
|
Abbreviations: c, concordance statistic or the area under receiving operating curve; CCA, complete-case analysis; FNR, false-negative rate; H–L P, probability of Hosmer–Lemeshow test (calibration is acceptable if P>0.05); MCR, misclassification rate; MDA, missing data analysis; n, number of subjects; NLR, negative likelihood ratio; NPV, negative predictive value; PLR, positive likelihood ratio; PPV, positive predictive value; R2, square or percent of variance explained by the model; ROC, receiver operating characteristic.
Figure 2Receiver operating characteristic curves for the LR and NN models.
Notes: Both the NN and LR models had good discrimination. The solid lines represent the CCA and the dotted lines represent the MDA. The darker lines are the NN, and the lighter ones the LR.
Abbreviations: AUC, area under the curve; CCA, complete-case analysis; LR, logistic regression; MDA, missing data analysis; NN, neural network.
Figure 3Boxplot of predicted risk scores for the neural network and logistic regression models for the positive and negative temporal artery biopsy groups.
Notes: Risk scores ≥0.5 predict a positive temporal artery biopsy result. The horizontal line inside the box is the median value. The lower hinge of the box is the 25th percentile and the upper hinge of the box is the 75th percentile. The dots indicate the outliers with high-risk scores in the negative biopsy group.
Abbreviations: LR_model, logistic regression prediction model; NN_model, neural network prediction model.