| Literature DB >> 35385045 |
Alexandre Lachance1,2, Mathieu Godbout3, Fares Antaki4, Mélanie Hébert1,2, Serge Bourgault1,2, Mathieu Caissie1,2, Éric Tourville1,2, Audrey Durand3,5, Ali Dirani1,2.
Abstract
Purpose: The purpose of this study was to assess the feasibility of deep learning (DL) methods to enhance the prediction of visual acuity (VA) improvement after macular hole (MH) surgery from a combined model using DL on high-definition optical coherence tomography (HD-OCT) B-scans and clinical features.Entities:
Mesh:
Year: 2022 PMID: 35385045 PMCID: PMC8994199 DOI: 10.1167/tvst.11.4.6
Source DB: PubMed Journal: Transl Vis Sci Technol ISSN: 2164-2591 Impact factor: 3.048
Figure 1.Overview of our proposed hybrid model. (Top) Illustration of the extraction of the OCT-based prediction from the trained DL model. (Bottom) Flow-chart representing the combination of clinical data and OCT-based data to predict the clinical ground truth.
Characteristics for Patients of the Different Splits in the Data Set
| Training Set ( | Test Set ( | Total Set ( |
| |
|---|---|---|---|---|
|
| 66 ± 8 | 69 ± 7 | 67 ± 8 | 0.12 |
|
| 0.84 | |||
| Female, | 76 (73) | 12 (71) | 88 (73) | |
| Male, | 28 (27) | 5 (29) | 33 (27) | |
|
| 50 ± 15 | 51 ± 18 | 50 ± 16 | 0.83 |
|
| 357 ± 171 | 256 ± 101 | 343 ± 167 | 0.002 |
|
| 11 ± 10 | 10 ± 5 | 11 ± 9 | 0.53 |
|
| 18 (17) | 4 (24) | 22 (18) | 0.54 |
|
| 1 (1) | 0 (0) | 1 (1) | 0.68 |
|
| 66 ± 12 | 66 ± 11 | 66 ± 12 | 1.00 |
|
| 52 (50) | 8 (47) | 60 (50) | 0.83 |
|
| 0.81 | |||
| Stage 2, | 16 (15) | 3 (18) | 19 (16) | |
| Stage 3, | 64 (62) | 10 (59) | 74 (61) | |
| Stage 4, | 24 (23) | 4 (24) | 28 (23) |
MH, macular hole; CVA, corrected visual acuity; SD, standard deviation.
Comparison Between the Two Groups (CVA Gain ≥15 Letters Versus <15 Letters) in Terms of Demographic and Clinical Features
| Features | CVA Gain ≥15 Letters ( | CVA Gain <15 Letters ( | Total Set ( |
|
|---|---|---|---|---|
|
| 67 ± 15 | 67 ± 14 | 67 ± 15 | 1.00 |
|
| 0.17 | |||
|
| 47 (78) | 41 (67) | 88 (73) | |
|
| 13 (22) | 20 (33) | 33 (27) | |
|
| 42 ± 15 | 59 ± 10 | 50 ± 16 | <0.0001 |
|
| 394 ± 183 | 293 ± 131 | 343 ± 167 | 0.0008 |
|
| 11 ± 12 | 10 ± 7 | 10 ± 10 | 0.58 |
|
| 15 (25) | 7 (12) | 22 (18) | 0.05 |
|
| 0 (0) | 1 (2) | 1 (2) | 0.32 |
|
| 70 ± 9 | 62 ± 12 | 66 ± 12 | <0.0001 |
|
| 0.53 | |||
|
| 10 (17) | 9 (15) | 19 (16) | |
|
| 35 (58) | 39 (64) | 74 (61) | |
|
| 15 (25) | 13 (21) | 28 (23) | |
|
| ||||
|
| 60 (100) | 61 (100) | 121 (100) | 0.99 |
|
| 0.98 | |||
|
| 53 (88) | 54 (89) | 107 (88) | |
|
| 7 (12) | 7 (12) | 14 (12) | |
|
| 0.11 | |||
|
| 43 (72) | 51 (84) | 94 (78) | |
|
| 17 (28) | 10 (16) | 27 (22) |
MH, macular hole; ILM, internal limiting membrane; CVA, corrected visual acuity; ICG, indocyanine green; TB, trypan blue; SD, standard deviation.
Performances of the Models on the Held-Out Test and Using Cross-Validation
| Models | F1 Scores | AUROC | ACC | SP | SN | PPV | NPV |
|---|---|---|---|---|---|---|---|
|
| |||||||
| Train |
| 79.3 | 76.0 |
|
|
| 72.9 |
| Test | 79.7 | 80.6 |
| 89.7 | 72.3 |
|
|
|
| |||||||
| Train | 67.1 ± 28.9 | 77.3 ± 10.3 | 69.8 ± 3.6 | 76.9 ± 25.2 | 65.4 ± 15.6 | 57.6 ± 14.4 |
|
| Test | 61.5 ± 23.7 | 72.8 ± 14.6 | 63.9 ± 13.2 | 70.8 ± 30.2 | 60.2 ± 17.9 | 60.2 ± 15.4 | 76.9 ± 15.4 |
|
| |||||||
| Train | 78.0 ± 1.7 |
|
| 79.0 ± 16.8 |
| 74.2 ± 9.4 | 78.8 ± 7.6 |
| Test |
|
| 78.7 ± 2.9 |
| 67.8 ± 26.9 | 77.4 ± 4.3 | 80.8 ± 6.7 |
|
| |||||||
| Train |
|
|
| 84.7 ± 9.9 | 64.2 ± 19.9 | 72.6 ± 9.5 |
|
| Test |
|
|
|
| 55.4 ± 23.2 | 70.4 ± 11.0 | 86.7 ± 5.9 |
|
| |||||||
| Train | 74.0 ± 3.7 | 75.3 ± 6.9 | 73.5 ± 7.3 |
| 53.8 ± 21.5 | 66.8 ± 9.0 | 79.5 ± 7.3 |
| Test | 76.3 ± 6.8 | 74.8 ± 11.1 | 74.9 ± 10.3 | 87.3 ± 11.2 | 57.2 ± 25.8 | 70.3 ± 13.5 | 81.3 ± 14.6 |
|
| |||||||
| Train | 76.8 ± 2.6 | 82.2 ± 3.2 | 76.4 ± 5.6 | 80.6 ± 9.8 |
|
| 80.0 ± 6.0 |
| Test | 80.1 ± 7.6 | 81.7 ± 10.6 | 79.3 ± 10.7 | 92.4 ± 9.3 |
|
|
|
DL, deep learning; AUROC, area under the receiver operating characteristic curve; ACC, accuracy; SP, specificity; SN, sensitivity; PPV, positive predictive value; NPV, negative predictive value.
Best means are highlighted.
Figure 2.The receiver operating characteristic (ROC) curves on the cross-validation and held-out test set for all three models. We report the mean value and a 95% CI computed from 100 independent runs for the DL and hybrid models and 25 runs for the clinical data regression model. For each independent run, we plotted the corresponding ROC curve in a different color.
Figure 3.Visualization heatmap for prediction of the clinical ground truth. The heatmap was generated by Gradient-weighted Class Activation Mapping (Grad-CAM). The heatmap highlights the pathological area (fovea) as being most important for accurate prediction of CVA improvement after surgery in HD-OCT B-scans. (A) Original image. (B) Grad-CAM.
Figure 4.Feature importance ratio (%) attributed by the regression models with and without the addition of the OCT-based prediction. For each feature, we reported the average and standard deviation computed from 10 independent runs. The standard deviation is 0 for all features when using only clinical data because the clinical features are always strictly the same in this case. MH, macular hole; CVA, corrected visual acuity.
The Differences in the Pre-Operative and Postoperative Factors Between Eyes Predicted Correctly and Incorrectly by the Hybrid Model
| Eyes Predicted Correctly ( | Eyes Predicted Incorrectly ( |
| |
|---|---|---|---|
|
| 70 ± 7 | 66 ± 3 | 0.13 |
|
| 0.83 | ||
| Female, | 9 (69) | 3 (75) | |
| Male, | 4 (31) | 1 (25) | |
|
| 51 ± 19 | 49 ± 11 | 0.80 |
|
| 280 ± 103 | 180 ± 38 | 0.01 |
|
| 9 ± 7 | 9 ± 4 | 1.00 |
|
| 2 (15) | 2 (50) | 0.15 |
|
| 0 (0) | 0 (0) | 1.00 |
|
| 67 ± 7 | 66 ± 3 | 0.69 |
|
| 6 (46) | 2 (50) | 0.90 |
|
| 0.05 | ||
| Stage 2, | 1 (8) | 2 (50) | |
| Stage 3, | 9 (69) | 1 (25) | |
| Stage 4, | 3 (23) | 1 (25) |
MH, macular hole; CVA, corrected visual acuity; SD, standard deviation.
MH size is smaller in the eyes predicted incorrectly.
Figure 5.Horizontal HD-OCT of the three most difficult cases to predict on the held-out test with the hybrid model.
Performances of the Hybrid Model on the Held-Out Test on Unclosed Versus Closed Macular Holes After the First Vitrectomy
| Hybrid Model | F1 Scores | AUROC | ACC | SP | SN | PPV | NPV |
|---|---|---|---|---|---|---|---|
|
| 80.4 ± 7.7 | 81.9 ± 5.2 | 78.7 ± 2.9 | 91.3 ± 15.9 | 67.8 ± 26.9 | 77.4 ± 4.3 | 80.8 ± 6.7 |
|
| 77.8 ± 1.4 | 79.3 ± 3.1 | 79.4 ± 1.9 | 100.0 ± 0.0 | 72.7 ± 2.0 | 69.7 ± 1.0 | 100.0 ± 0.0 |
DL, deep learning; AUROC, area under the receiver operating characteristic curve; ACC, accuracy; SP, specificity; SN, sensibility; PPV, positive predictive value; NPV, negative predictive value.
The performances are similar which indicates that our hybrid model well generalized to MH that failed to close after the first vitrectomy.