| Literature DB >> 33805685 |
Sejong Oh1, Yuli Park2, Kyong Jin Cho2, Seong Jae Kim3.
Abstract
The aim is to develop a machine learning prediction model for the diagnosis of glaucoma and an explanation system for a specific prediction. Clinical data of the patients based on a visual field test, a retinal nerve fiber layer optical coherence tomography (RNFL OCT) test, a general examination including an intraocular pressure (IOP) measurement, and fundus photography were provided for the feature selection process. Five selected features (variables) were used to develop a machine learning prediction model. The support vector machine, C5.0, random forest, and XGboost algorithms were tested for the prediction model. The performance of the prediction models was tested with 10-fold cross-validation. Statistical charts, such as gauge, radar, and Shapley Additive Explanations (SHAP), were used to explain the prediction case. All four models achieved similarly high diagnostic performance, with accuracy values ranging from 0.903 to 0.947. The XGboost model is the best model with an accuracy of 0.947, sensitivity of 0.941, specificity of 0.950, and AUC of 0.945. Three statistical charts were established to explain the prediction based on the characteristics of the XGboost model. Higher diagnostic performance was achieved with the XGboost model. These three statistical charts can help us understand why the machine learning model produces a specific prediction result. This may be the first attempt to apply "explainable artificial intelligence" to eye disease diagnosis.Entities:
Keywords: glaucoma; machine learning; model explanation; prediction
Year: 2021 PMID: 33805685 PMCID: PMC8001225 DOI: 10.3390/diagnostics11030510
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Characteristics of the participants.
| Patient | Normal | Glaucoma | Total | |
|---|---|---|---|---|
| Number of participants | 377 | 430 | 807 | - |
| Gender (male/female) | 201/176 | 260/170 | 461/346 | 0.04061 |
| Age (mean ± SD) | 51.7 ± 16.5 | 60.3 ± 14.1 | 56.9 ± 15.7 | <0.001 |
| Number of eyes | 564 | 680 | 1244 | - |
| Number of cases | 649 | 975 | 1624 | - |
* Normal group vs. Glaucoma group.
Figure 1Procedure of prediction model development.
List of candidate features for Model building.
| No | Feature | Glaucoma | Healthy | |
|---|---|---|---|---|
| 1 | Sex | - | - | - |
| 2 | Age | 60.3 (14.13) | 51.7 (16.45) | <0.001 |
| 3 | GHT 2 | 4.28 (1.28) | 2.10 (1.53) | <0.001 |
| 4 | VFI 3 | 72.3 (32.24) | 95.7 (5.55) | <0.001 |
| 5 | MD 4 | −10.24 (9.72) | −2.33 (2.55) | <0.001 |
| 6 | Pattern standard deviation | 6.76 (4.27) | 2.49 (1.03) | <0.001 |
| 7 | RNFL 5 superior | 82.0 (27.71) | 112.6 (19.89) | <0.001 |
| 8 | RNFL Nasal | 56.1 (33.84) | 64.6 (16.64) | <0.001 |
| 9 | RNFL inferior | 79.9 (29.76) | 117.6 (20.28) | <0.001 |
| 10 | RNFL temporal | 58.6 (18.48) | 71.6 (15.01) | <0.001 |
| 11 | Mean of RNFL thickness | 68.5 (18.82) | 91.6 (12.60) | <0.001 |
| 12 | Intraocular pressure | 18.7 (8.69) | 15.7 (3.10) | <0.001 |
| 13 | Cornea thickness | 527.2 (34.15) | 530.1 (34.01) | <0.001 |
| 14 | BCVA 6 | 0.63 (0.31) | 0.73 (0.31) | 0.002 |
| 15 | Spherical equivalent | −1.63 (2.88) | −1.42 (3.08) | 0.12 |
| 16 | Axial length | 24.1 (1.81) | 24.1 (1.42) | 0.92 |
| 17 | Neuro-retinal rim | 0.79 (0.28) | 1.06 (0.21) | <0.001 |
| 18 | Cup | 0.47 (0.23) | 0.38 (0.43) | 0.16 |
| 19 | Disc | 1.97 (0.23) | 2.09 (0.43) | 0.25 |
| 20 | Mean of cup/disc ratio | 0.74 (0.11) | 0.65 (0.12) | <0.001 |
| 21 | vertical_cup/disc ratio | 0.73 (0.10) | 0.62 (0.16) | <0.001 |
| 22 | CNN 7 degree | 0.69 (0.18) | 0.53 (0.21) | <0.001 |
1 standard deviation; 2 glaucoma hemifield test; 3 visual field index; 4 mean deviation; 5 retinal nerve fiber layer; 6 best-corrected visual acuity; 7 convolutional neural network.
Final features list for building the prediction model.
| No | Feature | Abbreviation | Source |
|---|---|---|---|
| 1 | pattern standard deviation | PSD | VF |
| 2 | RNFL superior | RNFL_S | RNFL optical coherence tomography (OCT) |
| 3 | RNFL inferior | RNFL_I | RNFL OCT |
| 4 | RNFL temporal | RNFL_T | RNFL OCT |
| 5 | intraocular pressure | IOP | IOP test |
Figure 2Box plots for selected features (H, healthy control; G, glaucoma).
Hyper parameters of proposed XGboost prediction model.
| No | Hyper Parameter * | Value |
|---|---|---|
| 1 | booster | “gbtree” |
| 2 | eta | 0.7 |
| 3 | max_depth | 8 |
| 4 | gamma | 3 |
| 5 | subsample | 0.8 |
| 6 | colsample_bytree | 0.5 |
| 7 | objective | “multi:softprob” |
| 8 | eval_metric | “merror” |
| 9 | num_class | 2 |
* We use default values for other hyper-parameters that are not listed in the table.
Figure 3An example of a gauge chart and a boxplot for RNFL_T. (Left: gauge chart of RNFL_S with a value of 86, Right: boxplot and statistics of distributions for RNFL_S).
Figure 4Radar charts for typical glaucoma and healthy patients.
Figure 5An example of a Shapley Additive Explanations (SHAP) chart.
Final features list for building the prediction model.
| Metric | Support Vector Machine (SVM) | C50 | Random Forest (RF) | xgboost |
|---|---|---|---|---|
| Accuracy | 0.925 | 0.903 | 0.937 | 0.947 |
| Sensitivity | 0.933 | 0.874 | 0.924 | 0.941 |
| Specificity | 0.920 | 0.92 | 0.945 | 0.950 |
| AUC | 0.945 | 0.897 | 0.945 | 0.945 |
Figure 6Feature importance of proposed model.
Figure 7Feature interaction chart of the proposed model.
Cases of prediction by the proposed model.
| Case | PSD | RNFL_S | RNFL_I | RNFL_T | IOP | Diagnosis | Prediction |
|---|---|---|---|---|---|---|---|
| 1 | 1.92 | 142 | 153 | 94 | 13 | Healthy | Healthy |
| 2 | 11.85 | 83 | 41 | 55 | 14 | Glaucoma | Glaucoma |
| 3 | 1.53 | 73 | 107 | 71 | 18 | Glaucoma | Healthy |
| 4 | 2.76 | 81 | 95 | 73 | 18 | Healthy | Glaucoma |
| 5 | 2.31 | 98 | 130 | 60 | 12 | Glaucoma | Healthy |
Confusion matrix of the proposed prediction model.
| Predict | |||
|---|---|---|---|
| Glaucoma | Healthy | ||
| Actual | Glaucoma | 916 | 59 |
| Healthy | 28 | 621 | |
Analysis of prediction: glaucoma cases.
| Case | PSD | RNFL_S | RNFL_I | RNFL_T | IOP |
|---|---|---|---|---|---|
| Correct prediction | 7.99 | 75.31 | 72.86 | 57.98 | 20.73 |
| Miss prediction | 2.65 | 111.12 | 122.69 | 71.25 | 15.78 |
| 2.23 × 10−55 | 2.74 × 10−24 | 1.22 × 10−34 | 3.09 × 10−10 | 1.39 × 10−17 |
Analysis of prediction: healthy cases.
| Case | PSD | RNFL_S | RNFL_I | RNFL_T | IOP |
|---|---|---|---|---|---|
| Correct prediction | 2.23 | 117.0 | 126.59 | 77.69 | 15.72 |
| Miss prediction | 3.84 | 90.40 | 91.21 | 61.89 | 15.68 |
| 1.64 × 10−3 | 1.74 × 10−8 | 2.66 × 10−11 | 1.27 × 10−6 | 9.44 × 10−1 |
Figure 8SHAP chart that shows feature interaction. The same PSD value of 1.39 has different weights in case A and case B.
Figure 9Distributions of degree of support according to the feature values.