| Literature DB >> 32704414 |
Tae Keun Yoo1, Ik Hee Ryu2, Hannuy Choi2, Jin Kuk Kim2, In Sik Lee2, Jung Sub Kim2, Geunyoung Lee3, Tyler Hyungtaek Rim4.
Abstract
Purpose: Recently, laser refractive surgery options, including laser epithelial keratomileusis, laser in situ keratomileusis, and small incision lenticule extraction, successfully improved patients' quality of life. Evidence-based recommendation for an optimal surgery technique is valuable in increasing patient satisfaction. We developed an interpretable multiclass machine learning model that selects the laser surgery option on the expert level.Entities:
Keywords: corneal laser surgery; explainable machine learning; multiclass classification; refractive surgery
Mesh:
Year: 2020 PMID: 32704414 PMCID: PMC7346876 DOI: 10.1167/tvst.9.2.8
Source DB: PubMed Journal: Transl Vis Sci Technol ISSN: 2164-2591 Impact factor: 3.283
Figure 1.Architecture of the proposed machine learning models for corneal laser refractive surgery recommendation.
Figure 2.Schematic diagram of our proposed interpretable model for corneal laser refractive surgery recommendation.
Comparison Between LASEK, LASIK, SMILE, and Contraindication Cases
| LASEK | LASIK | SMILE | Contraindication |
| |
|---|---|---|---|---|---|
| Number | 4893 | 6123 | 5834 | 1630 | |
| Age (years) | 26.9 ± 5.6 | 27.3 ± 6.1 | 27.3 ± 6.0 | 33.8 ± 8.0 | <0.001 |
| Sex, female (%) | 2723 (55.7) | 3202 (52.3) | 3069 (52.6) | 868 (53.3) | <0.001 |
| Spherical equivalent (Diopter) | −5.38 ± 2.23 | −3.94 ± 1.84 | −4.39 ± 1.69 | −7.82 ± 5.02 | <0.001 |
| CDVA (logMAR) | −0.012 ± 0.039 | −0.012 ± 0.038 | −0.011 ± 0.038 | 0.017 ± 0.121 | <0.001 |
| IOP (mm Hg) | 15.1 ± 3.5 | 15.5 ± 3.4 | 15.4 ± 3.0 | 15.2 ± 2.4 | <0.001 |
| Pupil diameter (mm) | 2.93 ± 0.62 | 2.88 ± 0.55 | 2.88 ± 0.57 | 2.86 ± 0.52 | <0.001 |
| Central corneal thickness (µm) | 530.5 ± 33.0 | 549.3 ± 27.4 | 545.9 ± 33.1 | 503.8 ± 42.9 | <0.001 |
| NIBUT (seconds) | 6.67 ± 6.40 | 7.04 ± 6.70 | 7.06 ± 6.67 | 5.20 ± 3.61 | <0.001 |
| Anticipated surgery option | |||||
| LASEK (%) | 2402 (49.1) | 1586 (25.9) | 1488 (25.5) | 448 (27.5) | <0.001 |
| LASIK (%) | 983 (20.1) | 2951 (48.2) | 915 (15.7) | 463 (28.4) | <0.001 |
| SMILE (%) | 1052 (21.5) | 1193 (19.5) | 3296 (56.5) | 523 (32.1) | <0.001 |
| ICL or none | 456 (9.3) | 392 (6.4) | 134 (2.3) | 196 (12.0) | <0.001 |
| Occupation | |||||
| Sports (%) | 680 (13.9) | 380 (6.2) | 583 (10.0) | 155 (9.5) | <0.001 |
| Driver (%) | 298 (6.1) | 416 (6.8) | 543 (9.3) | 1277 (7.8) | <0.001 |
| Computer or smartphone (%) | 2906 (59.4) | 3472 (56.7) | 3506 (60.1) | 950 (58.3) | <0.001 |
| Anticipated recovery time | |||||
| One day (%) | 274 (5.6) | 1702 (27.8) | 1762 (30.2) | 414 (25.4) | <0.001 |
| Three days (%) | 2251 (46.0) | 3594 (58.7) | 2818 (48.3) | 843 (51.7) | <0.001 |
| One week (%) | 1649 (33.7) | 1163 (19.0) | 1202 (20.6) | 328 (20.1) | <0.001 |
| Concern about budget (%) | 3303 (67.5) | 4298 (70.2) | 4230 (72.5) | 1154 (70.8) | <0.001 |
CDVA, corrected distance visual acuity; IOP, intraocular pressure; NIBUT, noninvasive breakup time.
Comparison using the 1-way ANOVA test and χ2 test.
Classification Performance of Machine Learning Models to Predict Laser Corneal Refractive Surgery Option Via 10-Fold Cross-Validation
| Accuracy (%) (95% CI) | RCI (95% CI) | κ (95% CI) |
| |
|---|---|---|---|---|
| Trained with SMOTE | ||||
| Multiclass XGBoost | 82.1 (81.1–83.0) | 0.537 (0.525–0.549) | 0.758 (0.747–0.769) | Reference |
| One-versus-rest XGBoost | 81.7 (80.7–82.6) | 0.531 (0.519–0.543) | 0.753 (0.742–0.764) | 0.578 |
| One-versus-one XGBoost | 81.9 (80.9–82.8) | 0.534 (0.522–0.546) | 0.756 (0.745–0.767) | 0.780 |
| Random forest | 81.5 (80.5–82.4) | 0.527 (0.515–0.539) | 0.750 (0.739–0.761) | 0.407 |
| One-versus-rest SVM | 75.3 (74.2–76.3) | 0.422 (0.410–0.434) | 0.668 (0.656–0.680) | <0.001 |
| One-versus-one SVM | 75.7 (74.7–76.7) | 0.428 (0.415–0.441) | 0.674 (0.662–0.686) | <0.001 |
| DAG SVM | 75.5 (74.5–76.5) | 0.425 (0.412–0.438) | 0.671 (0.659–0.683) | <0.001 |
| Artificial neural network | 76.0 (74.9–77.0) | 0.432 (0.419–0.445) | 0.677 (0.665–0.689) | <0.001 |
| Trained without SMOTE | ||||
| Multiclass XGBoost | 80.2 (79.2–81.2) | 0.514 (0.502–0.526) | 0.730 (0.719–0.741) | 0.011 |
| One-versus-rest XGBoost | 80.1 (79.1–81.1) | 0.513 (0.501–0.525) | 0.727 (0.715–0.738) | 0.015 |
| One-versus-one XGBoost | 78.5 (77.4–79.5) | 0.505 (0.493–0.517) | 0.721 (0.709–0.732) | 0.001 |
| Without anticipated surgery option | ||||
| Multiclass XGBoost | 70.3 (69.2–71.4) | 0.407 (0.394–0.420) | 0.593 (0.581–0.605) | <0.001 |
| One-versus-rest XGBoost | 68.8 (67.7–69.9) | 0.385 (0.372–0.398) | 0.571 (0.559–0.583) | <0.001 |
| One-versus-one XGBoost | 68.3 (67.2–69.4) | 0.380 (0.366–0.393) | 0.565 (0.552–0.568) | <0.001 |
CI, confidence interval; RCI, relative classifier information; SVM, support vector machine.
Comparison of accuracy with the best machine learning technique (multiclass XGBoost with SMOTE).
Figure 3.Global feature importance estimates selected by the XGBoost-based SHAP technique. (A) Multiclass (4 classes) classification problem. (B) Binary classification with LASEK versus rest groups. (C) Binary classification with LASIK versus rest groups. (D) Binary classification with SMILE versus rest groups. (E) Binary classification with Contraindication versus rest groups.
Figure 4.Confusion matrix of multiclass XGBoost and performance for multiclass XGBoost and other algorithms. (A) Performance in the internal validation. (B) Performance in the external validation. The error bars demonstrate the 95% confidence intervals.
Figure 5.Comparison of the primary factors between the explainable XGBoost model and clinician's decision. The surgical decision was based on a review of electronic health records. One hundred samples for each group were extracted randomly from the external validation dataset for comparison.
Figure 6.SMILE case example showing the machine learning prediction result with local interpretation via force plots.
Figure 7.Contraindication case example showing the machine learning prediction result with local interpretation via force plots.