| Literature DB >> 33950206 |
Ji Wang1, Jie Ji2,3, Mingzhi Zhang1, Jian-Wei Lin1, Guihua Zhang1, Weifen Gong1, Ling-Ping Cen1, Yamei Lu4, Xuelin Huang5, Dingguo Huang1, Taiping Li1, Tsz Kin Ng1,6,7, Chi Pui Pang1,7.
Abstract
Importance: A retinopathy of prematurity (ROP) diagnosis currently relies on indirect ophthalmoscopy assessed by experienced ophthalmologists. A deep learning algorithm based on retinal images may facilitate early detection and timely treatment of ROP to improve visual outcomes. Objective: To develop a retinal image-based, multidimensional, automated, deep learning platform for ROP screening and validate its performance accuracy. Design, Setting, and Participants: A total of 14 108 eyes of 8652 preterm infants who received ROP screening from 4 centers from November 4, 2010, to November 14, 2019, were included, and a total of 52 249 retinal images were randomly split into training, validation, and test sets. Four main dimensional independent classifiers were developed, including image quality, any stage of ROP, intraocular hemorrhage, and preplus/plus disease. Referral-warranted ROP was automatically generated by integrating the results of 4 classifiers at the image, eye, and patient levels. DeepSHAP, a method based on DeepLIFT and Shapley values (solution concepts in cooperative game theory), was adopted as the heat map technology to explain the predictions. The performance of the platform was further validated as compared with that of the experienced ROP experts. Data were analyzed from February 12, 2020, to June 24, 2020. Exposure: A deep learning algorithm. Main Outcomes and Measures: The performance of each classifier included true negative, false positive, false negative, true positive, F1 score, sensitivity, specificity, receiver operating characteristic, area under curve (AUC), and Cohen unweighted κ.Entities:
Mesh:
Year: 2021 PMID: 33950206 PMCID: PMC8100867 DOI: 10.1001/jamanetworkopen.2021.8758
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Demographic Details of Patients From 4 Centers
| Device | No./total No. (%) | |||||||
|---|---|---|---|---|---|---|---|---|
| Total | JSIEC | Yuexiu | Panyu | Qingyuan | ||||
| RetCam | RetCam II & III | RetCam II | RetCam III | RetCam III | RetCam III | RetCam III | ||
| Patient, No. | 8652 | 3714 | 2533 | 1181 | 2155 | 1251 | 1532 | |
| Images, No. | 52 249 | 15 382 | 9225 | 6157 | 11 269 | 14 708 | 10 890 | |
| Eyes, No. | 14 108 | 5907 | 3964 | 1943 | 3521 | 2225 | 2455 | |
| Patient sex known | 7973/8652 (92.2) | 3714/3714 (100) | 2533/2533 (100) | 1181/1181(100) | 2058/2155 (95.5) | 766/1251 (61.2) | 1435/1532 (93.7) | |
| Boys | 4818/7973 (60.4) | 2287/3714 (61.6) | 1607/2533 (63.4) | 680/1181 (57.6) | 1295/2058 (62.9) | 446/766 (58.2) | 790/1435 (55.1) | |
| Patients with GA available | 6364/8652 (73.6) | 3545/3714 (95.4) | 2399/2533 (94.7) | 1146/1181 (97.0) | 2056/2155 (95.4) | 763/1251 (61.0) | 0 | |
| GA, mean (SD), wk | 32.9 (3.05) | 32.7 (2.60) | 32.72 (2.54) | 32.62 (2.71) | 32.8 (3.40) | 34.0 (3.67) | NA | |
| Patients with BW available | 6363/8652 (73.5) | 3546/3714 (95.5) | 2400/2533 (94.7) | 1146/1181 (97.0) | 2056/2155 (95.4) | 761/1251 (60.8) | 0 | |
| BW, mean (SD), g | 1925 (774) | 1885 (677) | 1889 (742) | 1879 (516) | 1922 (931) | 2118 (694) | NA | |
Abbreviations: BW, birth weight; GA, gestational age; JSIEC, Joint Shantou International Eye Center; NA, not available.
A total of 566 of 6364 infants (8.89%) were term infants with a GA of 37 weeks or older. The data of postmenstrual age was only available from 2457 infants of JSIEC, and the mean (SD) postmenstrual age was 40.6 (3.07) weeks. Images were taken from 1 visit of each infant randomly. The average (range) number of images per eye was 3 (1-11 images).
RetCam II and III are corneal contact retinal cameras used to photograph the retina. They are manufactured by Clarity Medical Systems.
Performance of 5 Classifiers of the Platform
| Classifiers | Data set | No. | F1 | Sensitivity | Specificity | AUC (95% CI) | |||
|---|---|---|---|---|---|---|---|---|---|
| TN | FP | FN | TP | ||||||
| Image quality | Training | 2771 | 23 | 849 | 35386 | 0.988 | 0.977 | 0.992 | 0.9976 (0.9973-0.9979) |
| Validation | 306 | 21 | 156 | 4657 | 0.981 | 0.968 | 0.936 | 0.9899 (0.9874-0.9924) | |
| Test | 558 | 30 | 253 | 7239 | 0.981 | 0.966 | 0.949 | 0.9922 (0.9906-0.9938) | |
| Stage | Training | 31076 | 334 | 2 | 4823 | 0.966 | 1.000 | 0.989 | 0.9997 (0.9996-0.9998) |
| Validation | 4253 | 67 | 14 | 479 | 0.922 | 0.972 | 0.984 | 0.9977 (0.9959-0.9994) | |
| Test | 6349 | 98 | 19 | 1026 | 0.946 | 0.982 | 0.985 | 0.9981 (0.9974-0.9989) | |
| Hemorrhage | Training | 30800 | 167 | 11 | 5257 | 0.983 | 0.998 | 0.995 | 0.9999 (0.9999-0.9999) |
| Validation | 4123 | 40 | 8 | 642 | 0.964 | 0.988 | 0.990 | 0.9982 (0.9957-1.0000) | |
| Test | 6389 | 54 | 29 | 1020 | 0.961 | 0.972 | 0.992 | 0.9977 (0.9963-0.9991) | |
| Posterior | Training | 27203 | 861 | 488 | 7683 | 0.919 | 0.940 | 0.969 | 0.9936 (0.9931-0.9941) |
| Validation | 3558 | 153 | 70 | 1032 | 0.902 | 0.936 | 0.959 | 0.9908 (0.9890-0.9927) | |
| Test | 5629 | 220 | 127 | 1516 | 0.897 | 0.923 | 0.962 | 0.9901 (0.9884-0.9918) | |
| Preplus/plus | Training | 12701 | 192 | 0 | 631 | 0.868 | 1.000 | 0.985 | 0.9993 (0.9991-0.9996) |
| Validation | 1707 | 27 | 12 | 120 | 0.860 | 0.909 | 0.984 | 0.9882 (0.9753-1.0000) | |
| Test | 2518 | 78 | 10 | 112 | 0.718 | 0.918 | 0.970 | 0.9827 (0.9706-0.9948) | |
| RW | |||||||||
| Images | Test | 5295 | 144 | 40 | 2013 | 0.956 | 0.981 | 0.974 | 0.9956 (0.9942-0.9970) |
| Eyes | 1694 | 83 | 7 | 482 | 0.915 | 0.986 | 0.953 | 0.9938 (0.9898-0.9977) | |
| Patients | 1047 | 68 | 6 | 327 | 0.898 | 0.982 | 0.939 | 0.9901 (0.9835-0.9966) | |
| RW without hemorrhage | |||||||||
| Images | Test | 6193 | 156 | 28 | 1115 | 0.924 | 0.976 | 0.975 | 0.9958 (0.9943-0.9973) |
| Eyes | 1902 | 75 | 4 | 285 | 0.878 | 0.986 | 0.962 | 0.9959 (0.9927-0.9991) | |
| Patients | 1183 | 61 | 3 | 201 | 0.863 | 0.985 | 0.951 | 0.9937 (0.9884-0.9991) | |
Abbreviations: AUC, area under curve; F1, F1 score; FN, false negative; FP, false positive; ROP, retinopathy of prematurity; RW, referral warranted; TN, true negative; TP, true positive.
Ignoring the hemorrhage dimension, 3 levels of RW ROP were regenerated based on the results of stage and preplus/plus classifiers in the test set.
Figure 1. The Receive Operating Characteristic (ROC) Curves for System Performance
The ROC for detecting any stage of retinopathy of prematurity (ROP) (A), intraocular hemorrhage (B), and preplus/plus disease (C). The area under curve (AUC) of training, validation, and test sets from each of the classifiers are shown (D). The ROC for referral-warranted ROP (RW ROP) was obtained by aggregating the results of 4 classifiers (stage, hemorrhage, posterior, and preplus/plus). Any positive findings of ROP-related features would result in the RW, and the AUC at the image, eye, and patient levels are shown.
Figure 2. Visualization of Stage and Hemorrhage in Heat Maps
The first and second columns indicate the original and preprocessed retinal images, respectively. The third and fourth columns are heat maps generated by Class Activation Mapping (CAM) and DeepSHAP, respectively. A, The original image presents both the stage of ROP and retinal hemorrhage on the peripheral retina. The upper row contains heat maps showing the stage of ROP, whereas the lower row contains heat maps showing retinal hemorrhages. B, The image presents both the stage of ROP and the reflection. Though it shares a similar morphology with its reflection, the lesion is successfully recognized. C, The image shows retinal hemorrhages and many artifacts; however, the hemorrhage area is highlighted by the heat map. DeepSHAP shows the more fine-grained heat map than CAM on each feature.
Performance Comparison Between Human Experts and J-PROP
| ROP-related features | Reader | No. | F1 | Sensitivity | Specificity | AUC (95% CI) | |||
|---|---|---|---|---|---|---|---|---|---|
| TN | FP | FN | TP | ||||||
| Stage of ROP | Expert 1 | 152 | 0 | 0 | 48 | 1.000 | 1.000 | 1.000 | NA |
| Expert 2 | 152 | 0 | 9 | 39 | 0.897 | 0.812 | 1.000 | NA | |
| Expert 3 | 148 | 4 | 3 | 45 | 0.928 | 0.938 | 0.974 | NA | |
| Experts_average | NA | NA | NA | NA | 0.943 | 0.917 | 0.991 | NA | |
| J-PROP | 148 | 4 | 1 | 47 | 0.949 | 0.979 | 0.974 | 0.9980 (0.9950-1.0000) | |
| Intraocular hemorrhage | Expert1 | 156 | 4 | 1 | 39 | 0.940 | 0.975 | 0.975 | NA |
| Expert 2 | 160 | 0 | 3 | 37 | 0.961 | 0.925 | 1.000 | NA | |
| Expert 3 | 155 | 5 | 1 | 39 | 0.929 | 0.975 | 0.969 | NA | |
| Experts_average | NA | NA | NA | NA | 0.943 | 0.958 | 0.981 | NA | |
| J-PROP | 158 | 2 | 0 | 40 | 0.976 | 1.000 | 0.988 | 1.0000 (1.0000-1.0000) | |
| Preplus/plus disease | Expert 1 | 190 | 0 | 0 | 10 | 1.000 | 1.000 | 1.000 | NA |
| Expert 2 | 190 | 0 | 0 | 10 | 1.000 | 1.000 | 1.000 | NA | |
| Expert 3 | 189 | 1 | 0 | 10 | 0.952 | 1.000 | 0.995 | NA | |
| Experts_average | NA | NA | NA | NA | 0.984 | 1.000 | 0.998 | NA | |
| J-PROP | 187 | 3 | 0 | 10 | 0.870 | 1.000 | 0.984 | 1.0000 (1.0000-1.0000) | |
| RW ROP, image level | Expert 1 | 117 | 0 | 1 | 82 | 0.994 | 0.988 | 1.000 | NA |
| Expert 2 | 117 | 0 | 6 | 77 | 0.962 | 0.928 | 1.000 | NA | |
| Expert 3 | 114 | 3 | 4 | 79 | 0.958 | 0.952 | 0.974 | NA | |
| Experts_average | NA | NA | NA | NA | 0.971 | 0.956 | 0.991 | NA | |
| J-PROP | 115 | 2 | 0 | 83 | 0.988 | 1.000 | 0.983 | 0.9999 (0.9996-1.0000) | |
| RW ROP, image level, without hemorrhage | Expert 1 | 147 | 0 | 0 | 53 | 1.000 | 1.000 | 1.000 | NA |
| Expert 2 | 147 | 0 | 5 | 48 | 0.950 | 0.906 | 1.000 | NA | |
| Expert 3 | 147 | 0 | 3 | 50 | 0.971 | 0.943 | 1.000 | NA | |
| Experts_average | NA | NA | NA | NA | 0.974 | 0.950 | 1.000 | NA | |
| J-PROP | 144 | 3 | 0 | 53 | 0.972 | 1.000 | 0.980 | 1.0000 (1.0000-1.0000) | |
Abbreviations: AUC, area under curve; F1, F1 score; FN, false negative; FP, false positive; J-PROP, Joint Shantou International Eye Center Platform for Retinopathy of Prematurity; NA, not available; ROP, retinopathy of prematurity; RW, referral warranted; TN, true negative; TP, true positive.
The results of referral-warranted ROP here were generated by ignoring the hemorrhage dimension.