| Literature DB >> 33043147 |
Fei Li1, Diping Song2, Han Chen2, Jian Xiong1, Xingyi Li1, Hua Zhong3, Guangxian Tang4, Sujie Fan5, Dennis S C Lam6, Weihua Pan7, Yajuan Zheng8, Ying Li2, Guoxiang Qu2, Junjun He2, Zhe Wang9, Ling Jin1, Rouxi Zhou1, Yunhe Song1, Yi Sun1, Weijing Cheng1, Chunman Yang10, Yazhi Fan11, Yingjie Li12, Hengli Zhang4, Ye Yuan6, Yang Xu3, Yunfan Xiong3, Lingfei Jin7, Aiguo Lv5, Lingzhi Niu8, Yuhong Liu1, Shaoli Li1, Jiani Zhang1, Linda M Zangwill13, Alejandro F Frangi14, Tin Aung15, Ching-Yu Cheng15, Yu Qiao2, Xiulan Zhang1, Daniel S W Ting15.
Abstract
By 2040, ~100 million people will have glaucoma. To date, there are a lack of high-efficiency glaucoma diagnostic tools based on visual fields (VFs). Herein, we develop and evaluate the performance of 'iGlaucoma', a smartphone application-based deep learning system (DLS) in detecting glaucomatous VF changes. A total of 1,614,808 data points of 10,784 VFs (5542 patients) from seven centers in China were included in this study, divided over two phases. In Phase I, 1,581,060 data points from 10,135 VFs of 5105 patients were included to train (8424 VFs), validate (598 VFs) and test (3 independent test sets-200, 406, 507 samples) the diagnostic performance of the DLS. In Phase II, using the same DLS, iGlaucoma cloud-based application further tested on 33,748 data points from 649 VFs of 437 patients from three glaucoma clinics. With reference to three experienced expert glaucomatologists, the diagnostic performance (area under curve [AUC], sensitivity and specificity) of the DLS and six ophthalmologists were evaluated in detecting glaucoma. In Phase I, the DLS outperformed all six ophthalmologists in the three test sets (AUC of 0.834-0.877, with a sensitivity of 0.831-0.922 and a specificity of 0.676-0.709). In Phase II, iGlaucoma had 0.99 accuracy in recognizing different patterns in pattern deviation probability plots region, with corresponding AUC, sensitivity and specificity of 0.966 (0.953-0.979), 0.954 (0.930-0.977), and 0.873 (0.838-0.908), respectively. The 'iGlaucoma' is a clinically effective glaucoma diagnostic tool to detect glaucoma from humphrey VFs, although the target population will need to be carefully identified with glaucoma expertise input.Entities:
Keywords: Optic nerve diseases; Translational research
Year: 2020 PMID: 33043147 PMCID: PMC7508974 DOI: 10.1038/s41746-020-00329-9
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Baseline characteristics of study participants in Phase I.
| Characteristics | Non-glaucoma group | Glaucoma group | |
|---|---|---|---|
| Patients (eyes) | 1761 (3030) | 3324 (4482) | – |
| Images, | 3566 (35.2) | 6569 (64.8) | – |
| Left/Right | 1834/1732 | 3206/3363 | – |
| Age, mean (SD) (years) | 48.4 (17.7) | 55.2 (16.4) | <0.001 |
| VFI, median (IQR) (%) | 98 (5) | 91 (19) | <0.001 |
| MD, median (IQR) (dB) | −2.78 (3.96) | −5.92 (7.58) | <0.001 |
| PSD, median (IQR) (dB) | 1.89 (1.71) | 3.97 (5.99) | <0.001 |
VFI visual field index, MD mean deviation, PSD pattern standard deviation, SD standard deviation, IQR interquartile range.
aComparison of the demographic and VF parameters between non-glaucoma and glaucoma groups by Wilcoxon rank sum test.
Performance of the CNNs and ophthalmologists in test set 1.
| AUC (95%CI) | Sensitivity | Specificity | ||
|---|---|---|---|---|
| Ophthalmologists | ||||
| Attending ophthalmologist #1 | 0.712 (0.632–0.792) | 0.741 (0.668–0.814) | 0.683 (0.566–0.801) | <0.001 |
| Attending ophthalmologist #2 | 0.689 (0.613–0.765) | 0.525 (0.442–0.608) | 0.852 (0.763–0.941) | <0.001 |
| Attending ophthalmologist #3 | 0.636 (0.553–0.718) | 0.583 (0.501–0.665) | 0.689 (0.572–0.805) | <0.001 |
| Glaucoma professor #1 | 0.656 (0.576–0.736) | 0.525 (0.442–0.608) | 0.787 (0.684–0.890) | <0.001 |
| Glaucoma professor #2 | 0.683 (0.617–0.750) | 0.580 (0.497–0.662) | 0.787 (0.684–0.890) | <0.001 |
| Glaucoma professor #3 | 0.717 (0.652–0.783) | 0.647 (0.568–0.727) | 0.787 (0.684–0.890) | <0.001 |
| CNN | ||||
| ND + NDP + PDP | 0.873 (0.822–0.924) | 0.922 (0.876–0.969) | 0.676 (0.567–0.785) | – |
| ND | 0.870 (0.817–0.923) | 0.915 (0.867–0.963) | 0.732 (0.629–0.835) | 0.81 |
| NDP | 0.857 (0.802–0.913) | 0.798 (0.729–0.868) | 0.817 (0.727–0.907) | 0.06 |
| PDP | 0.861 (0.808–0.914) | 0.868 (0.810–0.927) | 0.718 (0.614–0.823) | 0.06 |
CNN convolutional neural network, ND numeric displays, NDP numerical pattern deviation plots, PDP pattern deviation probability plots. AUC, area under curve.
aComparison of AUC between the ND + NDP + PDP and the other groups using Z test.
Fig. 1Comparison of diagnostic performance of the 2D-Fusion-CNN in VF interpretation with ophthalmologists in test set 1.
The figure shows receiver operating curve of glaucoma diagnosis by the 2D-Fusion-CNN (ND + NDP + PDP) in test set 1. 2D-Fusion-CNN combining pattern deviation probability plots (PDPs), numerical pattern deviation plots (NDPs), and numeric displays (NDs) as training data outperformed all the ophthalmologists with an AUC of 0.873.
Fig. 2Representative heatmaps generated by the CNNs.
The figure shows the heatmaps of the typical samples of eyes with and without glaucoma detected by the PDP-CNN. a and b stand for the heatmaps generated in the true-positive and true-negative cases, while c and d stand for the false-positive and false-negative cases.
Fig. 3Flow chart of the current study.
The study is composed of two parts. In Phase I, we developed the deep learning algorithms for classifying VFs. In Phase II, a smartphone app based on the deep learning algorithm was created and tested in the real world.