| Literature DB >> 36245759 |
Henry Shen-Lih Chen1,2, Guan-An Chen3, Jhen-Yang Syu3, Lan-Hsin Chuang2,4, Wei-Wen Su1,2, Wei-Chi Wu1,2, Jian-Hong Liu3, Jian-Ren Chen3, Su-Chen Huang3, Eugene Yu-Chuan Kang1,2,5.
Abstract
Objective: We aimed to develop a deep learning (DL)-based algorithm for early glaucoma detection based on color fundus photographs that provides information on defects in the retinal nerve fiber layer (RNFL) and its thickness from the mapping and translating relations of spectral domain OCT (SD-OCT) thickness maps. Design: Developing and evaluating an artificial intelligence detection tool. Subjects: Pretraining paired data of color fundus photographs and SD-OCT images from 189 healthy participants and 371 patients with early glaucoma were used.Entities:
Keywords: Autoencoder; DL, deep learning; DNN, deep neural network; GAN, generative adversarial network; Glaucoma; Image-to-image translation; OCT; PSNR, peak signal-to-noise ratio; RNFL, retinal nerve fiber layer; Retinal nerve fiber layer thickness; SD-OCT, spectral domain optical coherence tomography; SSIM, structural similarity index measure; VAE, variational autoencoder
Year: 2022 PMID: 36245759 PMCID: PMC9559108 DOI: 10.1016/j.xops.2022.100180
Source DB: PubMed Journal: Ophthalmol Sci ISSN: 2666-9145
Figure 1Illustration of the pipelines in model training and model application for the conversion from fundus images to OCT thickness maps. A, Model training: Color fundus images were inputted into the U-Net for generating OCT thickness maps. In addition, model parameter correction was performed using raw OCT images and the compound loss function (CLFun.) proposed in this study. Images similar to the original OCT thickness maps were generated. B, Model application: Only the color fundus images of the patients were required to generate output OCT thickness maps, which can be used for early glaucoma diagnosis. Conv = convolution layer; PSNR = peak signal-to-noise ratio; ReLU = rectified linear unit; SSIM = structural similarity index measure.
Demographic Data of the Study Participants
| Parameter | Glaucoma | Nonglaucoma |
|---|---|---|
| Number of subjects | 371 | 189 |
| Number of eyes | 742 | 378 |
| Age, years (SD) | 50.96 (11.96) | 44.41 (15.29) |
| Female (%) | 45.20 | 66.15 |
| Myopia (%) | 24.94 | 18.19 |
SD = standard deviation.
Figure 2U-Net architectures with different depths to compare image translation performances: A, 4 × 4 U-Net, B, 5 × 5 U-Net, and C, 6 × 6 U-Net. Conv = convolution layer; DEC = decoder convolution; EC = encoder convolution; InC = input convolution; ReLU = rectified linear unit.
Figure 3Learning curves of the model during training and validation with different network architectures in loss function measurement of peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM). Learning curves of (A) training PSNR, (B) validation PSNR, (C) training SSIM, and (D) validation SSIM.
Performance of U-Net Networks and Models with Various Depths and Parameters
| Architecture | Parameters | Training | Validation | ||
|---|---|---|---|---|---|
| PSNR (dB) | SSIM | PSNR (dB) | SSIM | ||
| 4 × 4 U-Net | 13.40 M | 23.76 ± 1.20 | 0.67 ± 0.03 | 18.84 ± 2.43 | 0.44 ± 0.09 |
| 5 × 5 U-Net | 53.52 M | 26.16 ± 1.10 | 0.75 ± 0.03 | 19.31 ± 3.06 | 0.44 ± 0.12 |
| 6 × 6 U-Net | 213.97 M | 27.63 ± 0.99 | 0.81 ± 0.12 | 19.42 ± 0.44 | 0.44 ± 0.02 |
dB = decibels; M = million; PSNR = peak signal-to-noise ratio; SSIM = structural similarity index measure.
Figure 4Four cases were used as examples to display the performance of a U-Net with various depths. The thickness information corresponding to the color is provided on the right side of the image. The cooler the color is, the thinner the relative thickness is. Comparing to the raw OCT, the 4 × 4 U-Net was not as ideal as the 5 × 5 and 6 × 6 U-Nets in the detailed translation of blood vessels. The 5 × 5 and 6 × 6 U-Nets differed little to the naked eye.
Figure 5Different peak signal-to-noise ratio (PSNR) indices used in validation and corresponding quality changes in the generated images. The higher PSNR value indicates the higher similarity of the generated images to the raw OCT.
Figure 6Example of early glaucoma detection by using our model to predict retinal nerve fiber layer (RNFL) thickness distribution on the retinal fundus image from the right eye of a 50-year-old female patient with low-grade myopia (axial length = 24.2 mm, spherical equivalent = −2.25 diopters). A, The conventional retinal fundus photography showed no visible RNFL defect. B, No scotoma was identified by using the 24-2 visual field test. C, The OCT thickness map automatically generated by the model showed a high probability of RNFL thinning in the superotemporal retina region (red color labeled area). D, Although the macular ganglion cell analysis revealed no abnormality, the thickness map from the wide-field swept-source OCT examination showed a wedge-shaped dark-blue area indicating RNFL thinning (black arrows). In addition, the OCT superpixel map revealed an arcuate pattern of contiguous abnormal yellow/red pixels over the corresponding macula area. The lesion identified by the OCT scan was highly correlated with the results generated by the model. E, Circumpapillary RNFL thickness analysis confirmed a borderline superotemporal RNFL thinning. GCL = ganglion cell layer.