| Literature DB >> 36246944 |
Julian Lo1, Timothy T Yu1, Da Ma1, Pengxiao Zang2, Julia P Owen3, Qinqin Zhang4, Ruikang K Wang3,4, Mirza Faisal Beg1, Aaron Y Lee3, Yali Jia2, Marinko V Sarunic1.
Abstract
Purpose: To evaluate the performance of a federated learning framework for deep neural network-based retinal microvasculature segmentation and referable diabetic retinopathy (RDR) classification using OCT and OCT angiography (OCTA). Design: Retrospective analysis of clinical OCT and OCTA scans of control participants and patients with diabetes. Participants: The 153 OCTA en face images used for microvasculature segmentation were acquired from 4 OCT instruments with fields of view ranging from 2 × 2-mm to 6 × 6-mm. The 700 eyes used for RDR classification consisted of OCTA en face images and structural OCT projections acquired from 2 commercial OCT systems.Entities:
Keywords: API, application programming interface; COVID-19, coronavirus disease 2019; DR, diabetic retinopathy; DSC, Dice similarity coefficient; Diabetic retinopathy; Federated learning; IRB, institutional review board; Machine learning; NRDR, non-referable diabetic retinopathy; Neural network; OCT; OCTA, OCT angiography; OHSU, Oregon Health and Science University; RDR, referable diabetic retinopathy; SFU, Simon Fraser University
Year: 2021 PMID: 36246944 PMCID: PMC9559956 DOI: 10.1016/j.xops.2021.100069
Source DB: PubMed Journal: Ophthalmol Sci ISSN: 2666-9145
Figure 1Diagram showing federated learning schematic.
Datasets Used for Microvasculature Segmentation
| Image Source | Field of View (mm) | Included Capillary Complexes | No. of Available Images | Dataset Partitions (Quadrants) |
|---|---|---|---|---|
| SFU prototype swept-source OCTA | 2 × 2 | SVC | 30 | 18 training 6 validation 6 testing |
| RTVue XR Avanti (OptoVue, Inc.) | 3 × 3 | SVC | 26 | 16 training 5 validation 5 testing |
| Angioplex (Carl Zeiss Meditec) | 3 × 3 | SVC | 24 | 14 training 5 validation 5 testing |
| PLEX Elite 9000 (Carl Zeiss Meditec) | 6 × 6 | SVC and DVC | 73 (292 quadrants) | 42 (168) training 15 (60) validation 16 (64) testing |
DVC = deep vascular complex; SFU = Simon Fraser University; SVC = superficial vascular complex.
Overview of the 4 individual datasets used for the federated learning simulation of microvasculature segmentation. Images in the PLEX Elite 6 × 6-mm dataset were split after partitioning into training, validation, and test sets.
Datasets Used for Referable Diabetic Retinopathy Classification
| Institution | Commercial OCT Systems (Field of View; mm) | Binary Stratification (No. of Images in Each) | Institution-Specific Stratification (No. of Images in Each) | Dataset Allocation |
|---|---|---|---|---|
| OHSU17 | OptoVue, Avanti RTVue-XR SD OCT (3 × 3) | Non-RDR (n = 111), RDR (n = 212) | Normal (n = 99) | 20% testing |
| SFU16 | Zeiss, PLEX Elite SS OCT (3 × 3) | Non-RDR (n = 226); RDR (n = 151) | Normal (n = 157) | 20% testing |
OHSU = Oregon Health and Science University; RDR = referable diabetic retinopathy; SD = spectral-domain; SFU = Simon Fraser University; SS = swept-source.
Overview of the datasets from the collaborating institutions for the task of RDR classification.
Accuracy of Federated Learning on Microvasculature Segmentation
| Model Training Method (No. of Images) | Simon Fraser University Prototype 2 × 2-mm | OptoVue 3 × 3-mm | Angioplex 3 × 3-mm | PLEX Elite 6 × 6-mm |
|---|---|---|---|---|
| Federated learning | 0.857 ± 0.031 | 0.815 ± 0.006 | 0.850 ± 0.030 | 0.784 ± 0.053 |
| Sequential | 0.835 ± 0.033 | 0.789 ± 0.006 | 0.821 ± 0.036 | 0.810 ± 0.039 |
| Only SFU prototype (n = 18) | 0.858 ± 0.033 | 0.721 ± 0.017 | 0.808 ± 0.057 | 0.574 ± 0.061 |
| Only Optovue (n = 16) | 0.831 ± 0.049 | 0.829 ± 0.015 | 0.801 ± 0.059 | 0.621 ± 0.073 |
| Only Angioplex (n = 14) | 0.818 ± 0.045 | 0.817 ± 0.009 | 0.858 ± 0.024 | 0.713 ± 0.075 |
| Only PLEX Elite (n = 168) | 0.814 ± 0.034 | 0.800 ± 0.007 | 0.828 ± 0.024 | 0.817 ± 0.038 |
| Combined (all images) | 0.855 ± 0.029 | 0.804 ± 0.005 | 0.842 ± 0.030 | 0.806 ± 0.036 |
| Combined equally (n = 14 each) | 0.861 ± 0.033 | 0.829 ± 0.014 | 0.857 ± 0.029 | 0.795 ± 0.043 |
| Combined equally (n = 4 each) | 0.855 ± 0.030 | 0.829 ± 0.014 | 0.853 ± 0.026 | 0.785 ± 0.039 |
SFU = Simon Fraser University.
Data are presented as mean ± standard deviation. Accuracy for each training method when evaluated on each dataset’s test set is shown.
Internal model.
Highest value(s) in each column.
Dice Similarity Coefficient of Federated Learning on Microvasculature Segmentation
| Model Training Method (No. of Images) | Simon Fraser University Prototype 2 × 2-mm | OptoVue 3 × 3-mm | Angioplex 3 × 3-mm | PLEX Elite 6 × 6-mm |
|---|---|---|---|---|
| Federated learning | 0.773 ± 0.060 | 0.814 ± 0.011 | 0.824 ± 0.011 | 0.761 ± 0.078 |
| Sequential | 0.752 ± 0.060 | 0.782 ± 0.006 | 0.782 ± 0.016 | 0.798 ± 0.045 |
| Only SFU prototype (n = 18) | 0.756 ± 0.074 | 0.663 ± 0.019 | 0.746 ± 0.051 | 0.265 ± 0.141 |
| Only Optovue (n = 16) | 0.671 ± 0.153 | 0.836 ± 0.020 | 0.734 ± 0.061 | 0.439 ± 0.165 |
| Only Angioplex (n = 14) | 0.735 ± 0.079 | 0.818 ± 0.011 | 0.836 ± 0.009 | 0.637 ± 0.141 |
| Only PLEX Elite (n = 168) | 0.738 ± 0.059 | 0.801 ± 0.010 | 0.801 ± 0.011 | 0.816 ± 0.038 |
| Combined (all images) | 0.755 ± 0.059 | 0.797 ± 0.007 | 0.806 ± 0.010 | 0.797 ± 0.039 |
| Combined equally (14 each) | 0.772 ± 0.070 | 0.835 ± 0.018 | 0.835 ± 0.011 | 0.784 ± 0.049 |
| Combined equally (4 each) | 0.772 ± 0.058 | 0.838 ± 0.018 | 0.831 ± 0.010 | 0.782 ± 0.042 |
SFU = Simon Fraser University.
Data are presented as mean ± standard deviation. Dice similarity coefficients for each training method, evaluated on each dataset’s test set, are shown.
Internal model.
Highest value(s) in each column.
Performance of Federated Learning for Referable Diabetic Retinopathy Classification
| Testing Set | Training Model | Accuracy | Area under the Receiver Operating Characteristic Curve | Area under the Precision-Recall Curve | Balanced Accuracy | F1 Score | Sensitivity | Specificity |
|---|---|---|---|---|---|---|---|---|
| SFU | Internal (SFU) | 0.869 ± 0.046 | 0.956 ± 0.011 | 0.927 ± 0.015 | 0.870 ± 0.035 | 0.847 ± 0.041 | 0.875 ± 0.102 | 0.864 ± 0.127 |
| External (OHSU) | 0.676 ± 0.081 | 0.852 ± 0.036 | 0.814 ± 0.031 | 0.611 ± 0.104 | 0.341 ± 0.293 | 0.250 ± 0.235 | 0.973 ± 0.041 | |
| Federated learning | 0.875 ± 0.003 | 0.960 ± 0.011 | 0.936 ± 0.017 | 0.876 ± 0.037 | 0.851 ± 0.043 | 0.883 ± 0.103 | 0.870 ± 0.059 | |
| OHSU | Internal (OHSU) | 0.884 ± 0.014 | 0.973 ± 0.008 | 0.986 ± 0.004 | 0.891 ± 0.016 | 0.908 ± 0.014 | 0.869 ± 0.050 | 0.913 ± 0.071 |
| External (SFU) | 0.586 ± 0.131 | 0.766 ± 0.137 | 0.864 ± 0.087 | 0.594 ± 0.032 | 0.585 ± 0.252 | 0.568 ± 0.378 | 0.620 ± 0.349 | |
| Federated learning | 0.888 ± 0.019 | 0.954 ± 0.004 | 0.972 ± 0.002 | 0.897 ± 0.024 | 0.911 ± 0.015 | 0.869 ± 0.011 | 0.924 ± 0.042 |
OHSU = Oregon Health and Science University; SFU = Simon Fraser University.
Comparing federated learning with internal and external models for the calculated evaluation metrics: mean values are calculated with 1 standard deviation in parentheses.
Benjamini-Hochberg–adjusted statistically significant (P < 0.05) difference in means when compared with internal learning models.
Benjamini-Hochberg–adjusted statistically significant (P < 0.05) difference in means when compared with federated learning models.
Performance of Federated Learning on Stratified Diabetic Retinopathy Severities
| Testing Set | Training Model | Normal | Mild | Moderate | Severe | Proliferative |
|---|---|---|---|---|---|---|
| SFU | Internal (SFU) | 0.914 ± 0.090 | 0.750 ± 0.222 | 0.542 ± 0.315 | 0.942 ± 0.074 | 0.962 ± 0.077 |
| External (OHSU) | 1.000 ± 0.000 | 0.911 ± 0.135 | 0.083 ± 0.096 | 0.250 ± 0.202 | 0.327 ± 0.335 | |
| Federated learning | 0.922 ± 0.054 | 0.750 ± 0.071 | 0.750 ± 0.215 | 0.846 ± 0.126 | 0.981 ± 0.038 | |
| OHSU | Internal (OHSU) | 0.900 ± 0.082 | 1.000 ± 0.000 | 0.600 ± 0.365 | 0.500 ± 0.000 | 0.926 ± 0.014 |
| External (SFU) | 0.625 ± 0.328 | 0.583 ± 0.500 | 0.700 ± 0.383 | 0.375 ± 0.479 | 0.561 ± 0.376 | |
| Federated learning | 0.913 ± 0.048 | 1.000 ± 0.000 | 0.400 ± 0.000 | 0.500 ± 0.408 | 0.953 ± 0.014 |
OHSU = Oregon Health and Science University; SFU = Simon Fraser University.
Comparing federated learning to external and internal model accuracies at the institution-specific diabetic retinopathy severity stages: mean values are calculated with 1 standard deviation in parentheses.
Benjamini-Hochberg–adjusted statistically significant (P < 0.05) difference in means when compared with federated learning models.
Benjamini-Hochberg–adjusted statistically significant (P < 0.05) difference in means when compared with internal learning models.
Figure 4Histograms showing representations of model output probability distributions on Simon Fraser University (SFU) data showing the number of images (y-axes) with the corresponding probability score for referable diabetic retinopathy (RDR; x-axes): non-RDR images (left column) and RDR images (right column). Further 5-stage severity stratification is distinguished by the different shades within each subgroup. PDR = proliferative diabetic retinopathy.
Figure 5Histograms showing representations of model output probability distributions on Oregon Health and Science University (OHSU) data showing the number of images (y-axes) with the corresponding probability of referable diabetic retinopathy (RDR; x-axes): non-RDR images (left column) and RDR images (right images). Further 5-stage severity stratification is distinguished by the different shades within each subgroup. PDR = proliferative diabetic retinopathy.
Figure 6Confusion matrices investigating the referable diabetic retinopathy (RDR) classification performance for each of the 5 diabetic retinopathy severities on Simon Fraser University (SFU) data. Entries shaded blue and red represent correct and incorrect classification, respectively. PDR = proliferative diabetic retinopathy; NRDR = non-RDR.
Figure 7Confusion matrices investigating the referable diabetic retinopathy (RDR) classification performance for each of the 5 diabetic retinopathy severities on Oregon Health and Science University (OHSU) data. Entries shaded blue and red represent correct and incorrect classification, respectively. PDR = proliferative diabetic retinopathy; NRDR = non-RDR.