| Literature DB >> 31260494 |
Stefan Maetschke1, Bhavna Antony1, Hiroshi Ishikawa2, Gadi Wollstein2, Joel Schuman2, Rahil Garnavi1.
Abstract
Optical coherence tomography (OCT) based measurements of retinal layer thickness, such as the retinal nerve fibre layer (RNFL) and the ganglion cell with inner plexiform layer (GCIPL) are commonly employed for the diagnosis and monitoring of glaucoma. Previously, machine learning techniques have relied on segmentation-based imaging features such as the peripapillary RNFL thickness and the cup-to-disc ratio. Here, we propose a deep learning technique that classifies eyes as healthy or glaucomatous directly from raw, unsegmented OCT volumes of the optic nerve head (ONH) using a 3D Convolutional Neural Network (CNN). We compared the accuracy of this technique with various feature-based machine learning algorithms and demonstrated the superiority of the proposed deep learning based method. Logistic regression was found to be the best performing classical machine learning technique with an AUC of 0.89. In direct comparison, the deep learning approach achieved a substantially higher AUC of 0.94 with the additional advantage of providing insight into which regions of an OCT volume are important for glaucoma detection. Computing Class Activation Maps (CAM), we found that the CNN identified neuroretinal rim and optic disc cupping as well as the lamina cribrosa (LC) and its surrounding areas as the regions significantly associated with the glaucoma classification. These regions anatomically correspond to the well established and commonly used clinical markers for glaucoma diagnosis such as increased cup volume, cup diameter, and neuroretinal rim thinning at the superior and inferior segments.Entities:
Mesh:
Year: 2019 PMID: 31260494 PMCID: PMC6602191 DOI: 10.1371/journal.pone.0219126
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic data: Gender and race distribution, and mean values with standard deviations and ranges for age, IOP, MD and GHT.
| Healthy | POAG | |
|---|---|---|
| #Female | 88 | 217 |
| #Male | 49 | 215 |
| #White | 101 | 318 |
| #Black | 30 | 154 |
| #Asian | 5 | 12 |
| Age | 54.1±15.3 [22.1-88.9] | 64.3±12.5 [25.2-93.8] |
| IOP | 13.5±2.4 [9-23] | 16.7±5.8 [2-51] |
| MD | -0.8±1.7 [-9.9-2.8] | -6.8±8.1 [-32.9-2.17] |
| GHT | 1.6±1.0 [1-6] | 2.4±0.9 [1-6] |
Average numbers of healthy eyes and eyes with POAG in training, validation and test set.
| Healthy | POAG | |
|---|---|---|
| Training | 216 | 672 |
| Validation | 30 | 82 |
| Test | 17 | 93 |
Hyper-parameters and parameter ranges used for parameter tuning on validation set.
| Classifier | Parameter ranges |
|---|---|
| Naïve Bayes | none |
| Logistic regression | C = [10−1…101] |
| Linear SVM | C = [10−3…103] |
| Polynomial SVM | C = [10−3…103] |
| RBF SVM | C = [10−3…103] |
| Random Forest | max_features = [0.1…1.0] |
| Gradient Boosting | learning_rate = [10−1…100] |
| Extra Trees | max_features = [0.1…1.0] |
Fig 1Network architecture.
5-fold cross-validated prediction performance (mean AUC) of feature-based methods on validation set (AUC) and test set (AUC) with standard deviation.
Last column shows the differences between test and validation AUCs.
| Algorithm | |||
|---|---|---|---|
| Logistic Regression | 0.88±0.035 | -0.013 | |
| SVM (linear) | 0.89±0.044 | 0.88±0.038 | 0.007 |
| SVM (rbf) | 0.90±0.045 | 0.86±0.039 | 0.033 |
| Random Forest | 0.91±0.034 | 0.86±0.027 | 0.043 |
| Extra Trees | 0.90±0.038 | 0.86±0.046 | 0.043 |
| Naive Bayes | 0.87±0.033 | 0.86±0.029 | 0.015 |
| Gradient Boosting | 0.87±0.033 | 0.82±0.043 | 0.049 |
| SVM (poly) | 0.85±0.030 | 0.82±0.033 | 0.034 |
Fig 2Importance of individual features for glaucoma classification.
Error bars show standard deviation. Features are peripapillary RNFL thickness at 12 clock-hours (clockhour1..clockhour12), peripapillary RNFL thickness in the four quadrants (quad_t..quad_i), average RNFL thickness (avgthickness), rim area (rimeara), disc area (discarea), average cup-to-disc ratio avg_cd_ratio), vertical cup-to-disc ratio (vert_cd_ratio) and cup volume (cupvol).
5-fold cross-validated prediction performance (mean AUC) of feature-agnostic CNN on validation set (AUC) and test set (AUC) with standard deviation.
Last column shows the differences between test and validation AUCs. Results are reported for training with and without augmentation.
| Algorithm | augmentation | |||
|---|---|---|---|---|
| CNN | no | 0.93±0.015 | -0.003 | |
| CNN | yes | 0.95±0.018 | 0.92±0.046 | 0.027 |
Fig 3CAMs of a healthy and a POAG eye.
Top row shows enface (a) and side (b) view of healthy eye. Bottom row shows enface (c) and side (d) view of POAG eye. (N:Nasal, T:Temporal, S:Superior, I:Inferior).