| Literature DB >> 35325999 |
Prasanna Venkatesh Ramesh1, Tamilselvan Subramaniam2, Prajnya Ray3, Aji Kunnath Devadas3, Shruthy Vaishali Ramesh4, Sheik Mohamed Ansar5, Meena Kumari Ramesh6, Ramesh Rajasekaran7, Sathyan Parthasarathi8.
Abstract
Purpose: For diagnosing glaucomatous damage, we have employed a novel convolutional neural network (CNN) from TrueColor confocal fundus images to conquer the black box dilemma in artificial intelligence (AI). This neural network with CNN architecture with human-in-the-loop (HITL) data annotation helps not only in diagnosing glaucoma but also in predicting and locating detailed signs in the glaucomatous fundus, such as splinter hemorrhages, glaucomatous optic atrophy, vertical glaucomatous cupping, peripapillary atrophy, and retinal nerve fiber layer (RNFL) defect.Entities:
Keywords: Artificial Intelligence; Confocal Fundus Images; Glaucomatous Cupping; HITL; Machine Learning
Mesh:
Year: 2022 PMID: 35325999 PMCID: PMC9240493 DOI: 10.4103/ijo.IJO_2583_21
Source DB: PubMed Journal: Indian J Ophthalmol ISSN: 0301-4738 Impact factor: 2.969
Figure 1(a) Sample fundus photograph of an eye with glaucomatous cupping and retinal nerve fiber layer defect utilized for annotating. (b) Customized labeling of the optic cup (green-dotted area). (c) Customized labeling of bayoneting signs (red-dotted area). (d) Customized labeling of superior notching (blue-dotted area). (e) Customized labeling of the optic disc (pink-dotted area). (f) Customized labeling of peripapillary atrophy (gray-dotted area). (g) Customized annotation of the retinal nerve fiber layer (RNFL) defect (green-dotted area). (h) Complete annotation of a fundus image with glaucomatous changes in the optic nerve head and RNFL region
Figure 2Image showing the methodology workflow of this study
The model head used to perform the final detection part
| # parameters | # custom head |
| nc: 24 # number of classes | head: |
| depth_multiple: 0.67 # model depth multiple | [[-1, 3, BottleneckCSP, [1024, False]], # 11 |
| width_multiple: 0.75 # layer channel multiple | [-1, 1, nn.Conv2d, [na * (nc+5), 1, 1, 0]], # 12 (P5/32-large) |
| # anchors | [-2, 1, nn.Upsample, [None, 2, ’nearest’]], |
| anchors: | [[-1, 6], 1, Concat, [1]], # cat backbone P4 |
| - [10,13, 16,30, 33,23] # P3/8 | [-1, 1, Conv, [512, 1, 1]], |
| - [30,61, 62,45, 59,119] # P4/16 | [-1, 3, BottleneckCSP, [512, False]], |
| - [116,90, 156,198, 373,326] # P5/32 | [-1, 1, nn.Conv2d, [na * (nc+5), 1, 1, 0]], # 17 (P4/16-medium) |
| # custom backbone | [-2, 1, nn.Upsample, [None, 2, ’nearest’]], |
| backbone: | [[-1, 4], 1, Concat, [1]], # cat backbone P3 |
| # [from, number, module, args] | [-1, 1, Conv, [256, 1, 1]], |
| [[-1, 1, Focus, [64, 3]], # 1-P1/2 | [-1, 3, BottleneckCSP, [256, False]], |
| [-1, 1, Conv, [128, 3, 2]], # 2-P2/4 | [-1, 1, nn.Conv2d, [na * (nc+5), 1, 1, 0]], # 22 (P3/8-small) |
| [-1, 3, Bottleneck, [128]], | [[], 1, Detect, [nc, anchors]], # Detect (P3, P4, P5) |
| [-1, 1, Conv, [256, 3, 2]], # 4-P3/8 | |
| [-1, 9, BottleneckCSP, [256]], | |
| [-1, 1, Conv, [512, 3, 2]], # 6-P4/16 | |
| [-1, 9, BottleneckCSP, [512]], | |
| [-1, 1, Conv, [1024, 3, 2]], # 8-P5/32 | |
| [-1, 1, SPP, [1024, [5, 9, 13]]], | |
| [-1, 6, BottleneckCSP, [1024]], # 10 | |
| ] |
Figure 3A sample of the batch size of eight image predictions during training, consisting of class probabilities, objectness scores, and bounding boxes
The private dataset of the high-resolution fundus images split-up into 80% and 20% for training and testing respectively
| Training Images | Testing Images | |
|---|---|---|
| Count | 1,120 | 280 |
| Percentages | 80% | 20% |
The split-up of the 280 testing images into three different testing groups (90+100+90). Test 1 predictions were performed after the first fifteen days of annotations. Test 2 predictions were performed after the next fifteen days of annotations. Test 3 predictions were performed after the final one month of annotation. With time there is a surge in the no. of correct predictions as the machine is learning with more data
| Test | Total Images | No. of images in which all the detailed findings were correctly predicted | No. of images in which all the detailed findings were wrongly predicted |
|---|---|---|---|
| Test 1 | 90 | 85 (94.44) | 5 (5.56) |
| Test 2 | 100 | 98 (98) | 2 (2) |
| Test 3 | 90 | 89 (98.89) | 1 (1.11) |
Figure 4(a) 2D distribution graph showing the annotations which were repeatedly used depicted as spikes. (b and c) 3D distribution graph showing repeated annotations seen as warmer colors
Figure 5(a) Image showing the quality of training during the beginning of training with respect to GLoU, objectness. classification, precision, and recall. (b) Image showing the quality of improvement at the end of the training with respect to GLoU, objectness, classification, precision, and recall
This table shows the distribution of specificity and sensitivity of the 90 images in Test 1. TP - True Positive; FN - False Negative; TN - True Negative; FP - False Positive
| Condition exists | Condition does not exist | Total | |
|---|---|---|---|
| Test Positive | 37 (True Positive) | 1 (False Positive) | 38 |
| Test Negative | 4 (False Negative) | 48 (True Negative) | 52 |
| Total | 41 | 49 | 90 |
| Sensitivity | TP/TP + FN | 37/(37+4) | 90.24% |
| Specificity | TN (TN + FP) | 48/(48+1) | 97.96% |
This table shows the distribution of specificity and sensitivity of the 100 images in Test 2. TP - True Positive; FN - False Negative; TN - True Negative; FP - False Positive
| Condition exists | Condition does not exist | Total | |
|---|---|---|---|
| Test Positive | 28 | 1 | 29 |
| Test Negative | 1 | 70 | 71 |
| Total | 29 | 71 | 100 |
| Sensitivity | TP/TP + FN | 28/(28+1) | 96.55% |
| Specificity | TN (TN + FP) | 70/(70+1) | 98.59% |
This table shows the distribution of specificity and sensitivity of the 90 images in Test 3. TP - True Positive; FN - False Negative; TN - True Negative; FP - False Positive
| Condition exists | Condition does not exist | Total | |
|---|---|---|---|
| Test positive | 36 (True Positive) | 1 (False Positive) | 37 |
| Test negative | 0 (False Negative) | 53 (True Negative) | 53 |
| Total | 36 | 54 | 90 |
| Sensitivity | TP/TP + FN | 36/(36+0) | 100% |
| Specificity | TN (TN + FP) | 53/(53+1) | 98.14% |
Figure 6(a-d) Image depicting the prediction done by the trained AI module on feeding a new fundus image not previously trained by the tool, after the AI tool has been primed and trained. These images were predicted with diagnosis, severity, and all their detailed findings seen in glaucomatous damage