| Literature DB >> 31578338 |
Joon Young Kim1, Ha Eun Lee1, Yeon Hyung Choi1, Suk Jun Lee2, Jong Soo Jeon3.
Abstract
The purpose of this methodological study was to develop a convolutional neural network (CNN), which is a recently developed deep-learning-based image recognition method, to determine corneal ulcer severity in dogs. The CNN model was trained with images for which corneal ulcer severity (normal, superficial, and deep) were previously classified by veterinary ophthalmologists' diagnostic evaluations of corneal photographs from patients who visited the Veterinary Medical Teaching Hospital (VMTH) at Konkuk University and 3 different veterinary ophthalmology specialty hospitals in Korea. The original images (depicting normal corneas (36) and corneas with superficial (47) ulcers, deep (47) ulcers), flipped images (total 520), rotated images (total 520), and both flipped and rotated images (total 1,040) were labeled, learned and evaluated with GoogLeNet, ResNet, and VGGNet models, and the severity of each corneal ulcer image was determined. To accomplish this task, models based on TensorFlow, an open-source software library developed by Google, were used, and the labeled images were converted into TensorFlow record (TFRecord) format. The models were fine-tuned using a CNN model trained on the ImageNet dataset and then used to predict severity. Most of the models achieved accuracies of over 90% when classifying superficial and deep corneal ulcers, and ResNet and VGGNet achieved accuracies over 90% for classifying normal corneas, corneas with superficial ulcers, and corneas with deep ulcers. This study proposes a method to effectively determine corneal ulcer severity in dogs by using a CNN and concludes that multiple image classification models can be used in the veterinary field.Entities:
Mesh:
Year: 2019 PMID: 31578338 PMCID: PMC6775068 DOI: 10.1038/s41598-019-50437-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Architectures of GoogLeNet. (a) Inception module with dimension reduction, ResNet. (b) Residual learning: a building block, and VGGNet. (c) Architecture of VGGNet.
Accuracy (%) results of each model for predicting superficial and deep.
| Models | Raw images | Flipped images | Rotated images | Flipped and rotated images | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Superficial | Deep | Total | Superficial | Deep | Total | Superficial | Deep | Total | Superficial | Deep | Total | |
| Inception_v1 | 100.0 | 100.0 | 100.0 | 100.0 | 92.3 | 96.0 | 100.0 | 92.3 | 96.0 | 100.0 | 84.6 | 92.0 |
| Inception_v2 | 100.0 | 76.9 | 88.0 | 100.0 | 92.3 | 96.0 | 100.0 | 76.9 | 88.0 | 91.7 | 84.6 | 88.0 |
| Inception_v3 | 100.0 | 92.3 | 96.0 | 100.0 | 100.0 | 100.0 | 100.0 | 92.3 | 96.0 | 91.7 | 84.6 | 88.0 |
| Inception_v4 | 100.0 | 100.0 | 100.0 | 100.0 | 92.3 | 96.0 | 100.0 | 84.6 | 92.0 | 100.0 | 76.9 | 88.0 |
| ResNet_v1_50 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| ResNet_v1_101 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| ResNet_v1_152 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| ResNet_v2_50 | 100.0 | 92.3 | 96.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 84.6 | 92.0 |
| ResNet_v2_101 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 84.6 | 92.0 |
| ResNet_v2_152 | 100.0 | 92.3 | 96.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| VGGNet_16 | 100.0 | 100.0 | 100.0 | 100.0 | 92.3 | 96.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| VGGNet_19 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
Accuracy (%) results of each model for predicting normal, superficial, and deep.
| Models | Raw images | Flipped images | Rotated images | Flipped and rotated images | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Normal | Superficial | Deep | Total | Normal | Superficial | Deep | Total | Normal | Superficial | Deep | Total | Normal | Superficial | Deep | Total | |
| Inception_v1 | 66.7 | 100.0 | 84.6 | 85.3 | 88.9 | 100.0 | 92.3 | 94.1 | 66.7 | 100.0 | 76.9 | 82.4 | 88.9 | 100.0 | 92.3 | 94.1 |
| Inception_v2 | 66.7 | 83.3 | 84.6 | 79.4 | 100.0 | 100.0 | 61.5 | 85.3 | 33.3 | 66.7 | 61.5 | 55.9 | 55.6 | 91.7 | 84.6 | 79.4 |
| Inception_v3 | 77.8 | 100.0 | 84.6 | 88.2 | 88.9 | 100.0 | 92.3 | 94.1 | 66.7 | 75.0 | 76.9 | 73.5 | 77.8 | 100.0 | 69.2 | 82.4 |
| Inception_v4 | 77.8 | 100.0 | 84.6 | 88.2 | 100.0 | 100.0 | 92.3 | 97.1 | 66.7 | 100.0 | 76.9 | 82.4 | 77.8 | 100.0 | 92.3 | 91.2 |
| ResNet_v1_50 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 88.9 | 100.0 | 92.3 | 94.1 |
| ResNet_v1_101 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 66.7 | 100.0 | 100.0 | 91.2 | 100.0 | 100.0 | 100.0 | 100.0 |
| ResNet_v1_152 | 88.9 | 100.0 | 100.0 | 97.1 | 88.9 | 91.7 | 92.3 | 91.2 | 100.0 | 100.0 | 92.3 | 97.1 | 100.0 | 100.0 | 92.3 | 97.1 |
| ResNet_v2_50 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 77.8 | 100.0 | 100.0 | 94.1 | 100.0 | 91.7 | 92.3 | 94.1 |
| ResNet_v2_101 | 88.9 | 100.0 | 100.0 | 97.1 | 88.9 | 100.0 | 100.0 | 97.1 | 55.6 | 91.7 | 92.3 | 82.4 | 100.0 | 100.0 | 84.6 | 94.1 |
| ResNet_v2_152 | 100.0 | 100.0 | 100.0 | 100.0 | 88.9 | 100.0 | 100.0 | 97.1 | 88.9 | 83.3 | 84.6 | 85.3 | 88.9 | 100.0 | 92.3 | 94.1 |
| VGGNet_16 | 88.9 | 100.0 | 100.0 | 97.1 | 88.9 | 100.0 | 100.0 | 97.1 | 77.8 | 100.0 | 100.0 | 94.1 | 77.8 | 100.0 | 100.0 | 94.1 |
| VGGNet_19 | 88.9 | 100.0 | 100.0 | 97.1 | 88.9 | 100.0 | 100.0 | 97.1 | 88.9 | 100.0 | 100.0 | 97.1 | 88.9 | 91.7 | 100.0 | 94.1 |
Figure 2Examples of inaccurately classified images.
Figure 3Example of the cropping performed prior to image processing.
Figure 4Examples of images with each class label.