| Literature DB >> 35419047 |
Maleika Heenaye-Mamode Khan1, Nuzhah Gooda Sahib-Kaudeer1, Motean Dayalen2, Faadil Mahomedaly2, Ganesh R Sinha3, Kapil Kumar Nagwanshi4, Amelia Taylor5.
Abstract
The lack of annotated datasets makes the automatic detection of skin problems very difficult, which is also the case for most other medical applications. The outstanding results achieved by deep learning techniques in developing such applications have improved the diagnostic accuracy. Nevertheless, the performance of these models is heavily dependent on the volume of labelled data used for training, which is unfortunately not available. To address this problem, traditional data augmentation is usually adopted. Recently, the emergence of a generative adversarial network (GAN) seems a more plausible solution, where synthetic images are generated. In this work, we have developed a deep generative adversarial network (DGAN) multi-class classifier, which can generate skin problem images by learning the true data distribution from the available images. Unlike the usual two-class classifier, we have developed a multi-class solution, and to address the class-imbalanced dataset, we have taken images from different datasets available online. One main challenge faced during our development is mainly to improve the stability of the DGAN model during the training phase. To analyse the performance of GAN, we have developed two CNN models in parallel based on the architecture of ResNet50 and VGG16 by augmenting the training datasets using the traditional rotation, flipping, and scaling methods. We have used both labelled and unlabelled data for testing to test the models. DGAN has outperformed the conventional data augmentation by achieving a performance of 91.1% for the unlabelled dataset and 92.3% for the labelled dataset. On the contrary, CNN models with data augmentation have achieved a performance of up to 70.8% for the unlabelled dataset. The outcome of our DGAN confirms the ability of the model to learn from unlabelled datasets and yet produce a good diagnosis result.Entities:
Mesh:
Year: 2022 PMID: 35419047 PMCID: PMC8995545 DOI: 10.1155/2022/1797471
Source DB: PubMed Journal: Comput Intell Neurosci
Comparison of the state-of-the-art methods.
| Authors | Techniques | Datasets used | Performance and observation |
|---|---|---|---|
| Hameed et al. [ | Pretrained AlexNet model | 9,144 images obtained from different sources available online | Done multi-class classification for healthy, acne, eczema, benign, or malignant melanoma. A recognition rate of 86.21% was reached. The authors have catered for overfitting using k-folds |
| Bi et al. [ | A probability-based stepwise integration (PSI) approach for segmentation refinement | ISBI 2016, ISBI 2017 and PH2 dataset | Able to segment melanomas with fuzzy boundaries and heterogeneous textures compared with other work conducted so far. However, the main focus is on the segmentation process |
| Al-Masni et al. [ | Deep full-resolution convolutional networks | Public datasets: ISBI 2017 and | Skin lesion segmentation using convolution network. An overall segmentation accuracy of 94.03% for the ISBI 2017 test dataset and 95.08%, respectively, for the PH2 dataset. However, the authors have worked until the segmentation of the lesions only |
| Yuan and Lo [ | Enhanced convolutional-deconvolutional networks | ISBI 2017 skin lesion segmentation challenge, trained on 2000 images | Worked on the segmentation process only |
| Ünver and Ayan (2019) [ | You Only Look Once (YOLO) and the GrabCut algorithm |
| A recognition rate of 90% challenges in the segmentation process because of the presence of other artefacts such as hairs, bubbles, and ruler marks |
| Sun et al. [ | Handcrafted techniques and VGGNet | SD-198 | Used VGGNet and has achieved a performance of VGGNet of 50.27%. The authors have analysed the performance of handcrafted techniques and deep techniques. The authors have concluded that the performance is different when using labelled datasets and real unlabelled datasets |
| Wu et al. [ | Pretrained EfficientNet-B4 CNN algorithm | Own created dataset consisting of 4,740 clinical images | A diagnosis assistant was built based on the pretrained model. An overall diagnostic accuracy of 95.8% was achieved. However, no work has been conducted with respect to overfitting, and there was no analysis of the hyperparameters that may influence performance when using CNN |
| Budhiman et al. [ | ResNet without data augmentation | ISIC 2018 | Used the architecture of ResNet50, ResNet40, ResNet25, ResNet10, and ResNet7 models for training the datasets. Experiments were conducted with and without data augmentation. The validation accuracy achieved was 83% on data without data augmentation |
Figure 1Type of skin disease. (a) Acne. (b) Nevus. (c) Angioma. (d) Eczema. (e) Dermatitis ulcer. (f) Heat rash.
Number of images for each skin disease.
| Type of skin disease | Number of images |
|---|---|
| Acne vulgaris | 900 (450 labelled and 450 unlabelled) |
| Angioma | 1200 (600 labelled and 600 unlabelled) |
| Carcinoma | 800 (400 labelled and 400 unlabelled) |
| Keratosis | 1000 (500 labelled and 500 unlabelled) |
| Nevus | 1100 (550 labelled and 550 unlabelled) |
| Café-au-lait macule | 900 (450 labelled and 450 unlabelled) |
| Dermatofibroma | 950 (475 labelled and 475 unlabelled) |
| Eczema | 1200 (600 labelled and 600 unlabelled) |
| Keloid | 800 (400 labelled and 400 unlabelled) |
| Psoriasis | 900 (450 labelled and 450 unlabelled) |
| Dermatitis ulcer | 750 (375 labelled and 375 unlabelled) |
| Steroid acne | 800 (400 labelled and 400 unlabelled) |
| Versicolor | 750 (375 labelled and 375 unlabelled) |
| Heat rash | 800 (400 labelled and 400 unlabelled) |
| Vulgaris | 900 (450 labelled and 450 unlabelled) |
List of notations.
| Notation | Description |
|---|---|
|
| Gaussian distribution as input and is |
|
| Real data in a given |
|
| Output |
|
| Mapping function from |
|
| Generator function |
|
| Discriminator function |
|
| Value function of the generator and discriminator together to a minimax game |
|
| True positive |
|
| True negative |
|
| False positives |
|
| False negatives |
Figure 2High-level architecture of the proposed solution.
Figure 3Architecture of GANs.
Figure 4Architecture of generator.
Figure 5Architecture of discriminator.
Figure 6Architecture of generator.
Figure 7Model training.
Figure 8Examples of preprocessed images.
Figure 9Training loss of generator and discriminator.
Performance of model using DCGAN.
| Class of skin disease | Number of test images | Accuracy (%) |
|---|---|---|
| Acne vulgaris | 270 | 92.1 |
| Angioma | 360 | 89.2 |
| Carcinoma | 240 | 92.1 |
| Keratosis | 300 | 88.7 |
| Nevus | 330 | 96.6 |
| Café-au-lait macule | 270 | 91.3 |
| Dermatofibroma | 285 | 83.2 |
| Eczema | 360 | 86.4 |
| Keloid | 240 | 94.2 |
| Psoriasis | 270 | 96.0 |
| Dermatitis ulcer | 225 | 82.4 |
| Steroid acne | 240 | 98.3 |
| Versicolor | 225 | 93.5 |
| Heat rash | 240 | 82.5 |
| Vulgaris | 270 | 90.2 |
Figure 10Data augmentation on images.
Figure 11Application of segmentation on images.
Figure 12Performance of ResNet50 and VGG16 after data augmentation with fully labelled datasets.
Figure 13Performance of ResNet50 and VGG16 after data augmentation with partially labelled datasets.
Summary of overall performance.
| GAN | Data augmentation | ||||
|---|---|---|---|---|---|
| Performance (%) (labelled dataset) | Performance (%) (unlabelled dataset) | Performance (%) (labelled dataset) | Performance (%) (unlabelled dataset) | ||
| ResNet50 | VGG16 | ResNet50 | VGG16 | ||
| 92.3 | 91.1 | 85.9 | 80.2 | 70.8 | 64.9 |
Comparison of state-of-the-art techniques.
| Authors | Number of classes | Technique | Accuracy (%) |
|---|---|---|---|
| Rashid et al. [ | 7 types of skin lesions | GAN | 86.1 |
| Ali et al. [ | 2 classes (malignant and benign melanoma) | DCGAN | 91.9 |
| Sedigh et al. [ | 2 classes (malignant and benign melanoma) | GAN | 71.1 |
| Udrea and Mitra [ | 2 classes (pigmented vs. nonpigmented lesions) | DCGAN | 92.1 |
| Proposed work | 15 types of skin diseases | DCGAN | 91.1 |
Existing work using CNN models.
| Authors | Details | Technique | Accuracy (%) |
|---|---|---|---|
| Budhiman et al. [ | ResNet50 on ISIC 2018 | ResNet50 without data augmentation | 83 |
| Ayan and Unver [ | CNN on ISIC 2018 | CNN with data augmentation | 81 |