| Literature DB >> 35741128 |
Josefin Sandström1, Hermanus Myburgh2, Claude Laurent3,4, De Wet Swanepoel4, Thorbjörn Lundberg1.
Abstract
BACKGROUND: Otitis media includes several common inflammatory conditions of the middle ear that can have severe complications if left untreated. Correctly identifying otitis media can be difficult and a screening system supported by machine learning would be valuable for this prevalent disease. This study investigated the performance of a convolutional neural network in screening for otitis media using digital otoscopic images labelled by an expert panel.Entities:
Keywords: artificial intelligence; convolutional neural network; digital imaging; global health; machine learning; otitis media
Year: 2022 PMID: 35741128 PMCID: PMC9222011 DOI: 10.3390/diagnostics12061318
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Flow chart. Description of tympanic membrane images from inclusion to selection for the convolutional neural network.
Figure 2Augmentation. Augmentation demonstrated by generating multiple low-resolution images from a high-resolution image. In this example a number of 32 × 32 pixels images (to the right) were generated from one 1280 × 1024 pixels image (to the left) using an augmentation factor of 5.
Figure 3GoogLeNet architecture (from [25]).
Figure 4Inception modules (from [25]).
Agreement (number and percentage) among expert panel members on diagnostic categories of images presented to the convolutional neural network (n = 273) and of the not possible to determine (NPD) images (n = 38).
| 5 Out of 5 | 4 Out of 5 | 3 Out of 5 | Total ( | |
|---|---|---|---|---|
|
|
|
|
| 2 |
|
|
|
|
| 20 |
|
|
|
|
| 183 |
|
|
|
|
| 22 |
|
|
|
|
| 46 |
|
| 133 (49%) | 66 (24%) | 74 (27%) | 273 |
|
| 9 (24%) | 10 (26%) | 74 (27%) | 38 |
The number of expert panel members that agreed upon the diagnosis is referred to as 5 out of 5, and so forth. Total* refers to the 273 images analysed by the convolutional neural network.
Figure 5Overall accuracy. Overall accuracy for the convolutional neural network for scenarios (A–D) for augmentation factors 0, 5, 10 and 20. (A): training and testing on dataset 1, (B): training and testing on dataset 1norm, (C): training on dataset 1 and dataset 2 combined and testing on dataset 1, (D): training on dataset 1norm and dataset 2norm combined and testing on dataset 1norm.
Confusion matrix for scenario A and B. 1= perfect correspondence and 0 = no correspondence.
| Norm (0) | Norm (5) | Norm (10) | Norm (20) | Path (0) | Path (5) | Path (10) | Path (20) | Wax (0) | Wax (5) | Wax (10) | Wax (20) | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Normal |
|
|
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Path | 0 | 0 | 0.10 | 0.15 |
|
|
|
| 0 | 0.10 | 0.09 | 0.07 |
| Wax | 0 | 0 | 0 | 0 | 0 | 0.05 | 0.01 | 0 |
|
|
|
|
Combined confusion matrix presenting the true positive rate (grey fields) and false positive rate (white fields), for scenario A and B for augmentation factors 0, 5, 10 and 20 (augmentation factor within brackets). In each box, the value above is for scenario A and the value below is for scenario B. Abbreviations: Normal (Norm), Pathological (Path).
Sensitivity and specificity.
| SCENARIOS | A | B | C | D | |
|---|---|---|---|---|---|
|
| Sensitivity | 100% | 98% | 100% | 98% |
| Specificity | 100% | 100% | 100% | 100% | |
|
| Sensitivity | 100% | 100% | 100% | 100% |
| Specificity | 100% | 100% | 100% | 100% | |
|
| Sensitivity | 95% | 100% | 96% | 100% |
| Specificity | 100% | 100% | 100% | 100% | |
|
| Sensitivity | 93% | 100% | 100% | 100% |
| Specificity | 100% | 100% | 100% | 100% |
Sensitivity and specificity for the convolutional neural network in identifying ears with pathology or wax, all scenarios and with different augmentation (Augm) factors.
Confusion matrix for scenario C and D. 1 = perfect correspondence and 0 = no correspondence.
| Norm (0) | Norm (5) | Norm (10) | Norm (20) | Path (0) | Path (5) | Path (10) | Path (20) | Wax (0) | Wax (5) | Wax (10) | Wax (20) | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Norm |
|
|
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Path | 0 | 0 | 0.07 | 0 |
|
|
|
| 0.41 | 0.17 | 0.08 | 0.13 |
| Wax | 0 | 0 | 0 | 0 | 0 | 0.30 | 0.02 | 0 |
|
|
|
|
Combined confusion matrix presenting the true positive rate (grey fields) and false positive rate (white fields), for scenarios C and D for augmentation factors 0, 5, 10 and 20 (augmentation factor within brackets). In each box, the value above is for scenario C and the value below is for scenario D. Abbreviations: Normal (Norm), Pathological (Path).
Figure 6CNN diagnosis of NPD images. Convolutional neural network (CNN) diagnosis of nine not possible to determine (NPD) images when trained on dataset 1 without normalization and augmentation.