| Literature DB >> 34901332 |
Jong Soo Kim1, Yongil Cho2, Tae Ho Lim2.
Abstract
BACKGROUND/Entities:
Keywords: Deep learning; Glottis; Localization; Orthogonal neural network; Pneumothorax
Year: 2021 PMID: 34901332 PMCID: PMC8640727 DOI: 10.1016/j.ejro.2021.100388
Source DB: PubMed Journal: Eur J Radiol Open ISSN: 2352-0477
Fig. 1Schematic representation of the ONN structure.
Fig. 2Example of chest X-ray image processing for application to an ONN: (a) Original X-ray image, (b) Pneumothorax-marked image.
Fig. 3Example of laryngeal image processing for application to an ONN and CNN: (a) Original laryngeal image, (b) Glottis-marked image.
Fig. 4Schematic representation of the CNN structure.
Test results for detecting the location of pneumothorax in chest X-rays by the type of deep-learning methods, i.e., artificial neural networks (ANNs), convolution neural networks (CNNs), and orthogonal neural networks (ONNs).
| Method | Hidden nodes | AUC | Cut-off | Sensitivity % | Specificity % | PPV % | NPV % | Accuracy % |
|---|---|---|---|---|---|---|---|---|
| ANN | 49 | 0.876 | 0.122 | 78.3 | 84.2 | 37.5 | 97.0 | 83.6 |
| ANN | 49 | 0.881 | 0.101 | 81.0 | 83.6 | 37.4 | 97.3 | 83.3 |
| ANN | 49–49–49 | 0.876 | 0.084 | 82.0 | 80.7 | 34.0 | 97.4 | 80.8 |
| ANN | 49–49–49 | 0.882 | 0.101 | 80.6 | 83.0 | 36.5 | 97.2 | 82.7 |
| CNN | 49 | 0.861 | 0.119 | 76.6 | 81.1 | 33.0 | 96.6 | 80.6 |
| CNN | 49–49 | 0.859 | 0.128 | 76.9 | 82.6 | 34.8 | 96.7 | 81.9 |
| CNN | 49 | 0.829 | 0.072 | 80.8 | 76.9 | 29.7 | 97.1 | 77.3 |
| CNN | 49–49 | 0.795 | 0.134 | 73.9 | 82.6 | 34.0 | 96.3 | 81.7 |
| ONN | 49 | 0.870 | 0.132 | 75.0 | 86.5 | 40.3 | 96.6 | 85.3 |
| ONN | 49–49 | 0.866 | 0.091 | 80.4 | 80.5 | 33.3 | 97.1 | 80.5 |
| ONN | 49 | 0.840 | 0.088 | 74.6 | 82.5 | 34.1 | 96.4 | 81.6 |
| ONN | 49–49 | 0.820 | 0.072 | 78.1 | 78.4 | 30.5 | 96.7 | 78.4 |
Cho et al. [14]; using a fully-connected small artificial neural network (ANN) with a sigmoid activation function for all the nodes with an input resolution of 20 × 20 or 30 × 30 pixels.
Cho et al.[14]; using a convolution neural network (CNN) with a sigmoid or RELU activation function for the fully-connected hidden nodes with an input resolution of 256 × 256 pixels.
This work; using an orthogonal neural network (ONN) with a sigmoid or RELU activation function for all the nodes other than the output nodes with an input resolution of 256 × 256 pixels.
Prediction results for the glottic location in laryngeal images by the type of deep-learning methods. %, (number of images).
| Method/number of hidden nodes | Accurate | Adjacent | Inaccurate |
|---|---|---|---|
| ANNa/98 | 74.5% (149) | 21.5% (43) | 4.0% (8) |
| CNNb/49 | 68.5% (137) | 25.5% (51) | 6.0% (12) |
| CNNb/49–49 | 66.0% (132) | 28.5% (57) | 5.5% (11) |
| ONNc/49 | 70.5% (141) | 20.5% (41) | 9.0% (18) |
| ONNc/49–49 | 67.0% (134) | 22.5% (45) | 10.5% (21) |
aKim et al.[11]; using an artificial neural network (ANN) with an input resolution of 30 × 30 pixels. bThis work; using a convolution neural network (CNN) with an input resolution of 256 × 256 pixels. cThis work; using an orthogonal neural network (ONN) with an input resolution of 256 × 256 pixels.