| Literature DB >> 31915523 |
Fatih Demir1, Abdulkadir Sengur1, Varun Bajaj2.
Abstract
Treatment of lung diseases, which are the third most common cause of death in the world, is of great importance in the medical field. Many studies using lung sounds recorded with stethoscope have been conducted in the literature in order to diagnose the lung diseases with artificial intelligence-compatible devices and to assist the experts in their diagnosis. In this paper, ICBHI 2017 database which includes different sample frequencies, noise and background sounds was used for the classification of lung sounds. The lung sound signals were initially converted to spectrogram images by using time-frequency method. The short time Fourier transform (STFT) method was considered as time-frequency transformation. Two deep learning based approaches were used for lung sound classification. In the first approach, a pre-trained deep convolutional neural networks (CNN) model was used for feature extraction and a support vector machine (SVM) classifier was used in classification of the lung sounds. In the second approach, the pre-trained deep CNN model was fine-tuned (transfer learning) via spectrogram images for lung sound classification. The accuracies of the proposed methods were tested by using the ten-fold cross validation. The accuracies for the first and second proposed methods were 65.5% and 63.09%, respectively. The obtained accuracies were then compared with some of the existing results and it was seen that obtained scores were better than the other results.Entities:
Keywords: Convolutional neural networks; Deep learning; Lung disease detection; Time-frequency images
Year: 2019 PMID: 31915523 PMCID: PMC6928168 DOI: 10.1007/s13755-019-0091-3
Source DB: PubMed Journal: Health Inf Sci Syst ISSN: 2047-2501
Fig. 1The proposed deep feature extraction and SVM classification methodology for lung sound classification
Fig. 2The proposed transfer learning methodology for lung sound classification
Cycle info for an audio file of ICBHI 2017 database
| Cycles | Start time | End time | Crackles | Wheezes |
|---|---|---|---|---|
| 1 | 0.804 | 3.256 | 0 | 0 |
| 2 | 3.256 | 5.566 | 0 | 0 |
| 3 | 5.566 | 7.851 | 0 | 1 |
| 4 | 7.851 | 10.054 | 0 | 1 |
| 5 | 10.054 | 12.066 | 1 | 0 |
| 6 | 12.066 | 14.47 | 1 | 0 |
| 7 | 14.47 | 16.696 | 1 | 1 |
| 8 | 16.696 | 18.887 | 1 | 1 |
| 9 | 18.887 | 19.792 | 1 | 1 |
The total number of ICBHI 2017 dataset cycles
| Dataset | Total |
|---|---|
| Number of cycles with crackles | 1864 |
| Number of cycles with wheezes | 886 |
| Number of cycles with both | 506 |
| Number of normal cycles | 3642 |
| Number total of cycles | 6898 |
Fig. 3Confusion matrix for lung sound classification
Fig. 4ROC curve for lung sound classification
The classification accuracies of the methods using ICBHI 2017 Database
| Authors | Methodology | Accuracy (%) |
|---|---|---|
| Jakovljević et al. [ | MFCC, Hidden Markov model | 39.56 |
| Chambres et al. [ | Low level feature, decision tree | 49.62 |
| Serbes et al. [ | STFT + Wavelet, SVM classifier | 57.88 |
| The first proposed method | Deep Feature with CNN model, and SVM classifier | |
| The second proposed method | Transfer learning with CNN Model, and softmax classifier | 63.09 |
The best score is given in bold
The classification accuracies together with AlexNet and ResNet-50 CNN models
| CNN models | Deep feature + SVM (Acc %) | Transfer learning +Softmax (Acc %) |
|---|---|---|
| AlexNet | 60.5 | 61.23 |
| ResNet-50 | 59.10 | 60.05 |
| VGG-16 | 63.09 |
The best score is given in bold
The other evaluation criterias for the proposed method
| Sensitivity | Specificity | False alarm rate | Precision | ||
|---|---|---|---|---|---|
| Crackles label | 0.60 | 0.81 | 0.19 | 0.58 | 0.59 |
| Crackles + wheezes label | 0.31 | 0.44 | 0.36 | ||
| Normal label | 0.60 | 0.40 | |||
| Wheezes label | 0.43 | 0.93 | 0.07 | 0.55 | 0.48 |
| Average values | 0.53 | 0.83 | 0.17 | 0.57 | 0.55 |
The best scores are given in bold