| Literature DB >> 33426615 |
Himadri Mukherjee1, K C Santosh2, Priyanka Sreerama3, Ankita Dhar1, Sk Md Obaidullah4, Kaushik Roy1, Mufti Mahmud5.
Abstract
Significant changes have been made on audio-based technologies over years in several different fields. Healthcare is no exception. One of such avenues is health screening based on respiratory sounds. In this paper, we developed a tool to detect respiratory sounds that come from respiratory infection carrying patients. Linear Predictive Cepstral Coefficient (LPCC)-based features were used to characterize such audio clips. With Multilayer Perceptron (MLP)-based classifier, in our experiment, we achieved the highest possible accuracy of 99.22% that was tested on a publicly available respiratory sounds dataset (ICBHI17) (Rocha et al. Physiol. Meas. 40(3):035,001 20) of size 6800+ clips. In addition to other popular machine learning classifiers, our results outperformed common works that exist in the literature.Entities:
Keywords: Healthcare; Lung health; Respiratory infection; Respiratory sound
Mesh:
Year: 2021 PMID: 33426615 PMCID: PMC7797201 DOI: 10.1007/s10916-020-01681-9
Source DB: PubMed Journal: J Med Syst ISSN: 0148-5598 Impact factor: 4.460
Fig. 1200 audio clips (original): healthy class (left) and non-healthy class (right)
Respiratory sound database [20]
| Clip type | Number of clips |
|---|---|
| Healthy | 3642 |
| Non-healthy | 3256 |
Fig. 2200 audio clips (as in Fig. 1) after framing: healthy class (left) and non-healthy class (right)
Fig. 3Representation of 200 audio clips (as in Fig. 1) after windowing: healthy class (left) and non-healthy class (right)
Fig. 4Representation of 30 dimensional features for the audio clips: healthy class (left); non-healthy class (right)
Performance of different feature dimensions using MLP
| Feature dim. | Accuracy(%) |
|---|---|
| 10 | 93.91 |
| 20 | 90.01 |
| 30 | 99.07 |
| 40 | 89.19 |
| 50 | 98.78 |
Inter-class confusions for the 30 dimensional features (Best result) using MLP
| Healthy | Non-healthy | |
|---|---|---|
| Healthy | 3611 | 31 |
| Non-healthy | 33 | 3223 |
Performance for different momentum values on 30 dimensional features with learning rate of 0.3
| Momentum | Accuracy(%) |
|---|---|
| 0.1 | 99.14 |
| 0.2 | 99.07 |
| 0.3 | 99.04 |
| 0.4 | 99.07 |
| 0.5 | 99.12 |
Inter-class confusions for momentum value of 0.1 on 30 dimensional features
| Healthy | Non-healthy | |
|---|---|---|
| Healthy | 3607 | 35 |
| Non-healthy | 24 | 3232 |
Performance for different learning rates with momentum of 0.2
| Learning rate | Accuracy(%) |
|---|---|
| 0.1 | 99.03 |
| 0.2 | 99.13 |
| 0.3 | 99.07 |
| 0.4 | 99.06 |
| 0.5 | 99.22 |
| 0.6 | 99.13 |
Interclass confusions for learning rate of 0.5 and momentum of 0.2 on 30 dimensional features
| Healthy | non-healthy | |
|---|---|---|
| Healthy | 3615 | 27 |
| non-healthy | 27 | 3229 |
Performance metrics for default scenario, best results after tuning momentum value and best result after tuning learning rate
| Metrics | Default | Best momentum | Best learning rate |
|---|---|---|---|
| Sensitivity | 0.9915 | 0.9904 | 0.9917 |
| Specificity | 0.9899 | 0.9926 | 0.9926 |
| Precision | 0.9909 | 0.9834 | 0.9917 |
| False positive rate | 0.0101 | 0.0074 | 0.0074 |
| False negative rate | 0.0085 | 0.0096 | 0.0083 |
| Accuracy(%) | 99.07 | 99.14 | 99.22 |
| F1 score | 0.9912 | 0.9919 | 0.9917 |
| AUC | 0.9994 | 0.9995 | 0.9993 |
Fig. 5ROC curves: a default settings, b best momentum value (0.1), and c best learning rate (0.5, overall highest result)
Performance of different classifiers on the 30 dimensional features
| Classifier | Accuracy(%) |
|---|---|
| BayesNet | 98.26 |
| Naïve Bayes | 88.98 |
| SVM | 98.59 |
| RBF Network | 95.82 |
| LibLINEAR | 98.59 |
| Simple Logistic | 98.70 |
| Decision Table | 98.62 |
| RNN | 93.82 |
| Multilayer Perceptron | 99.22 |
Comparison with reported works
| Work | Accuracy(%) |
|---|---|
| Kok et al. [ | 87.10 |
| Chambers et al. [ | 85.00 |
| Proposed technique | 99.22 |