| Literature DB >> 33202613 |
Syed Zohaib Hassan Naqvi1, Mohammad Ahmad Choudhry1.
Abstract
Chronic obstructive pulmonary disease (COPD) and pneumonia are two of the few fatal lung diseases which share common adventitious lung sounds. Diagnosing the disease from lung sound analysis to design a noninvasive technique for telemedicine is a challenging task. A novel framework is presented to perform a diagnosis of COPD and Pneumonia via application of the signal processing and machine learning approach. This model will help the pulmonologist to accurately detect disease A and B. COPD, normal and pneumonia lung sound (LS) data from the ICBHI respiratory database is used in this research. The performance analysis is evidence of the improved performance of the quadratic discriminate classifier with an accuracy of 99.70% on selected fused features after experimentation. The fusion of time domain, cepstral, and spectral features are employed. Feature selection for fusion is performed through the back-elimination method whereas empirical mode decomposition (EMD) and discrete wavelet transform (DWT)-based techniques are used to denoise and segment the pulmonic signal. Class imbalance is catered with the implementation of the adaptive synthetic (ADASYN) sampling technique.Entities:
Keywords: chronic obstructive pulmonary disease; discrete wavelet transform; empirical mode decomposition; feature extraction; lung sounds; pneumonia; quadratic discriminant analysis
Mesh:
Year: 2020 PMID: 33202613 PMCID: PMC7697014 DOI: 10.3390/s20226512
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1WHO statistics about worldwide deaths due to respiratory issues in 2000 and 2016.
Figure 2WHO statistics about worldwide death due to different respiratory issues in 2016 [3].
Figure 3The proposed method for diagnosis of COPD, pneumonia, and healthy LS.
Figure 4ICBHI respiratory database comprised of LS from various pulmonary pathologies.
Recording Equipment Used to develop Lung Sound Database.
| Sr. No | Equipment |
|---|---|
| 1. | Welch Allyn Meditron Master Elite Plus Stethoscope Model 5079-400 |
| 2. | 3M Littmann 3200 |
| 3. | 3M Littmann Classic II SE |
Demographic Information of Focused LS Database.
| Description | Detail |
|---|---|
| Total number of LS signal sets (COPD, pneumonia, healthy) | 703 |
| The sampling frequency of recording equipment | 44.1 kHz |
| Bits/ sample | 16 |
| Average recording duration | 21.5s |
| Number of participants | 57 (50 adults & 7 children) |
| Gender | 40 males, 17 females |
| Age | Adults: 69.44 ± 8.24 Years, |
Figure 5Time-domain graphical representation of an LS (raw) signal from the COPD, pneumonia, and healthy class.
Figure 6Frequency domain graphical representation of an LS (raw) signal from the COPD, pneumonia, and healthy class.
Figure 7Graphical representation of IMF1-IMF10 after EMD analysis which corresponds to LS signal of COPD class.
Figure 8Graphical representation of IMF1-IMF10 after EMD analysis which corresponds to an LS signal of pneumonia class.
Figure 9Graphical representation of IMF1-IMF10 after EMD analysis which corresponds to an LS signal of healthy class.
Figure 10Time-domain graphical representation of reconstructed LS signal from IMF-2, IMF-3, and IMF-4 of COPD, pneumonia, and healthy class after EMD analysis.
Figure 11Frequency domain graphical representation of reconstructed LS signal from IMF-2, IMF-3, and IMF-4 of COPD, pneumonia, and healthy class after EMD analysis.
Figure 12Graphical representation of the Coiflets 5 wavelet.
Figure 13Time-domain graphical representation of preprocessed LS signal after denoising from DWT.
Figure 14Frequency domain graphical representation of preprocessed LS signal after denoising from DWT.
List of extracted features for LS analysis.
| Time Domain | Spectral (S) Domain | Cepstral Features (26) | Texture Features (59) |
|---|---|---|---|
| Mean, St. Deviation, Skewness, Kurtosis, Peak to Peak, Root Mean Square, Crest Factor, Shape Factor, Impulse Factor, Margin Factor, Energy, Peak to RMS, Root Sum of Squares, Shannon Energy, Log Energy, Mean Abs Deviation, Median Abs Deviation, Average Frequency, Jitter | S.Mean, S.St. Deviation, S.Skewness, S.Kurtosis, S.Centriod, S.Flux, S.Rolloff, S.Flateness, S.Crest, S.Decrease, S.Slope, S.Spread | MFCC, GCC | LBP |
MFCC: Mel Frequency Cepstral Coefficient, GCC: Gammatone Cepstral Coefficients, LBP: Local Binary Patterns. (Appendix A lists the statistics of all features in each class).
Accuracy of different classifiers on various feature groups.
| FEATURE GROUPS | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CLASSIFIERS | TD | FD | CD | Texture | TD+FD | TD+CD | FD+CD | CD+Texture | TD+Texture | TD+FD+CD | TD+FD+Texture | FD+CD+Texture | TD+FD+CD+Texture | |
| DT | 87.30% | 86.80% | 94.50% | 92.50% | 89.50% | 95.40% | 94.60% | 94.60% | 93.70% | 95.00% | 94.20% | 94.60% | 95.00% | |
| LD | 76.40% | 76.50% | 91.10% | 73.90% | 82.70% | 93.40% | 93.60% | 92.00% | - | 94.20% | - | 93.70% | - | |
| QD | - | 84.70% | 98.60% | - | - | - | - | - | - | - | - | - | - | |
| LR | - | - | - | - | - | - | - | - | - | - | - | - | - | |
| NB-G | 59.30% | 68.00% | 82.40% | - | 69.20% | 73.40% | 79.80% | - | - | 75.10% | - | - | - | |
| NB-K | 66.80% | 71.10% | 88.30% | 41.70% | 73.00% | 85.50% | 88.90% | 47.10% | 43.50% | 85.90% | 44.70% | 48.00% | 48.50% | |
| SVM-L | 79.30% | 80.10% | 94.90% | 83.90% | 85.30% | 96.00% | 95.30% | 95.50% | 91.30% | 96.00% | 91.90% | 96.10% | 95.90% | |
| SVM-Q | 91.00% | 88.70% | 97.90% | 92.10% | 94.40% | 98.10% | 98.30% | 97.80% | 95.60% | 98.10% | 96.00% | 97.90% | 98.00% | |
| SVM-C | 93.60% | 93.50% | 98.50% | 58.00% | 97.00% | 98.70% | 98.50% | 96.00% | 88.00% | 98.60% | 94.20% | 98.60% | 98.60% | |
| SVM-FG | 94.60% | 91.30% | 96.20% | 91.40% | 97.30% | 97.20% | 95.80% | 99.70% | 97.20% | 97.20% | 98.10% | 99.40% | 99.20% | |
| SVM-MG | 85.60% | 82.90% | 98.40% | 79.60% | 91.40% | 98.60% | 98.80% | 97.10% | 90.40% | 98.70% | 92.20% | 97.80% | 97.80% | |
| SVM-CG | 70.00% | 75.40% | 90.80% | 61.40% | 79.70% | 94.00% | 93.10% | 85.70% | 73.50% | 93.20% | 78.70% | 89.90% | 90.70% | |
| KNN-F | 92.70% | 91.60% | 97.90% | 93.80% | 95.00% | 97.50% | 97.50% | 98.00% | 94.30% | 97.00% | 95.50% | 97.80% | 97.00% | |
| KNN-M | 87.50% | 84.90% | 94.10% | 86.70% | 89.40% | 94.00% | 92.60% | 94.30% | 90.00% | 92.90% | 89.80% | 93.20% | 93.00% | |
| KNN-Cor | 65.80% | 71.40% | 80.60% | 71.30% | 74.80% | 84.10% | 81.00% | 81.80% | 70.30% | 84.20% | 76.30% | 81.30% | 87.70% | |
| KNN-Cos | 88.10% | 86.00% | 94.90% | 87.50% | 90.20% | 94.60% | 94.60% | 95.20% | 90.90% | 94.40% | 91.00% | 94.30% | 94.80% | |
| KNN-C | 87.90% | 85.00% | 93.80% | 86.60% | 88.80% | 93.60% | 92.40% | 94.30% | 89.90% | 92.00% | 89.50% | 92.90% | 92.40% | |
| KNN-W | 89.60% | 88.40% | 9.60% | 90.10% | 91.10% | 94.20% | 93.30% | 94.50% | 91.40% | 93.30% | 91.00% | 93.70% | 93.20% | |
| Eboost | 84.80% | 83% | 96.40% | 89.30% | 90.50% | 96.60% | 96.60% | 96.60% | 94.70% | 96.30% | 94.50% | 96.70% | 96.70% | |
| EBT | 93.80% | 91.50% | 97.10% | 94.90% | 95.70% | 97.30% | 96.90% | 97.90% | 96.30% | 97.20% | 97.00% | 97.50% | 97.60% | |
| ESD | 71.70% | 72.30% | 88.70% | 71.40% | 79.50% | 92.30% | 91.60% | 90.00% | 81.40% | 93.10% | 85.30% | 93.30% | 94.00% | |
| ESKNN | 68.70% | 77.30% | 97.50% | 92.00% | 74.90% | 69.30% | 83.40% | 97.70% | 69.00% | 74.60% | 75.40% | 84.20% | 74.70% | |
| ERT | 77.00% | 78.80% | 91.20% | 84.40% | 84.20% | 92.40% | 92.90% | 92.40% | 88.40% | 92.80% | 88.30% | 92.80% | 93.00% | |
Figure 15System accuracy of the classification techniques in comparison to different numbers of features.
Figure 16Scatter plot illustrating the minimum correlation between log energy (LE) and GFCC-5 feature of all classes.
Figure 17Scatter plot illustrating the minimum correlation between MFCC-10 and spectral decrease (SDec) features of all classes.
List of Selected Features Extracted and Performance evaluation.
| Selected Features from Time, Frequency, and Cepstral Domain | Classifier | Performance Outcome |
|---|---|---|
| Quadratic discriminant | Overall Accuracy 99.70% |
Mathematical Description of the selected features for classification of LS signal .
| Feature | Mathematical Representation |
|---|---|
| Standard Deviation (SD) |
|
| Peak to Peak (PP) | |
| Log Energy (LE) |
|
| Spectral Standard Deviation (SSD) |
|
| Spectral Skewness (SSkw) |
|
| Spectral Kurtosis(SK) |
|
| Spectral Flux (SF) |
|
| Spectral Roll Off (SRO) | If |
| Spectral Decrease (SDec) |
|
| Mel frequency cepstral coefficient (MFCC) | In MFCC, (i) Frame blocking or windowing to get 50 to 60ms. (ii) Performing a discrete Fourier transform (iii) computing logarithm of the signal. (iv) Deforming the frequencies on a Mel scale, followed by applying the discrete cosine transform (DCT). Mel scale is calculated as follows: |
| Gammatone Frequency Cepstral Coefficient (GFCC) | In GCC, (i) Firstly, the signal is passed through gammatone filter bank which consists of 64 Channels. (ii) Take the absolute value at each channel and reduce it to 100 Hz as a way of time windowing. (iii) Take cubic root on the time-frequency representation. (iv) Deforming the frequencies on an equivalent rectangular bandwidth (ERB) scale Apply DCT to derive cepstral features. ERB scale is calculated as follows. |
Figure 18Accuracy of the proposed diagnosis method with an optimized number of features on different classifiers.
Figure 19System confusion matrix (QD classifier) before denoising.
Figure 20System confusion matrix (QD classifier) after denoising.
Cross-validation of the proposed method with different folds.
| Evaluation | Classes | ACC | TPR | FNR | PPV | FDR |
|---|---|---|---|---|---|---|
| (5, 10, 15, and 20) Fold Cross-Validation | COPD | 99.6 | >99 | <1 | 99 | 1 |
| Normal | >99 | <1 | 100 | 0 | ||
| Pneumonia | >99 | <1 | >99 | <1 | ||
| 20% Hold Out Validation | COPD | 99.7 | 99 | 1 | 100 | 0 |
| Normal | 100 | 0 | 100 | 0 | ||
| Pneumonia | 100 | 0 | 99 | 1 | ||
| 25% Hold Out Validation | COPD | 99.8 | 100 | 0 | 99 | 1 |
| Normal | 99 | 1 | 100 | 0 | ||
| Pneumonia | 100 | 0 | 100 | 0 |
ACC: Mean accuracy, TPR: True Positive, FNR: False Negative Rate, PPV: Positive Predictive Value, FDR: False Discovery Rate.
Comparative analysis of the purposed technique with similar lung pathology methods.
| Class | Number of Features | Accuracy (%) |
|---|---|---|
| Pneumonia [ | 13 | 87.87 |
| Pneumonia [ | 18 | 90.06 |
| Pneumonia [ | 7 | 99.70 |
| COPD [ | 25 | 85.10 |
| COPD [ | 27 | 95.10 |
| COPD, pneumonia (This method) | 25 | 99.70 |
Performance analysis of the proposed diagnosis methodology for COPD and pneumonia identification with current techniques on lung pathologies.
| Classes | Method | Results (%) |
|---|---|---|
| Crackles, Crackles+ Wheeze, | STFT, WT, SVM | ACC: 49.86 |
| Normal, Pneumonia [ | SA | ACC: 91.98 SEN:92.06 SPE: 90.68 |
| Pneumonia and Asthma [ | NN | SEN: 89, SPE:100 |
| Normal, Pneumonia [ | WT, LR | SEN: 94 |
| Normal, Pneumonia [ | EMD, KNN | ACC: 99.7 |
| Normal, COPD and Pneumonia [ | SA | - |
| Normal Asthma and COPD [ | ANN | ACC:60.33 |
| Normal, Asthma, Bronchitis [ | EMD, KNN | ACC: 99.3 |
| COPD [ | KT | ACC: 85.1 |
| COPD [ | KG, ML | ACC:95.1 |
| COPD, Healthy, Pneumonia, | CNN | SEN: 98.8 |
| Crackles, Crackles+ Wheeze, | CNN | ACC i: 65.5 |
| Normal, COPD, Pneumonia | EMD, WT, QD | ACC: 99.8% |
Knowledge graph: KG, Short-time Fourier Transform: STFT, Linear regression: LR, Statistical analysis: SA, Neural network: NN, Knowledge transfer: KT, ACC: Mean accuracy. SEN: Sensitivity, SPE: Specificity.
Feature Statistics of All Classes.
| Features | COPD | Pneumonia | Normal |
|---|---|---|---|
| Mean |
|
|
|
| Standard Deviation |
|
|
|
| Skewness |
|
|
|
| Kurtosis |
|
|
|
| Peak_to_Peak |
|
|
|
| Root_Mean_Square |
|
|
|
| Crest_Factor |
|
|
|
| Shape_Factor |
|
|
|
| Impulse_Factor |
|
|
|
| Margin_Factor |
|
|
|
| Energy |
|
|
|
| Peak_to_Root_Mean_Square |
|
|
|
| Root_Sum_of_Squares |
|
|
|
| Shannon_Energy |
|
|
|
| Log_Energy |
|
|
|
| Mean_Absolute_Deviation |
|
|
|
| Median_Absolute_Deviation |
|
|
|
| Average_Frequency |
|
|
|
| Jitter |
|
|
|
| Spectral Mean |
|
|
|
| Spectral Std_Deviation |
|
|
|
| Spectral Skewness |
|
|
|
| Spectral Kurtosis |
|
|
|
| Spectral Centriod |
|
|
|
| Spectral Flux |
|
|
|
| Spectral Rolloff |
|
|
|
| Spectral Flateness |
|
|
|
| Spectral Crest |
|
|
|
| Spectral Decrease |
|
|
|
| Spectral Slope |
|
|
|
| Spectral Spread |
|
|
|
| MFCC_1 |
|
|
|
| MFCC_2 |
|
|
|
| MFCC_3 |
|
|
|
| MFCC_4 |
|
|
|
| MFCC_5 |
|
|
|
| MFCC_6 |
|
|
|
| MFCC_7 |
|
|
|
| MFCC_8 |
|
|
|
| MFCC_9 |
|
|
|
| MFCC_10 |
|
|
|
| MFCC_11 |
|
|
|
| MFCC_12 |
|
|
|
| MFCC_13 |
|
|
|
| GFCC_1 |
|
|
|
| GFCC_2 |
|
|
|
| GFCC_3 |
|
|
|
| GFCC_4 |
|
|
|
| GFCC_5 |
|
|
|
| GFCC_6 |
|
|
|
| GFCC_7 |
|
|
|
| GFCC_8 |
|
|
|
| GFCC_9 |
|
|
|
| GFCC_10 |
|
|
|
| GFCC_11 |
|
|
|
| GFCC_12 |
|
|
|
| GFCC_13 |
|
|
|