| Literature DB >> 30861052 |
Renard Xaviero Adhi Pramono1, Syed Anas Imtiaz1, Esther Rodriguez-Villegas1.
Abstract
Chronic Respiratory Diseases (CRDs), such as Asthma and Chronic Obstructive Pulmonary Disease (COPD), are leading causes of deaths worldwide. Although both Asthma and COPD are not curable, they can be managed by close monitoring of symptoms to prevent worsening of the condition. One key symptom that needs to be monitored is the occurrence of wheezing sounds during breathing since its early identification could prevent serious exacerbations. Since wheezing can happen randomly without warning, a long-term monitoring system with automatic wheeze detection could be extremely helpful to manage these respiratory diseases. This study evaluates the discriminatory ability of different types of feature used in previous related studies, with a total size of 105 individual features, for automatic identification of wheezing sound during breathing. A linear classifier is used to determine the best features for classification by evaluating several performance metrics, including ranksum statistical test, area under the sensitivity--specificity curve (AUC), F1 score, Matthews Correlation Coefficient (MCC), and relative computation time. Tonality index attained the highest effect size, at 87.95%, and was found to be the feature with the lowest p-value when ranksum significance test was performed. Third MFCC coefficient achieved the highest AUC and average optimum F1 score at 0.8919 and 82.67% respectively, while the highest average optimum MCC was obtained by the first coefficient of a 6th order LPC. The best possible combination of two and three features for wheeze detection is also studied. The study concludes with an analysis of the different trade-offs between accuracy, reliability, and computation requirements of the different features since these will be highly useful for researchers when designing algorithms for automatic wheeze identification.Entities:
Mesh:
Year: 2019 PMID: 30861052 PMCID: PMC6414007 DOI: 10.1371/journal.pone.0213659
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of features extracted for evaluation.
| Features Extracted | Size |
|---|---|
| Averaged PSD | 32 |
| Wavelet transform | 20 |
| MFCC | 13 |
| LPC coefficients | 8 |
| Percentile frequency ratio | 4 |
| Entropy-based | 4 |
| Power ratio | 1 |
| ASE flux | 1 |
| Tonality index | 1 |
| Mean crossing irregularity | 1 |
| Other time and spectral features | 20 |
Fig 1Block diagram for preprocessing steps.
Fig 2Flowchart for classification of wheeze and normal respiratory sounds using single feature.
Fig 3Flowchart for classification of wheeze and normal respiratory sounds using combination of features.
Fig 4ROC of features with the highest AUC.
AUC comparison of best performing features on all dataset using linear classifier.
| Feature Name | AUC |
|---|---|
| MFCC-3 | 0.8919 |
| LPC coef-1 | 0.8708 |
| Tonality index | 0.8659 |
| LPC coef-2 | 0.8557 |
| Averaged PSD-17 | 0.8423 |
| Spectral irregularity | 0.8354 |
| WT coef STD-1 | 0.8327 |
| WT coef energy-1 | 0.8323 |
| Averaged PSD-16 | 0.8300 |
| LPC coef-3 | 0.8211 |
| Spectral brightness | 0.8153 |
| 95% freq roll-off | 0.8147 |
Comparison of optimum ROC points of features with the highest AUC.
| Feature Name | SE | SP | Dist. to (0,1) | Threshold |
|---|---|---|---|---|
| MFCC-3 | 0.8386 | 0.8119 | 0.2479 | > 0.0520 |
| LPC coefficient-1 | 0.8206 | 0.8416 | 0.2393 | > −0.1760 |
| Tonality index | 0.7848 | 0.8515 | 0.2615 | < 0.7600 |
| LPC coefficient-2 | 0.7623 | 0.8020 | 0.3094 | < −0.1300 |
| Averaged PSD-17 | 0.7265 | 0.8416 | 0.3161 | < −0.9880 |
| Spectral irregularity | 0.7578 | 0.7822 | 0.3257 | > −0.7020 |
| WT coef STD-1 | 0.7578 | 0.7822 | 0.3257 | < −0.7230 |
| WT coef energy-1 | 0.7668 | 0.7673 | 0.3294 | < −0.9510 |
| Averaged PSD-16 | 0.7713 | 0.7723 | 0.3227 | < −0.9800 |
| LPC coef-3 | 0.6906 | 0.8119 | 0.3621 | > 0.6660 |
| Spectral brightness | 0.7803 | 0.7673 | 0.3200 | < −0.9120 |
| 95% freq roll-off | 0.7668 | 0.7475 | 0.3437 | < −0.2860 |
Classification result using optimum F1 score threshold on the Test-Set.
| Feature Name | SE | SP | PPV | NPV | Threshold | F1 Score |
|---|---|---|---|---|---|---|
| MFCC-3 | 0.9097 ± 0.0850 | 0.6811 ± 0.1114 | 0.7628 ± 0.0478 | 0.8834 ± 0.0888 | > −0.1409 ± 0.0969 | 0.8267 ± 0.0452 |
| LPC coef-1 | 0.8027 ± 0.0924 | 0.8128 ± 0.1413 | 0.8359 ± 0.0881 | 0.7993 ± 0.0844 | > −0.1990 ± 0.1163 | 0.8122 ± 0.0594 |
| Tonality index | 0.7657 ± 0.0915 | 0.8366 ± 0.0914 | 0.8477 ± 0.0676 | 0.7677 ± 0.0870 | < 0.7556 ± 0.0363 | 0.7976 ± 0.0431 |
| Spectral irregularity | 0.8214 ± 0.0776 | 0.6174 ± 0.1198 | 0.7022 ± 0.0881 | 0.7644 ± 0.0894 | > −0.7907 ± 0.0624 | 0.7495 ± 0.0381 |
| LPC coef-2 | 0.8084 ± 0.1573 | 0.6414 ± 0.1923 | 0.7290 ± 0.0997 | 0.7846 ± 0.1236 | < 0.0525 ± 0.2180 | 0.7491 ± 0.0729 |
| WT coef mean-5 | 0.9327 ± 0.0608 | 0.3897 ± 0.1487 | 0.6291 ± 0.0609 | 0.8423 ± 0.1291 | > −0.9154 ± 0.0339 | 0.7485 ± 0.0458 |
| WT coef STD-5 | 0.9200 ± 0.0815 | 0.3910 ± 0.1593 | 0.6274 ± 0.0646 | 0.8227 ± 0.1450 | > −0.8945 ± 0.0490 | 0.7416 ± 0.0503 |
| WT coef energy-5 | 0.9145 ± 0.0956 | 0.4027 ± 0.1599 | 0.6318 ± 0.0678 | 0.8189 ± 0.1457 | > −0.9890 ± 0.0278 | 0.7409 ± 0.0587 |
| 90% freq roll-off | 0.8657 ± 0.1080 | 0.4887 ± 0.1896 | 0.6583 ± 0.0768 | 0.8008 ± 0.1553 | < 0.1024 ± 0.2834 | 0.7401 ± 0.0492 |
| Spectral brightness | 0.8202 ± 0.1205 | 0.5657 ± 0.2256 | 0.6903 ± 0.0998 | 0.7715 ± 0.1269 | < −0.6789 ± 0.3939 | 0.7369 ± 0.0533 |
| 95% freq roll-off | 0.8377 ± 0.1270 | 0.5312 ± 0.2068 | 0.6745 ± 0.0863 | 0.7864 ± 0.1458 | < −0.0306 ± 0.2682 | 0.7355 ± 0.0507 |
| WT coef STD-4 | 0.8124 ± 0.1169 | 0.5415 ± 0.2130 | 0.6746 ± 0.0975 | 0.7324 ± 0.1492 | > −0.6403 ± 0.1480 | 0.7245 ± 0.0513 |
Classification result using optimum MCC threshold on the Test-Set.
| Feature Name | SE | SP | PPV | NPV | Threshold | MCC |
|---|---|---|---|---|---|---|
| LPC coef-1 | 0.7378 ± 0.0965 | 0.8828 ± 0.0844 | 0.8825 ± 0.0672 | 0.7602 ± 0.0778 | > −0.0918 ± 0.0902 | 0.6312 ± 0.0929 |
| Tonality Index | 0.6971 ± 0.0892 | 0.9143 ± 0.0545 | 0.9083 ± 0.0426 | 0.7358 ± 0.0780 | < 0.7031 ± 0.0391 | 0.6269 ± 0.0727 |
| MFCC-3 | 0.8630 ± 0.1223 | 0.7119 ± 0.1294 | 0.7777 ± 0.0529 | 0.8482 ± 0.0981 | > −0.0818 ± 0.1238 | 0.5989 ± 0.0995 |
| 0.5695 ± 0.0830 | 0.9474 ± 0.0468 | 0.9292 ± 0.0572 | 0.6681 ± 0.0786 | > 0.7388 ± 0.0633 | 0.5548 ± 0.0715 | |
| 0.5141 ± 0.0773 | 0.9730 ± 0.0290 | 0.9572 ± 0.0425 | 0.6465 ± 0.0794 | > 0.5459 ± 0.0435 | 0.5414 ± 0.0772 | |
| LPC coef-2 | 0.6664 ± 0.1345 | 0.8477 ± 0.1697 | 0.8563 ± 0.1161 | 0.7095 ± 0.0934 | < −0.2752 ± 0.1849 | 0.5383 ± 0.1254 |
| 0.5217 ± 0.0829 | 0.9606 ± 0.0422 | 0.9397 ± 0.0532 | 0.6475 ± 0.0791 | > 0.4959 ± 0.0664 | 0.5312 ± 0.0861 | |
| 0.4807 ± 0.0773 | 0.9802 ± 0.0260 | 0.9657 ± 0.0432 | 0.6332 ± 0.0771 | > 0.2774 ± 0.0349 | 0.5244 ± 0.0816 | |
| LPC coef-3 | 0.6284 ± 0.1366 | 0.8401 ± 0.1792 | 0.8443 ± 0.1213 | 0.6813 ± 0.0977 | > 0.6948 ± 0.1897 | 0.4949 ± 0.1445 |
| Entropy ratio | 0.8863 ± 0.1488 | 0.3328 ± 0.3790 | 0.6313 ± 0.1467 | 0.7264 ± 0.0949 | < −0.2803 ± 0.8446 | 0.4946 ± 0.0945 |
| Spectral irregularity | 0.6689 ± 0.1220 | 0.8011 ± 0.1084 | 0.8002 ± 0.0863 | 0.6949 ± 0.0844 | > −0.6162 ± 0.1354 | 0.4816 ± 0.0605 |
| Spectral brightness | 0.6755 ± 0.1720 | 0.7539 ± 0.1748 | 0.7809 ± 0.0996 | 0.6942 ± 0.0997 | < −0.9088 ± 0.0768 | 0.4491 ± 0.1207 |
Fig 5Box Plot of (a) LPC coef-2, (b) LPC coef-3, (c) Spectral spread, (d) LPC coef-1, (e) ASE flux, (f) MFCC-3, (g) f25/f90, (h) Averaged PSD-9, (i) f50/f90, (j) LPC 6th order error, (k) WT coef mean-2, and (l) f25/f75 which represent features with best Distance Between Median (DBM) and Overall Visible Spread (OVS) ratios.
Features with highest ratio of DBM and OVS.
| Feature Name | DBM/OVS Ratio |
|---|---|
| LPC coef-2 | 0.6314 |
| LPC coef-3 | 0.6311 |
| Spectral spread | 0.5610 |
| LPC coef-1 | 0.5607 |
| ASE Flux | 0.5520 |
| MFCC-3 | 0.5244 |
| 0.5155 | |
| Averaged PSD-9 | 0.4940 |
| 0.4736 | |
| LPC 6th order error | 0.4692 |
| WT coef mean-2 | 0.4670 |
| 0.4599 |
Features with highest effect size using ranksum test.
| Feature Name | Ranksum | p-value | Effect Size |
|---|---|---|---|
| Tonality Index | 891 | 2.9902E-06 | 0.8795 |
| LPC coef-2 | 861 | 3.7751E-05 | 0.8348 |
| 963 | 5.1789E-05 | 0.8289 | |
| LPC coef-1 | 963 | 5.1789E-05 | 0.8289 |
| Spectral irregularity | 962 | 5.6003E-05 | 0.8274 |
| 961 | 6.0541E-05 | 0.8259 | |
| MFCC-3 | 959 | 7.0681E-05 | 0.8229 |
| LPC coef-3 | 951 | 1.2964E-04 | 0.8110 |
| Averaged PSD-16 | 841 | 1.7423E-04 | 0.8051 |
| Averaged PSD-17 | 835 | 2.6887E-04 | 0.7961 |
| Spectral flatness | 830 | 3.8260E-04 | 0.7887 |
| Averaged PSD-18 | 824 | 5.7813E-04 | 0.7798 |
Features with fastest average computation time in each domain.
| Domain | Feature | Time (ms) |
|---|---|---|
| Time | RMS | 0.0102 |
| Spectral | Tonality index | 0.3850 |
| Cepstral | MFCC | 1.4987 |
| Wavelet | WT coef energy | 1.5847 |
F1 score of the best pairs of features for wheeze and normal sound classification using logistic regression on the Test-Set.
| First Feature | Second Feature | F1 Score |
|---|---|---|
| Tonality index | Spectral slope | 0.8319 |
| MFCC-3 | MFCC-1 | 0.8165 |
| MFCC-3 | Spectral slope | 0.8154 |
| Tonality index | MFCC-1 | 0.8152 |
| MFCC-3 | RMS | 0.8133 |
| MFCC-3 | Averaged PSD-2 | 0.8112 |
| MFCC-3 | Entropy mean | 0.8108 |
| Entropy difference | Spectral Irregularity | 0.8099 |
| MFCC-3 | Spectral Irregularity | 0.8089 |
| MFCC-10 | Spectral Irregularity | 0.8080 |
| MFCC-3 | Entropy ratio | 0.8078 |
| MFCC-3 | Spectral skewness | 0.8049 |
| Tonality index | - | 0.7863 |
| MFCC-3 | - | 0.7786 |
MCC of the best pairs of features for wheeze and normal sound classification using logistic regression on the Test-Set.
| First Feature | Second Feature | MCC |
|---|---|---|
| Tonality index | Spectral slope | 0.7161 |
| Tonality index | MFCC-1 | 0.7154 |
| Tonality index | Entropy ratio | 0.7142 |
| Spectral slope | 0.7053 | |
| MFCC-1 | MFCC-2 | 0.6981 |
| Entropy difference | Spectral Irregularity | 0.6933 |
| Tonality index | MFCC-8 | 0.6849 |
| Spectral crest factor | Spectral Irregularity | 0.6849 |
| Spectral slope | MFCC-2 | 0.6823 |
| Tonality index | Averaged PSD-7 | 0.6822 |
| Tonality index | Renyi entropy | 0.6811 |
| Tonality index | MFCC-11 | 0.6788 |
| Tonality index | - | 0.5790 |
| - | 0.5420 |
F1 score of the best three features vectors for wheeze and normal sound classification using logistic regression on the Test-Set.
| First Feature | Second Feature | Third Feature | F1 Score |
|---|---|---|---|
| MFCC-3 | Tonality index | ZCR | 0.8718 |
| MFCC-3 | Tonality index | RMS | 0.8596 |
| MFCC-3 | WT coef STD-4 | WT coef mean-4 | 0.8595 |
| MFCC-3 | Tonality index | Spectral slope | 0.8533 |
| MFCC-3 | Tonality index | Entropy ratio | 0.8533 |
| MFCC-3 | Entropy ratio | Spectral Irregularity | 0.8485 |
| Tonality index | Spectral slope | Spectral kurtosis | 0.8479 |
| Tonality index | Spectral slope | Spectral flatness | 0.8462 |
| MFCC-3 | Entropy ratio | Entropy mean | 0.8458 |
| Tonality index | STD | Spectral slope | 0.8451 |
| Tonality index | Spectral crest factor | Spectral slope | 0.8447 |
| MFCC-3 | WT coef mean ratio-4 | ZCR | 0.8438 |
MCC of the best three features vectors for wheeze and normal sound classification using logistic regression on the Test-Set.
| First Feature | Second Feature | Third Feature | MCC |
|---|---|---|---|
| Tonality index | Spectral slope | Averaged PSD-17 | 0.7267 |
| Tonality index | Spectral slope | WT coef mean ratio-1 | 0.7246 |
| Tonality index | MFCC-3 | ZCR | 0.7201 |
| Tonality index | Spectral slope | Spectral crest factor | 0.7199 |
| Tonality index | Spectral slope | MFCC-5 | 0.7190 |
| Tonality index | Spectral slope | Spectral flatness | 0.7169 |
| Tonality index | Spectral slope | Averaged PSD-18 | 0.7154 |
| Tonality index | Spectral slope | STD | 0.7154 |
| MFCC-10 | Spectral slope | 0.7126 | |
| Tonality index | MFCC-1 | Entropy ratio | 0.7120 |
| WT coef energy-5 | MFCC-3 | Spectral irregularity | 0.7120 |
| Tonality index | Spectral slope | Averaged PSD-11 | 0.7115 |
Fig 6Effect of feature vector length on F1 Score of LRM.
Fig 7Effect of feature vector length on MCC of LRM.