| Literature DB >> 28552969 |
Renard Xaviero Adhi Pramono1, Stuart Bowyer1, Esther Rodriguez-Villegas1.
Abstract
BACKGROUND: Automatic detection or classification of adventitious sounds is useful to assist physicians in diagnosing or monitoring diseases such as asthma, Chronic Obstructive Pulmonary Disease (COPD), and pneumonia. While computerised respiratory sound analysis, specifically for the detection or classification of adventitious sounds, has recently been the focus of an increasing number of studies, a standardised approach and comparison has not been well established.Entities:
Mesh:
Year: 2017 PMID: 28552969 PMCID: PMC5446130 DOI: 10.1371/journal.pone.0177926
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Study selection flow diagram.
Normal breath sounds.
| Breath Sounds | Location | Range | Pitch | Quality | Timing (I:E ratio) | Pause |
|---|---|---|---|---|---|---|
| Vesicular | Most of lung fields | 100—1,000 Hz Energy drop at 200 Hz | Low | Low-pass filtered noise like Soft Rustling sound | During inspiration and early expiration (2:1 ratio) | Pause between different breath cycle |
| Broncho-Vesicular | Between scapulae on posterior chest and center part of anterior chest | Intermediate between Vesicular and Bronchial | Intermediate | Intermediate intensity | During both inspiration and expiration (1:1 ratio) | N/M |
| Bronchial | Large airways on chest near second and third intercostal space | Similar to Tracheal | High | Loud Hollow | During both inspiration and expiration (1:2 ratio) | Short pause between inspiration and expiration phase |
| Tracheal | Suprasternal notch on trachea | 100—5,000 Hz Energy drop at 800 Hz | High | Harsh Very loud | During both inspiration and expiration (1:1 ratio) | Distinct pause between inspiration and expiration phase |
| Mouth | Mouth | 200—2,000 Hz | N/M | White-noise like Silent when normal | N/M | N/M |
aInformation from [8–10, 12, 13]
bInformation from [8, 9]
cInformation from [8, 13]
dInformation from [8, 11]
eInformation from [8]
Abbreviation N/M: Not Mentioned in [8–13]
Types of adventitious sounds and its characteristics.
| Types | Continuity | Duration | Timing | Pitch | Quality | Cause | Disease Associated |
|---|---|---|---|---|---|---|---|
| Wheeze | Continuous | > 80 | Inspiratory, Mostly Expiratory, Biphasic | High (> 400 | Sibilant, Musical | Airway narrowing, airflow limitation | Asthma, COPD, Foreign body |
| Rhonchi | Continuous | > 80 | Inspiratory, Mostly Expiratory, Biphasic | Low (< 200 | Sibilant, Musical | Secretion in bronchial, muchosal thickening | Bronchitis, COPD |
| Stridor | Continuous | > 250 | Mostly Inspiratory, Expiratory, Both | High (> 500 | Sibilant, Musical | Turbulent airflow in larynx or lower bronchial tree (Upper airway obstruction) | Epiglottitis, foreign body, croup, laryngeal oedema |
| Fine Crackle | Discontinuous | ± 5 ms | Inspiratory (late) | High (650 Hz) | Non-musical, Explosive | Explosive opening of small airways | Pneumonia, Congestive heart failure, Lung fibrosis |
| Coarse Crackle | Discontinuous | ± 15 ms | Mostly Inspiratory (early), Expiratory, Both | Low (350 Hz) | Non-musical, Explosive | Air bubble in large bronchi or bronchiectatic segments | Chronic bronchitis, bronchiectasis, COPD |
| Pleural Rub | Discontinuous | > 15 | Biphasic | Low (< 350 | Non-Musical, Rhythmic | Pleural membrane rubbing against each other | Inflammation of lung membrane, lung tumour |
| Squawk | Continuous | ± 200 ms | Inspiratory | Low (200—300 Hz) | Short Musical and non-musical | Oscillation of peripheral airways | Hypersensitivity pneumonia, pneumonia |
| Gasp | Continuous | > 250 | Inspiratory | High | Whoop | Gasping for breath | Whooping cough |
aInformation from [10, 17, 19, 20, 23]
bInformation from [8, 10, 20, 21]
cInformation from [10, 14, 20, 23]
dInformation from [8, 10, 18–20]
eInformation from [8, 15, 16, 19–21, 24]
fInformation from [8, 10, 20, 22]
Sound and analysis type.
| Ref | Year | Sound Type | Approach | Level | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| W | R | C | S | E | U | SC | De | Cl | Se | Ev | Re | ||
| [ | 2016 | • | o | o | • | • | • | • | |||||
| [ | 2016 | • | • | • | • | ||||||||
| [ | 2016 | • | • | • | • | • | |||||||
| [ | 2016 | • | • | • | |||||||||
| [ | 2016 | • | • | • | • | • | |||||||
| [ | 2016 | o | • | • | • | ||||||||
| [ | 2016 | • | • | • | • | • | |||||||
| [ | 2016 | • | • | • | • | ||||||||
| [ | 2016 | • | • | • | • | • | |||||||
| [ | 2016 | • | • | • | • | ||||||||
| [ | 2015 | • | o | • | • | • | • | ||||||
| [ | 2015 | • | • | • | • | • | |||||||
| [ | 2015 | • | • | • | • | • | |||||||
| [ | 2015 | • | • | • | • | • | • | ||||||
| [ | 2015 | • | • | • | |||||||||
| [ | 2015 | • | • | • | • | • | • | • | |||||
| [ | 2015 | • | o | • | • | ||||||||
| [ | 2015 | • | o | • | • | • | • | ||||||
| [ | 2015 | • | • | • | • | • | • | ||||||
| [ | 2015 | • | • | • | • | • | |||||||
| [ | 2015 | o | o | • | • | • | • | ||||||
| [ | 2015 | • | • | • | |||||||||
| [ | 2015 | • | • | • | |||||||||
| [ | 2015 | • | • | • | • | • | • | • | |||||
| [ | 2015 | • | • | • | |||||||||
| [ | 2015 | • | • | • | |||||||||
| [ | 2014 | • | • | • | • | ||||||||
| [ | 2014 | • | • | • | • | ||||||||
| [ | 2014 | • | • | • | • | • | |||||||
| [ | 2014 | • | • | • | • | ||||||||
| [ | 2014 | • | • | • | |||||||||
| [ | 2014 | • | • | • | • | • | |||||||
| [ | 2014 | • | • | • | • | • | |||||||
| [ | 2014 | • | • | • | |||||||||
| [ | 2014 | • | • | • | • | ||||||||
| [ | 2013 | • | • | • | • | ||||||||
| [ | 2013 | • | • | • | • | • | |||||||
| [ | 2013 | • | • | • | • | • | |||||||
| [ | 2013 | • | • | • | |||||||||
| [ | 2013 | • | • | • | |||||||||
| [ | 2013 | • | • | • | • | ||||||||
| [ | 2012 | • | • | • | • | ||||||||
| [ | 2012 | • | • | • | • | • | • | • | • | ||||
| [ | 2012 | • | • | • | |||||||||
| [ | 2012 | • | • | • | |||||||||
| [ | 2012 | • | • | • | • | ||||||||
| [ | 2012 | • | • | • | • | ||||||||
| [ | 2012 | • | • | • | |||||||||
| [ | 2012 | • | • | • | |||||||||
| [ | 2011 | • | • | • | • | ||||||||
| [ | 2010 | • | • | • | • | • | |||||||
| [ | 2010 | • | • | • | |||||||||
| [ | 2009 | • | • | • | |||||||||
| [ | 2009 | • | • | • | |||||||||
| [ | 2009 | • | • | • | • | • | • | • | |||||
| [ | 2009 | • | • | • | • | ||||||||
| [ | 2009 | • | • | • | • | • | • | • | • | ||||
| [ | 2009 | • | • | • | |||||||||
| [ | 2009 | • | • | • | • | ||||||||
| [ | 2008 | • | • | • | • | • | |||||||
| [ | 2008 | • | • | • | • | ||||||||
| [ | 2008 | • | • | • | |||||||||
| [ | 2007 | • | • | • | |||||||||
| [ | 2007 | o | • | • | • | • | • | ||||||
| [ | 2007 | • | • | • | • | ||||||||
| [ | 2005 | • | • | • | • | ||||||||
| [ | 2005 | • | • | • | |||||||||
| [ | 2005 | • | • | • | |||||||||
| [ | 2005 | • | • | • | |||||||||
| [ | 2004 | • | • | • | • | ||||||||
| [ | 2000 | • | • | • | • | • | • | ||||||
| [ | 1997 | • | • | • | • | ||||||||
| [ | 1997 | • | • | • | • | • | • | ||||||
| [ | 1996 | • | • | • | • | ||||||||
| [ | 1995 | • | • | • | |||||||||
| [ | 1992 | • | • | • | • | • | |||||||
| [ | 1984 | • | • | • | |||||||||
Symbol ‘•’ denotes focus of study in corresponding article
while ‘o’ denotes sound included in study but not as main focus
W: Wheeze, R: Rhonchi, C: Crackle, S: Stridor
E: Egophony, Squawk, or Pleural Rub
U: Unspecified CAS or DAS, SC: Sound Cause
De: Detection, Cl: Classification
Se: Segment, Ev: Event, Re: Recording
Sensor and data source for lung sound analysis.
| Ref | Year | Data Source | # Sensor | Sensor Position | Total Data | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Sensor | Database | Neck | Anterior | Posterior | Mouth | Multiple | ||||
| [ | 2016 | [ | - | 1 | • | 95 recordings | ||||
| [ | 2016 | custom | - | 1 | • | • | • | • | 227 recordings | |
| [ | 2016 | [ | ATS, COPD | 1 | N/M | 112 recordings | ||||
| [ | 2016 | [ | - | 1 | • | 18 volunteers | ||||
| [ | 2016 | [ | - | 5 | • | • | • | 870 events | ||
| [ | 2016 | [ | - | 1 | • | • | • | 20 volunteers | ||
| [ | 2016 | Electronic | - | 1 | • | • | • | 3120 short recordings | ||
| [ | 2016 | - | [ | N/A | 36 recordings | |||||
| [ | 2016 | - | [ | N/A | 30 volunteers | |||||
| [ | 2016 | [ | - | 14 | • | • | 600 events | |||
| [ | 2015 | custom | - | 1 | • | • | • | • | 38 patients | |
| [ | 2015 | Condenser | [ | 1 | • | 20 recordings and additional data | ||||
| [ | 2015 | [ | - | 1 | • | 58 recordings | ||||
| [ | 2015 | N/M | [ | N/M | 45 recordings | |||||
| [ | 2015 | N/M | 41 recordings | |||||||
| [ | 2015 | Piezoelectric | - | 1 | • | • | • | 130 recordings | ||
| [ | 2015 | [ | - | 1 | • | 45 recordings | ||||
| [ | 2015 | [ | - | 1 | • | • | • | 12 volunteers | ||
| [ | 2015 | - | [ | N/A | 28 recordings | |||||
| [ | 2015 | [ | - | 1 | N/M | 24 recordings | ||||
| [ | 2015 | [ | - | 14 | • | • | 40 recordings | |||
| [ | 2015 | - | [ | N/A | 26 recordings | |||||
| [ | 2015 | [ | - | 14 | • | • | 7 volunteers | |||
| [ | 2015 | Piezoelectric | - | 1 | • | • | 230 recordings | |||
| [ | 2015 | [ | [ | 3 | • | • | • | 260 segments | ||
| [ | 2015 | [ | - | 1 | • | • | • | 100 events | ||
| [ | 2014 | - | [ | N/A | 9 recordings | |||||
| [ | 2014 | [ | - | 1 | • | • | 60 volunteers | |||
| [ | 2014 | [ | [ | 1 | • | 339 events | ||||
| [ | 2014 | N/M | 371 recordings | |||||||
| [ | 2014 | - | [ | N/A | 2 recordings | |||||
| [ | 2014 | [ | 1 | • | 30 recordings | |||||
| [ | 2014 | - | [ | N/A | 13 events | |||||
| [ | 2014 | - | [ | N/A | 68 recordings | |||||
| [ | 2014 | - | [ | N/A | 92 events | |||||
| [ | 2013 | - | 7 | • | • | • | • | 60 volunteers | ||
| [ | 2013 | N/M | 6 events | |||||||
| [ | 2013 | [ | - | 1 | • | • | • | • | 40 recordings | |
| [ | 2013 | - | [ | N/A | 68 recordings | |||||
| [ | 2013 | [ | - | 14 | • | • | 26 volunteers | |||
| [ | 2013 | soft | - | 1 | • | 8 volunteers | ||||
| [ | 2012 | [ | - | 1 | • | • | 28 recordings | |||
| [ | 2012 | Piezoelectric | - | 1 | • | 126 recordings | ||||
| [ | 2012 | - | [ | N/A | 47 recordings | |||||
| [ | 2012 | N/M | 180 segments | |||||||
| [ | 2012 | - | ACCP | N/A | 10 short recordings | |||||
| [ | 2012 | N/M | 26 recordings | |||||||
| [ | 2012 | N/M | 433 segments | |||||||
| [ | 2012 | [ | [ | 1 | • | 47 recordings | ||||
| [ | 2011 | [ | [ | 1 | • | 585 events | ||||
| [ | 2010 | - | [ | N/A | 4-7 recordings each class | |||||
| [ | 2010 | [ | - | 5 | • | • | • | 21 volunteers | ||
| [ | 2009 | [ | - | 14 | • | • | 7 volunteers | |||
| [ | 2009 | - | [ | N/A | 24 recordings | |||||
| [ | 2009 | Electronic | - | 1 | • | 36 recordings | ||||
| [ | 2009 | - | [ | N/A | 25 recordings | |||||
| [ | 2009 | Condenser | - | 1 | • | • | • | 162 volunteers | ||
| [ | 2009 | - | [ | N/A | 40 events | |||||
| [ | 2009 | - | [ | N/A | 17 recordings | |||||
| [ | 2008 | [ | - | 1 | • | 65 volunteers | ||||
| [ | 2008 | - | [ | N/A | 40 events | |||||
| [ | 2008 | [ | [ | 1 | • | 14 volunters | ||||
| [ | 2007 | ECM | - | 1 | • | 30 volunteers | ||||
| [ | 2007 | - | [ | N/A | 18 recordings | |||||
| [ | 2007 | [ | - | 5 | • | • | • | • | 13 volunteers | |
| [ | 2005 | Electret | - | 2 | • | • | 57 volunteers | |||
| [ | 2005 | [ | - | 1 | • | 16 volunteers | ||||
| [ | 2005 | - | 25 | • | • | 29 volunteers | ||||
| [ | 2005 | N/M | 2 volunteers | |||||||
| [ | 2004 | Piezoelectric | - | 1 | • | 31 volunteers | ||||
| [ | 2000 | LS-60 | [ | 2 | • | • | 2127+321 events | |||
| [ | 1997 | - | ACCP | N/A | 2 recordings | |||||
| [ | 1997 | - | 2 | • | • | 69 volunteers | ||||
| [ | 1996 | N/M | • | 13 volunteers | ||||||
| [ | 1995 | N/M | 710 segments | |||||||
| [ | 1992 | - | 1 | N/M | 9 patients | |||||
| [ | 1984 | - | [ | N/A | 147 events | |||||
★ denotes microphone,
♦ denotes stethoscope,
■ denotes accelerometer
ATS: American Thoracic Society website, COPD: COPD website
ACCP: American College Chest Physician teaching tape
N/A: Not Applicable, N/M: Not Mentioned
Data, features, and methods of analysis.
| Ref | Year | Data Set | Features | Method | Performance | |||
|---|---|---|---|---|---|---|---|---|
| Training | Validation | Test | Total | |||||
| [ | 2016 | 70 Rec, 20 W, 50 N | 25 Rec, 7 W, 18 N | 39 Rec, 10 W, 29 N | 95 Rec | Spectral features (PSD mean, harmonics) | SVM, LRM | 71.4% Se, 88.9% Sp, for SVM on validation set at Rec level |
| [ | 2016 | 5-fold CV | 227 Rec | Denoising autoencoders | SVM | 90% Se, 64% Sp for W Rec level and 90% Se, 44% Sp for C Rec level | ||
| [ | 2016 | N/A | 112 Rec | 112 Rec | Rule-based Seg selection, Power Ratio | Threshold | 90% Se, 90.48% Sp at Rec level | |
| [ | 2016 | N/M | 3036 Seg | MFCC | GMM | 88.1% Se, 99.5% Sp at Seg level | ||
| [ | 2016 | 65% | 10-fold CV | 35% | 870 Ev | Ensemble Empirical Mode Decomposition and Instantaneous Frequency | SVM | 94.2% Se, 96.1% Sp, for SVM on best iteration of test set at Ev level |
| [ | 2016 | 10-fold CV | LOOCV | 400 Ev | Musical features, wavelet-based, teager energy, entropy | LRM | 76 ± 23% Se, 77 ± 22% PPV at Seg level | |
| [ | 2016 | LOOCV | 3120 Rec | MFCC | HMM | Best Acc at Seg level 82.82%, average Acc of 87.7% at Rec level | ||
| [ | 2016 | 219 Ev, 71 N, 39 FC, 39 CC, 35 mono W, 35 poly W | 40 holdout CV | 99 Ev, 31 N, 18 FC, 18 CC, 16 mono W, 16 poly W | 318 Ev | Higher Order Statistics (Cumulants) | GA + k-NN and NB | 94.6% Overall Acc on test set at Ev level |
| [ | 2016 | LOOCV | 72 Ev | LFCC, MFCC, IMFCC, and LPCC | MLP | 97.83% best Overall Acc using MFCC at Ev level | ||
| [ | 2016 | LOOCV | 600 Ev | Energy of High Q-Factor Wavelet coefficients | k-NN, SVM | 95.17% average Acc for SVM at Ev level | ||
| [ | 2015 | LOOCV | 57 Rec | Peak to mean ratio, expected number of false positives | Threshold+SVM | 86% Acc on Rec level | ||
| [ | 2015 | 20 Rec | - | Multiple sets | > 20 Rec | 13 MFCC each with first and second derivatives | k-NN | Performance of 6 different types of test reported as Acc |
| [ | 2015 | 23 Rec, 13 W, 10 N | - | 35 Rec, 19 W, 16 N | 58 Rec | Duration, frequency range, area, power, and slope of spectrum | BPNN | 94.6% Se, 100% Sp at Rec level |
| [ | 2015 | N/A | 45 Rec | 45 Rec | Entropy-based Features | Threshold | 99% Acc Stridor, 70% Acc W, 87% Acc C, 99% Acc N, at Rec level | |
| [ | 2015 | 41 Rec | 41 Rec | Spectral features | GMM | 92.85% Se, 100% Sp at Rec level | ||
| [ | 2015 | LOOCV | 130 Rec | MFCC, correlation score with other auscultation point and other Seg | HMM | Best Acc of 92.26% at Ev level and best Acc of 91% at Rec level | ||
| [ | 2015 | 21 Rec, 5 W, 21 Non-W | 20%-80% Train Validation Set repeated 20 times | Leave-one-out CV | 45 Rec | MFCC, Kurtosis, Entropy | 2 SVM + Threshold | 97.68% Reliability (TPR.TNR) using MFCC at Seg level |
| [ | 2015 | 10-fold CV | 113 Ev | Musical features and spectrogram signature | LRM, RF | 90.9% ± 2% Se, 99.4% ± 1% Sp for RF at Seg level | ||
| [ | 2015 | 70% of data | 15% of data | 15% of data | 28 Rec | Averaged Power Spectrum | ANN | 97.8% Se, 100% Sp on test set at Ev level |
| [ | 2015 | N/A | 24 Rec | 24 Rec | Fractal Dimension, CORSA criterion for Crackle | Threshold | Average Se of 89 ± 10%, PPV of 95 ± 11% at Ev level for different Rec | |
| [ | 2015 | LOOCV | 40 Rec | AR Model | GMM, SVM | 90% best total Acc for GMM on Rec level | ||
| [ | 2015 | LTOCV | 1188 Seg | MFCC, WPT, FT | C-Weighted SVM | 81.5 ± 10% Se, 82.6 ± 7% Sp for MFCC features on Seg level | ||
| [ | 2015 | N/M | 231 Ev | Quartile Frequency Ratios, Mean Crossing Irregularity | SVM, k-NN, NB | 75.78% best Overall Acc for kNN at Ev level | ||
| [ | 2015 | LOOCV | 230 Rec | MFCC | Subject adaptation HMM | 89.4% Se, 80.9% Sp at Ev level and 90.4% Se, 78.3% Sp at Rec level | ||
| [ | 2015 | 10-fold CV | 260 Seg | Audio Spectral Envelope and Tonality Index | SVM | 93% Overall Acc at Seg level | ||
| [ | 2015 | N/A | 100 Ev, 50 C, 50 N | 100 Ev | Mathematical morphology | Threshold | 86% Se, 92% Sp at Ev level | |
| [ | 2014 | N/M | Delay Coordinate | Threshold | 98.39% Acc at Ev level | |||
| [ | 2014 | 5-fold CV | 60 Vol | frequency ratio, average instantaneous frequency, eigenvalues | SVM | Individual Acc reported for all case of one-versus-one and one-versus-all for all features at Rec level | ||
| [ | 2014 | LOOCV | 578 Ev | Instantaneous Kurtosis, Discriminanting Function, Sample Entropy | SVM | 97.7% Mean Acc (Inhale), 98.8% Mean Acc (exhale) at Ev level | ||
| [ | 2014 | 371 Ev | 371 Rec | Centroid, time duration, slope, and area ratio of spectrum | SVM | 88.7% Se, 93.9% Sp at Rec level | ||
| [ | 2014 | LOOCV | 2 Rec | Teager energy, wavelet, fractal dimension, empirical mode decomposition, entropy, and GARCH process | LRM | MCC of 80% at Seg level | ||
| [ | 2014 | 5-fold CV | 120 Ev | Lacunarity, sample entropy, skewness, and kurtosis | SVM, ELM | 86.30% Se, 86.90% Sp for ELM at Ev level | ||
| [ | 2014 | LOOCV | 13 Ev | MFCC | MLP | 100% Acc W, 75% Acc C, 80% Acc N at Ev level | ||
| [ | 2014 | 10-fold CV | 68 Rec | MFCC | SVM, k-NN | 100% Acc N, 100% Acc AOP, 96% Acc PP for kNN at Rec level | ||
| [ | 2014 | 60 Ev | 14 Ev | 18 Ev | 92 Ev | Wavelet packet transform | ANN | 98.89% best average Acc for Symlet-10 wavelet base at Ev level on test set |
| [ | 2013 | 75%-25% Train Validation Set repeated 6 times | 345 Rec | Spectrogram evaluation for W, db5 Wavelet degree of similarity for C | ANN | 80% Se, 67% Sp at Rec level | ||
| [ | 2013 | N/A | 6 Ev | 6 Ev | Time Frequency Analysis and Wavelet Packet Decomposition | Threshold | All Ws detected | |
| [ | 2013 | N/A | 40 Rec | 40 Rec | Time Frequency Analysis | Threshold | 99.2% Se, 72.5% Sp at Ev level | |
| [ | 2013 | 60%-40% Train Validation Set repeated 25 times | 68 Rec | MFCC | SVM | 94.11% Acc N, 92.31% Acc AOP, 88% Accruacy PP, for SVM at Rec level | ||
| [ | 2013 | 2000 Seg, 1000 N, 1000 C | 2000 Seg, 1000 N, 1000 C | 2000 Seg, 1000 N, 1000 C | 6000 Seg | Time Frequency Analysis (Spectrogram), Time Scale Analysis (Wavelet) | SVM, MLP, k-NN | 97.5% Overall Acc rate for SVM using Time Frequency Analysis at Seg level |
| [ | 2013 | N/A | 59 Rec | 59 Rec | Correlation Coefficient | Threshold | 88% Se, 94% Sp at Rec level | |
| [ | 2012 | 10-fold CV | 28 Rec | Cortical Model | SVM | 89.44% Se, 80.50% Sp at Rec level | ||
| [ | 2012 | LOOCV | 126 Rec, 723 Ev | Power, spectral features, and duration distribution | HMM | 88.7% Se, 91.5% Sp at Ev level and 87% Se, 81% Sp at Rec level | ||
| [ | 2012 | N/A | 47 Rec | 47 Rec | Local similarity measure using Mutual Information, Weighted cepstral features | Threshold | High Acc for local similarity measure and separability index of 1 for weighted cepstral | |
| [ | 2012 | N/A | 180 Seg | 180 Seg | fractional Hilbert transform | Threshold | Acc of 90.5% at Seg level | |
| [ | 2012 | N/A | 33 C Ev | 33 Ev | fractional Hilbert transform and correlation coefficient | Threshold | Se 94.28%, PPV 97.05% at Ev level | |
| [ | 2012 | N/A | 26 Rec, 13 N, 13 W | 26 Rec | LPC prediction error ratio | Threshold | 70.9% Se, 98.6% Sp at Ev level | |
| [ | 2012 | N/A | 433 Seg | 433 Seg | Abnormality level | Threshold | 84.5% Acc at Seg level | |
| [ | 2012 | 50%-50% Train Validation Set repeated 100 times | 689 Ev | Multi-scale PCA (Wavelet) | Empirical Classification | 97.3% ± 2.7% Overall Acc for N vs CAS, 98.34% Overall Acc for N vs CAS+DAS at Ev level | ||
| [ | 2011 | LOOCV | 585 Ev | Temporal-Spectral Dominance spectrogram | k-NN | 92.4% ± 2.9% Overall Acc at Ev level | ||
| [ | 2010 | LOOCV | 4-7 Rec Each | MFCC | GMM | 52.5% Overall Acc on validation | ||
| [ | 2010 | N/A | 21 Vol, 393 W Ev | 393 Ev | Continuous Wavelet Transform | Man-Whitney U Test | Significance test for features | |
| [ | 2009 | LOOCV | 492 Seg | Kurtosis, Renyi entropy, frequency power ratio, Mean crossing irregularity | FDA | 93.5% Overall Acc at Seg level | ||
| [ | 2009 | LOOCV | 2807 Seg | Fourier Transform, LPC, Wavelet Transform, MFCC | VQ, GMM, ANN | 94.6% Se, 91.9% Sp for GMM using MFCC at Seg level | ||
| [ | 2009 | 180 Ev | - | 180 Ev | 360 Ev | averaged power spectrum | MLP, GAL, ISNN | Overall Acc of 98% for ISNN at Ev level |
| [ | 2009 | 75%-25% train-test split repeated 200 times | 362 Ev | Lacunarity | Discriminant Analysis | 99.75% maximum mean Acc at Seg level | ||
| [ | 2009 | LOOCV | 1544 Ev | MFCC | HMM | 93.2% Se, 64.8% Sp at Ev level | ||
| [ | 2009 | 40 Ev, 20 W, 20 N | - | 28 Rec, 112 Ev, 40 W, 72 N | 152 Ev | Amplitude and Frequency of largest edge of pre-processed spectrogarm | MLP | 86.1% Se, 82.5% Sp on test set at Ev level |
| [ | 2009 | N/A | 17 Rec | 17 Rec | Entropy-based features | Threshold | 84.4% Se, 80% Sp at Rec level | |
| [ | 2008 | 40 Vol | LOOCV | 25 Vol | 65 Vol | AR Coefficients | k-NN, Minimum Distance-based | 92% Se, 100% Sp using k-NN on test set at Rec level |
| [ | 2008 | N/A | 40 Ev | 40 Ev | Peak selection based on time duration | Threshold | 84% Se, 86% Sp at Ev level | |
| [ | 2008 | N/A | 186 Ev | 186 Ev | Distortion in Histogram of Sample Entropy | Threshold | 97.9% Acc Expiration, 85.3% Acc Inspiration at Ev level | |
| [ | 2007 | N/M | 870 Ev | MFCC | GMM | Acc 94.9% at Seg level | ||
| [ | 2007 | N/A | 18 Rec | 182 C Ev | Fractal Dimension | Threshold | 92.9% Se, 94.4% PPV at Ev level, 93.9% best Acc for classification | |
| [ | 2007 | 3 Vol, 85 W Ev | - | 10 Vol, 337 W Ev | 422 W Ev | Peak selection based on local maxima, coexistence, continuity, grouping | Threshold | Se 95.5 ± 4.8%, Sp 93.7 ± 9.3% at Ev level on test set |
| [ | 2005 | 50%-50% train-test Seg from same Ev split | 57 Vol | AR parameters and Cepstral Coefficients | MLP | 10-20% average misclassification error on test set at Ev level for cepstral features | ||
| [ | 2005 | N/A | 16 Vol | 16 Vol | spectrogram image | Edge Detection | Se and Sp above 89% | |
| [ | 2005 | 912 Seg | 114 Seg | 114 Seg | 1140 Seg | multi-variate AR model | BPNN | 80.7% Se, 84.21% Sp at Seg level on validation set |
| [ | 2005 | 160 Ev, 80 CC, 80 FC | - | 231 Ev, 158 CC, 73 FC | 391 Ev | wavelet network | Discriminant Function | 84% and 70% Acc for FC and CC respectively on test set at Ev level |
| [ | 2004 | N/A | 31 Vol | 31 Vol | energy | Threshold | 100% Se and Sp for a high airflow and 71% Se, 88.2% Sp for low airflow, at Ev level | |
| [ | 2000 | 1253 Ev, 509 Ab, 744 N | repeated 5 times | 1195 Ev, 530 Ab, 665 N | 2448 Ev | averaged power spectrum | BPNN | Best Se 59%, 81% Sp for recorded sound and Se 87%, 95% Sp for CD data at Ev level for Ab vs N respiratory sound classification |
| [ | 1997 | N/A | 2 Rec | 2 Rec | Matched wavelet | Threshold | Detection Acc of 99.8% and classification Acc of almost 100% at Seg level | |
| [ | 1997 | LOOCV | 69 Vol | AR model, crackle parameters | k-NN, multinomial, voting | Overall Acc of 71.07% at Rec level to classify pathology | ||
| [ | 1996 | 50%-50% training-test split | 13 Vol | Wavelet packet decomposition | LVQ (ANN Variant) | 59% Se, 24% PPV for FC, 19% Se, 6% PPV for CC, and 58% Se, 18% PPV for W at Seg level | ||
| [ | 1995 | 242 Seg, 128 W, 114 N | - | 2 test set: 233 Seg, 107 W, 126 N, and 235 Seg, 140 W, 95 N | 710 Seg | Power spectrum | BPNN, RBF, SOM, LVQ | Overall Acc of 93% and 96% on the two sets by using LVQ at Seg level |
| [ | 1992 | N/A | 9 Vol | 9 Vol | Energy envelope, Crackle characteristics | Threshold, Hierarchical clustering | 100% Acc on classifying FC vs CC at Ev level | |
| [ | 1984 | 42 Ev, 6 for each types | - | 105 Ev, 10-15 for each types | 147 Ev | LPC | Clustering (Minimum Distance) | Overall Acc of 95.24% at Ev level |
Rec: Recording, Ev: Event, Seg: Segment
W: Wheeze, FC: Fine Crackle, CC: Coarse Crackle, N: Normal, Ab: Abnormal, Vol: Volunteer
CV: Cross-Validation, Se; Sensitivity, Sp: Specificity, PPV: Positive Predictive Value, Acc: Accuracy
N/A: Not Applicable, N/M: Not Mentioned
Accuracy percentage measure from literature.
| WSD (%) | WED (%) | CSD (%) | WSC (%) | WEC (%) | CEC (%) | |
|---|---|---|---|---|---|---|
| 93.8 [ | 100 [ | 83.5 [ | 93 [ | 95.15 [ | 95 [ | |
| accuracy range | 71.2–97.9 | 79.6–100 | 62.27–99.8 | 90.5–96 | 75.78–100 | 89–98.15 |