| Literature DB >> 32640710 |
Sumair Aziz1, Muhammad Umar Khan1, Majed Alhaisoni2, Tallha Akram3, Muhammad Altaf3.
Abstract
Congenital heart disease (CHD) is a heart disorder associated with the devastating indications that result in increased mortality, increased morbidity, increased healthcare expenditure, and decreased quality of life. Ventricular Septal Defects (VSDs) and Arterial Septal Defects (ASDs) are the most common types of CHD. CHDs can be controlled before reaching a serious phase with an early diagnosis. The phonocardiogram (PCG) or heart sound auscultation is a simple and non-invasive technique that may reveal obvious variations of different CHDs. Diagnosis based on heart sounds is difficult and requires a high level of medical training and skills due to human hearing limitations and the non-stationary nature of PCGs. An automated computer-aided system may boost the diagnostic objectivity and consistency of PCG signals in the detection of CHDs. The objective of this research was to assess the effects of various pattern recognition modalities for the design of an automated system that effectively differentiates normal, ASD, and VSD categories using short term PCG time series. The proposed model in this study adopts three-stage processing: pre-processing, feature extraction, and classification. Empirical mode decomposition (EMD) was used to denoise the raw PCG signals acquired from subjects. One-dimensional local ternary patterns (1D-LTPs) and Mel-frequency cepstral coefficients (MFCCs) were extracted from the denoised PCG signal for precise representation of data from different classes. In the final stage, the fused feature vector of 1D-LTPs and MFCCs was fed to the support vector machine (SVM) classifier using 10-fold cross-validation. The PCG signals were acquired from the subjects admitted to local hospitals and classified by applying various experiments. The proposed methodology achieves a mean accuracy of 95.24% in classifying ASD, VSD, and normal subjects. The proposed model can be put into practice and serve as a second opinion for cardiologists by providing more objective and faster interpretations of PCG signals.Entities:
Keywords: computer aided diagnosis; congenital heart disease; empirical mode decomposition; feature extraction; machine learning; mel-frequency cepstral coefficients; phonocardiogram; statistical analysis; support vector machines
Mesh:
Year: 2020 PMID: 32640710 PMCID: PMC7374414 DOI: 10.3390/s20133790
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Comparison with existing literature.
| Ref | Year | Dataset | Classes | Features | Classifier | Results |
|---|---|---|---|---|---|---|
| [ | 2016 | Physionet Challenge 2016 [ | Normal(2488), Abnormal(665) | Time-frequency, | LogitBoost, Random Forest | Acc: 84.48% |
| [ | 2016 | Physionet Challenge 2016 [ | Normal(2575), Abnormal(665) | Dynamic time warping | SVM | Acc: 82.4% |
| [ | 2016 | Physionet Challenge 2016 | Normal(2575), Abnormal(665) | 124 Time-frequency features | Adaboost, CNN | Acc: 89% |
| [ | 2016 | Self-collected | Normal(132), Abnormal seven classes(131) | Arash-Band | SVM | Acc: 87.45% |
| [ | 2017 | Self-collected | Small VSD(60), Large VSD(60) | Statistical, DWT features | Multilayer Perceptron (MLP) | Acc: 96.6% |
| [ | 2017 | Self-collected | Normal, VSD | (STFT), MFCC | KNN | Acc: 93.2% |
| [ | 2018 | PhysioNet Computing in Cardiology Challenge | Normal(2575), Abnormal(665) | GFCC | Weighted SVM | Sen: 90.3% Spec: 89% |
| [ | 2018 | UoC-murmur database, PhysioNet-2016 | Normal(336), CHD(130), Normal/Abnormal(2435) | Mel-Spectrogram, MFFC and sub-band envelopes | CNN | Acc: 81.5% Sen: 84.5% |
| [ | 2018 | PhysioNet Computing in Cardiology Challenge-2016 | Normal(50), Abnormal(50) | Cepstrum Analysis | SVM | Acc: 95% |
| [ | 2018 | Self-collected | Normal(40), Abnormal(58) | CBFE, FEUAP, FSDA, DDE | KNN | Acc: 84.39% |
| [ | 2019 | Self-collected | Normal(175), Abnormal(108) | MFCC, normalized | SVM | Acc: 92.6% |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 1Sketch of the proposed cardiac disorder classification system.
Description of PCG dataset.
| Status | No. of Signals | No. of Subjects | Male | Female |
|---|---|---|---|---|
| Normal | 140 | 28 | 17 | 11 |
| ASD | 85 | 17 | 12 | 5 |
| VSD | 55 | 11 | 7 | 4 |
Figure 2PCG signals collected from normal, arterial septal defect (ASD), and ventricular septal defect (VSD) subjects.
Figure 3Intrinsic mode functions (IMFs) extracted from the PCG signal of a normal subject.
Figure 4IMFs extracted from the PCG signal of an ASD subject.
Figure 5IMFs extracted from the PCG signal of a VSD subject.
Figure 6Preprocessed PCG signal of normal, ASD, and VSD subjects.
Figure 7One-dimensional local ternary pattern (1D-LTP) feature extraction steps.
Figure 8The process of mel-frequency cepstral coefficient (MFCC) feature extraction.
Parameters of selected classifiers.
| Classifier | Kernel Function | Kernel Scale | Box Constraint Level | Multiclass Method | Standardize Data |
|---|---|---|---|---|---|
|
| Linear | Automatic | 1 | One-vs-one | True |
|
| Quadratic | Automatic | 1 | One-vs-one | True |
|
| Cubic | Automatic | 1 | One-vs-one | True |
|
| Gaussian | 44 | 1 | One-vs-one | True |
Figure 9Scatter plots of 1D-LTP features.
Performance comparison of SVM on different feature sets for binary experiments. Bold font indicates the best result obtained against each feature set.
| Feature Set | Classifier | Accuracy (%) | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | Error (%) |
|---|---|---|---|---|---|---|---|
|
| SVM-L | 89.88 | 76.19 | 94.44 | 82.05 | 92.25 | 10.12 |
| SVM-Q | 89.29 | 80.95 | 92.06 | 77.27 | 93.55 | 10.71 | |
|
|
|
|
|
|
|
| |
| SVM-G | 75.6 | 7.14 | 98.41 | 60 | 76.07 | 24.4 | |
|
| SVM-L | 94.05 | 88.1 | 96.03 | 88.1 | 96.03 | 5.95 |
|
|
|
|
|
|
|
| |
| SVM-C | 91.07 | 76.19 | 96.03 | 86.49 | 92.37 | 8.93 | |
| SVM-G | 86.31 | 47.62 | 99.21 | 95.24 | 85.03 | 13.69 | |
|
| SVM-L | 94.05 | 90.48 | 95.24 | 86.36 | 96.77 | 5.95 |
| SVM-Q | 94.05 | 88.1 | 96.03 | 88.1 | 96.03 | 5.95 | |
|
|
|
|
|
|
|
| |
| SVM-G | 93.45 | 88.1 | 95.24 | 86.05 | 96 | 6.55 |
Confusion matrix for detection (normal vs. abnormal) experiments.
| Predicted Class | ||
|---|---|---|
| Actual Class | Normal | Abnormal |
|
| 90% | 10% |
|
| 2% | 98% |
Performance comparison of SVM using different feature sets for multiclass experiments. Bold font indicates the best result obtained against each feature set.
| Feature Set | Classifier | Accuracy(%) | Sensitivity(%) | Specificity(%) | PPV(%) | NPV(%) | Error(%) |
|---|---|---|---|---|---|---|---|
|
| SVM-L | 83.93 | 92.86 | 85.71 | 68.42 | 97.3 | 16.07 |
| SVM-Q | 86.9 | 90.48 | 90.48 | 76 | 96.61 | 13.1 | |
|
|
|
|
|
|
|
| |
| SVM-G | 83.33 | 97.62 | 81.75 | 64.06 | 99.04 | 16.67 | |
|
| SVM-L | 94.64 | 97.62 | 93.65 | 83.67 | 99.16 | 5.36 |
| SVM-Q | 94.05 | 90.48 | 95.24 | 86.36 | 96.77 | 5.95 | |
|
|
|
|
|
|
|
| |
| SVM-G | 93.45 | 92.86 | 93.65 | 82.98 | 97.52 | 6.55 | |
|
| SVM-L | 93.45 | 97.62 | 92.06 | 80.39 | 99.15 | 6.55 |
| SVM-Q | 94.43 | 95.05 | 94.41 | 85.06 | 98.28 | 5.57 | |
|
|
|
|
|
|
|
| |
| SVM-G | 93.45 | 100 | 91.27 | 79.25 | 100 | 6.55 |
Confusion matrix for multiclass experiments.
| Predicted Class | |||
|---|---|---|---|
| Actual Class | Normal | ASD | VSD |
|
| 90% | 10% | 0% |
|
| 6% | 94% | 0% |
|
| 0% | 0% | 100% |
ANOVA test on two selected classifiers based on the proposed method.
| Variance Source | SS | df | MSE | F-Statistics | |
|---|---|---|---|---|---|
|
| 1.8482 | 1 | 1.84815 | 0.63 | 0.4721 |
|
| 11.7503 | 4 | 2.93758 | - | - |
|
| 13.5985 | 5 | - | - | - |
Figure 10Box-plot of accuracy values for selected classifiers (1:SVM-C, 2:SVM-Q).
Figure 11The means of both classifiers belong to a single group and are not significantly different.
Figure 12Confidence interval simulation results.