| Literature DB >> 35161664 |
Sakib Mahmud1, Nabil Ibtehaz1, Amith Khandakar1, Anas M Tahir1, Tawsifur Rahman1, Khandaker Reajul Islam1, Md Shafayet Hossain2, M Sohel Rahman3, Farayi Musharavati4, Mohamed Arselene Ayari5,6, Mohammad Tariqul Islam2, Muhammad E H Chowdhury1.
Abstract
Cardiovascular diseases are the most common causes of death around the world. To detect and treat heart-related diseases, continuous blood pressure (BP) monitoring along with many other parameters are required. Several invasive and non-invasive methods have been developed for this purpose. Most existing methods used in hospitals for continuous monitoring of BP are invasive. On the contrary, cuff-based BP monitoring methods, which can predict systolic blood pressure (SBP) and diastolic blood pressure (DBP), cannot be used for continuous monitoring. Several studies attempted to predict BP from non-invasively collectible signals such as photoplethysmograms (PPG) and electrocardiograms (ECG), which can be used for continuous monitoring. In this study, we explored the applicability of autoencoders in predicting BP from PPG and ECG signals. The investigation was carried out on 12,000 instances of 942 patients of the MIMIC-II dataset, and it was found that a very shallow, one-dimensional autoencoder can extract the relevant features to predict the SBP and DBP with state-of-the-art performance on a very large dataset. An independent test set from a portion of the MIMIC-II dataset provided a mean absolute error (MAE) of 2.333 and 0.713 for SBP and DBP, respectively. On an external dataset of 40 subjects, the model trained on the MIMIC-II dataset provided an MAE of 2.728 and 1.166 for SBP and DBP, respectively. For both the cases, the results met British Hypertension Society (BHS) Grade A and surpassed the studies from the current literature.Entities:
Keywords: arterial blood pressure; autoencoder; diastolic blood pressure; electrocardiogram; feature extraction; photoplethysmogram; systolic blood pressure
Mesh:
Year: 2022 PMID: 35161664 PMCID: PMC8840244 DOI: 10.3390/s22030919
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Overview of the datasets (after pre-processing).
| Datasets | BP Parameters | Minimum | Maximum | Mean | Standard Deviation |
|---|---|---|---|---|---|
| UCI Dataset | SBP | 80.026 | 189.984 | 132.609 | 21.703 |
| DBP | 50.000 | 119.927 | 63.705 | 9.978 | |
| MAP | 57.941 | 149.062 | 87.228 | 12.737 | |
| BCG Dataset | SBP | 80.313 | 186.641 | 124.535 | 15.237 |
| DBP | 43.899 | 96.829 | 65.011 | 9.180 | |
| MAP | 62.975 | 128.391 | 86.878 | 10.046 |
Figure 1Flowchart representing the data pre-processing pipeline for the UCI Dataset. The pipeline for the Ballistocardiogram (BCG) Dataset is almost identical except the ABP signals were denormalized first by multiplying with the normalizing factor of 100.
Figure 2Snapshot of five signals (photoplethysmography (PPG), velocity of PPG (VPG), acceleration of PPG (APG), electrocardiogram (ECG), and arterial blood pressure (ABP)) from a segment of the UCI dataset after pre-processing.
Figure 3Histogram (a,b) and box plot (c,d) of systolic blood pressure (SBP), diastolic blood pressure (DBP), and mean arterial pressure (MAP) of signals in the training set before (a,c) and after (b,d) data cleansing, respectively.
Figure 4Complete pipeline for blood pressure (BP) prediction using a two-stage machine learning approach (dash means an iterative process).
Description of training and testing sets for Experiment 1.
| No. of Channels | Channels | Target | Total Samples in the Train Set | Total Samples in the Test Set |
|---|---|---|---|---|
| 1 | PPG | ABP | 147,116 | 53,043 |
| 2 | PPG, ECG | ABP | 147,116 | 53,043 |
| 3 | PPG, VPG, APG | ABP | 147,116 | 53,043 |
| 4 | PPG, VPG, APG, ECG | ABP | 147,116 | 53,043 |
Description of training and testing sets for Experiment 2 (Method 1).
| No. of Channels | Channels | Target | Total Samples in the Train Set from UCI | Total Samples in the Test Set from BCG |
|---|---|---|---|---|
| 1 | PPG | ABP | 200,159 | 1872 |
| 2 | PPG, ECG | ABP | 200,159 | 1872 |
| 3 | PPG, VPG, APG | ABP | 200,159 | 1872 |
| 4 | PPG, VPG, APG, ECG | ABP | 200,159 | 1872 |
Description of training and testing sets for Experiment 2 (Method 2).
| No. of Channels | Channels | Target | Total Samples in the Train Set | Total Samples in the Test Set |
|---|---|---|---|---|
| 1 | PPG | ABP | 1498 | 374 |
| 2 | PPG, ECG | ABP | 1498 | 374 |
| 3 | PPG, VPG, APG | ABP | 1498 | 374 |
| 4 | PPG, VPG, APG, ECG | ABP | 1498 | 374 |
Mean absolute error (MAE) of BP prediction for variable encoder depth.
| Fixed Parameters | Encoder Levels | MAE | |
|---|---|---|---|
| SBP | DBP | ||
| Encoder Type: U-Net | 1 | 2.333 | 0.713 |
| Encoder Width: 128 | 2 | 3.169 | 1.099 |
| Kernel Size: 3 | 3 | 3.763 | 1.243 |
| No. of Channels: 4 | 4 | 4.416 | 1.419 |
| No. of Extracted Feature: 1024 | |||
| Regressor: MLP | |||
Figure 5Heatmap depicting MAE for SBP (left) and DBP (right) prediction while varying encoder width and number of extracted features. Here, the color scale varies from red (high performance) to green (low performance).
MAE of BP prediction for variable channels.
| Fixed Parameters | No. of Channels | MAE | |
|---|---|---|---|
| SBP | DBP | ||
| Encoder Type: U-Net | 1 | 4.971 | 1.361 |
| Encoder Depth: 1 | 2 | 2.513 | 0.825 |
| Encoder Width: 128 | 3 | 2.739 | 0.960 |
| Kernel Size: 3 | 4 | 2.333 | 0.713 |
| No. of Extracted Feature: 1024 | |||
| Regressor: MLP | |||
MAE of BP prediction for variable kernel or filter size.
| Fixed Parameters | Kernel Size | MAE | |
|---|---|---|---|
| SBP | DBP | ||
| Encoder Type: U-Net | 1 | 2.387 | 0.876 |
| Encoder Depth: 1 | 3 | 2.333 | 0.713 |
| Encoder Width: 128 | 5 | 2.503 | 0.949 |
| No. of Channels: 4 | 7 | 2.900 | 0.888 |
| No. of Extracted Feature: 1024 | 9 | 3.421 | 1.568 |
| Regressor: MLP | 11 | 4.544 | 1.388 |
Figure 6Architecture of the shallow U-Net model for feature extraction.
MAE of systolic blood pressure (SBP) and diastolic blood pressure (DBP) for different machine learning (ML) techniques in Experiment 1.
| Fixed Parameters | Regressor Algorithm | MAE for SBP | MAE for DBP |
|---|---|---|---|
| Encoder Type: U-Net | MLP | 2.333 | 0.713 |
| GradBoost | 5.837 | 1.418 | |
| SGD | 5.945 | 2.261 | |
| SVM | 5.980 | 2.269 | |
| XGBoost | 6.089 | 1.429 | |
| K-Nearest Neighbor | 6.543 | 1.510 | |
| AdaBoost | 8.584 | 2.234 |
Evaluation of BP prediction in Experiment 1 in terms of British Hypertension Society (BHS) Standard.
| Cumulative Error Percentage | ||||
|---|---|---|---|---|
|
≤ |
≤ |
≤ | ||
| Our Results | SBP | 92.02% | 99.18% | 99.85% |
| DBP | 99.01% | 99.91% | 100.0% | |
| BHS Metric | Grade A | 60% | 85% | 95% |
| Grade B | 50% | 75% | 90% | |
| Grade C | 40% | 65% | 85% | |
Evaluation of BP prediction in Experiment 1 in terms of Association for the Advancement of Medical Instrumentation (AAMI) standard.
|
|
| Number of Subjects | ||
|---|---|---|---|---|
| Our Results | SBP | 0.09 | 0.94 | 942 |
| DBP | −0.019 | 2.876 | ||
| AAMI Standard |
≤ |
≤ |
≥ | |
Figure 7(a) Regression plots for DBP (left) and SBP (right) predictions vs. respective ground truths; (b) Bland–Altman plots for DBP (left) and SBP (right) predictions.
Comparison of past studies based on MAE performance.
| Study | Year Published | Dataset | Input Signals | Method | MAE (mmHg) | |
|---|---|---|---|---|---|---|
| SBP | DBP | |||||
| Kurylayak et al. [ | 2013 | MIMIC, 15,000 Pulsations | PPG | ANN | 3.80 | 2.21 |
| Wang et al. [ | 2018 | MIMIC, 72 Subjects | PPG | ANN | 4.02 | 2.27 |
| Slapničar et al. [ | 2019 | MIMIC, 510 Subjects | PPG | CNN | 9.43 | 6.88 |
| Miao et al. [ | 2019 | 1711 ICU and 30 Arrythmia Patients | ECG | CNN + LSTM | 7.10 | 4.61 |
| Esmaelpoor et al. [ | 2020 | MIMIC-II (200 Subjects) | PPG | CNN + LSTM | 1.91 | 0.67 |
| Ibtehaz et al. [ | 2020 | MIMIC-II (942 Subjects) | PPG | CNN + CNN | 5.73 | 3.45 |
| Li et al. [ | 2020 | MIMIC-II (3000 Records from UCI Repository) | PPG, ECG | LSTM | 4.63 | 3.15 |
| Hsu et al. [ | 2020 | MIMIC-II (9000 Records from UCI Repository) | PPG, ECG | ANN | 3.21 | 2.23 |
| Athaya et al. [ | 2021 | MIMIC-II (100 Subjects) | PPG | CNN | 3.68 | 1.97 |
| Harfiya et al. [ | 2021 | MIMIC-II (5289 Records from UCI Repository) | PPG | LSTM | 4.05 | 2.41 |
| Baker et al. [ | 2021 | MIMIC-III | PPG, ECG | CNN + LSTM | 4.41 | 2.91 |
| Rong et al. [ | 2021 | MIMIC-II (UCI Repository) | PPG | CNN + LSTM | 5.59 | 3.36 |
| Sagirova et al. [ | 2021 | 512 Patients | ECG, PPG | |||
| Qin et al. [ | 2021 | MIMIC-II (1227 Records from UCI Repository) | PPG | VAE | 7.95 | 4.11 |
| This Study | 2021 | MIMIC-II (942 Subjects–12,000 Recordings from UCI Repository) | PPG, ECG | CNN + ANN | 2.333 | 0.713 |
| MIMIC-II + BCG (942 + 40 = 982 Subjects) | 2.728 | 1.166 | ||||
| AAMI Standard |
≤ | |||||
Note: It is important to mention that Hsu et al. [23] in their paper reported that they used 9000 subjects’ data for BP prediction from the UCI repository, but it was 9000 out of 12,000 instances or recordings of data collected from the MIMIC-II dataset. These are the data from 942 patients as reported by Kachuee et al. [14], the originator of this dataset. A similar occurrence happened for the case of Harfiya et al. [25] where they reported 5289 signal instances from the UCI repository as 5289 patients. In comparison, this study fully utilized all 12,000 instances.
Comparison of past studies based on their performance of BHS metrics.
| Study | SBP (%) in BHS Metrics | DBP (%) in BHS Metrics | ||||||
|---|---|---|---|---|---|---|---|---|
| Grade A | Grade B | Grade C | Attained Grade | Grade A | Grade B | Grade C | Attained Grade | |
| Esmaelpoor et al. [ | 74 | 94 | 98 | A | 93 | 99 | 100 | A |
| Ibtehaz et al. [ | 71 | 85 | 91 | B | 83 | 92 | 96 | A |
| Li et al. [ | 60 | 80 | 89 | B | 77 | 96 | 100 | A |
| Hsu et al. [ | 81 | 96 | 98 | A | 90 | 98 | 100 | A |
| Athaya et al. [ | 76 | 94 | 99 | A | 94 | 99 | 100 | A |
| Harfiya et al. [ | 71 | 94 | 99 | A | 91 | 99 | 100 | A |
| Miao et al. [ | 50 | 76 | 90 | B | 66 | 90 | 97 | A |
| Baker et al. [ | 68 | 90 | 97 | A | 83 | 96 | 99 | A |
| Rong et al. [ | 54 | 87 | 94 | B | 83 | 95 | 98 | A |
| Qin et al. [ | 59 | 86 | 95 | B | 82 | 96 | 99 | A |
| This Study | 92 | 99 | 99 | A | 99 | 100 | A | |
*%: The percentage of predicted signals falling within 5 (Grade A), 10 (Grade B), and 15 (Grade C) mmHg of their respective ground truth signals, respectively.