Literature DB >> 34394983

Improving the Accuracy in Classification of Blood Pressure from Photoplethysmography Using Continuous Wavelet Transform and Deep Learning.

Jiaze Wu¹, Hao Liang^1,2, Changsong Ding³, Xindi Huang³, Jianhua Huang⁴, Qinghua Peng^1,2.

Abstract

BACKGROUND: Continuous wavelet transform (CWT) based scalogram can be used for photoplethysmography (PPG) signal transformation to classify blood pressure (BP) with deep learning. We aimed to investigate the determinants that can improve the accuracy of BP classification based on PPG and deep learning and establish a better algorithm for the prediction.
METHODS: The dataset from PhysioNet was accessed to extract raw PPG signals for testing and its corresponding BPs as category labels. The BP category of normal or abnormal followed the criteria of the 2017 American College of Cardiology/American Heart Association (ACC/AHA) Hypertension Guidelines. The PPG signals were transformed into 224 ∗ 224 ∗ 3-pixel scalogram via different CWTs and segment units. All of them are fed into different convolutional neural networks (CNN) for training and validation. The receiver-operating characteristic and loss and accuracy curves were used to evaluate and compare the performance of different methods.
RESULTS: Both wavelet type and segment length could affect the accuracy, and Cgau1 wavelet and segment-300 revealed the best performance (accuracy 90%) without obvious overfitting. This method performed better than previously reported MATLAB Morse wavelet transformed scalogram on both of our proposed CNN and CNN-GoogLeNet.
CONCLUSIONS: We have established a new algorithm with high accuracy to predict BP classification from PPG via matching of CWT type and segment length, which is a promising solution for rapid prediction of BP classification from real-time processing of PPG signal on a wearable device.

Entities: Chemical

Year: 2021 PMID： 34394983 PMCID： PMC8360747 DOI： 10.1155/2021/9938584

Source DB: PubMed Journal: Int J Hypertens Impact factor: 2.420

1. Introduction

Elevated blood pressure (BP) has been a potent issue inducing stroke, heart attack, and kidney failure. Though some proven and well-tolerated lifestyle and drug treatment strategies can control BP, it remains the major preventable cause of cardiovascular diseases (CVD) and all-cause death in the world [1]. Accurate BP measurement and recording are essential to grade BP level, ascertain BP-related CVD stratification, and guide the management of hypertension. Cuff-based BP measurement devices have been still widely used in hospital settings to detect abnormal BP. However, in patients treated for hypertension, both “white coat effect” (higher office BPs than out-of-office BPs) and “masked uncontrolled hypertension” (controlled office BPs but uncontrolled BPs in out-of-office settings) are difficult to detect [2]. ABPM and HBPM help doctors follow the risk profiles of the patients' white coat hypertension and masked hypertension counterparts, respectively [3, 4]. Both ABPM and HBPM typically based on multiple measurements of BP provide better methods to predict long-term CVD outcomes than office BP [4]. Office BP measurement in hospital, ABPM, and HBPM often require the technique by pressurizing and releasing an upper arm cuff to detect systolic and diastolic pressure. This old method is inconvenient and difficult to monitor BP persistently due to physiological limitations, for example, periodic cuff inflation and deflation will disturb sleeping to prevent the real BP detection during sleep. Some new methods of cuff-less BP detection and evaluation have been proposed, such as estimation of BP and using photoplethysmography (PPG) and electrocardiogram (ECG) signals. Although the pulse transit time (PTT) or pulse arrival time (PAT) method extracted from PPG and ECG achieved overall acceptable accuracy for BP estimation [5, 6], the algorithm required at least two signals, increasing the operational complexity and cost for wearable devices. PTT must deal with different physiological parameters, so a calibration procedure is required [7]. Synchronization is another issue in using these signals in real-time because when recording two or more signal channels via independent systems, an uncontrollable artificial delay exists as the data acquisition is started manually [8]. The former study even reported that PTT has a negative correlation with systolic BP, which is not reliable enough to become a surrogate marker of systolic BP [9]. It is clinically and practically desirable to apply the PPG signal only for estimating BP. Riaz et al. reported that the simplest linear classifiers produce satisfying results for indicating classes of normal or abnormal BP according to the proposed PPG wave feature [10]. Khalid et al. compared three machine learning algorithms (regression tree, multiple linear regression, and support vector machine) via pulse features to estimate BPs using raw PPG signals and concluded that the regression tree algorithm was the best approach. PPG wave can be transformed into scalograms by fast Fourier transform (FFT) or continuous wavelet transform (CWT). The transformations promote recognition and learning by computer. FFT naturally applying to stationary signals whose frequency contents do not change is unable to tell when these frequency components exist in time [11], but almost all biological signals are nonstationary. CWT maps the time and frequency information simultaneously by the time-frequency plane and helps to investigate what spectral components exist at any given interval of time from biological signals, such as ECG and PPG. Mansouri SR et al. employed a deep convolutional neural network (CNN) to estimate BP through the scalogram representing a CWT transformed from PPG and ECG waves and demonstrated a low root mean square error (RMSE) rate of 3.36 mmHg and high accuracy of 86.3% [12]. Previous studies usually focused on one factor individually, such as signal transform feature extraction, or new deep learning method, but CWT waveform, segment length, and methods are common key factors for accuracy of prediction. CWTs generate distinct scalograms according to different classes of wavelet bases (Shannon, Mexican hat, Morlet, etc.) and segment length, but the recent studies used the same scalogram (MATLAB default/Morse wavelet transform) with a fixed length of PPG segment (5 s) for BP classification by deep learning [12, 13]. Whether CWT type and segment length could influence the accuracy of the classification is still unknown, and the optimal combination based on these factors also need to be established. This research aimed to find out the determinants to improve the accuracy of BP classification using PPG and deep learning and establish a better algorithm for BP prediction.

2. Materials and Methods

2.1. Dataset

The data were from Multiparameter Intelligent Monitoring in Intensive Care-III (MIMIC-III) Waveform Database provided by PhysioNet [14]. In brief, the database contains 67,830 record sets for approximately 30,000 intensive care unit (ICU) patients, and almost all record sets include a waveform record containing signals of continuous arterial blood pressure (ABP) waveforms, fingertip PPG signals, ECG, and respiration. The data with patient ID and corresponding available PPG and ABP signals were extracted for analysis in our study. A manual check was conducted to exclude the records with the movement artefact, missing peaks, no signal, and so on. Finally, 311,000 signals with a frequency of 125 Hz were put into the analysis, and the first 90% of the randomized data from the dataset are used for training and the rest are for testing.

2.2. Hypertension Criteria

The ABP signals were used to label the blood pressure levels and confirm the classification of BP. The records were divided into normal or abnormal following the 2017 American College of Cardiology/American Heart Association (ACC/AHA) Hypertension Guidelines [3].

2.3. Signal Preprocessing

The ABP and PPG signals were used as the target source and predicted source, respectively. Each PPG segment was firstly processed with a moving average filter by Python function (numpy. convolve). This filter is a moving average filter to smooth the PPG signal. Baseline wandering caused by the respiratory activity was also removed from the segments. In a segment, the mean value of synchronous ABP wave peaks was calculated as systolic pressure and the mean value of wave troughs as diastolic pressure. Then, the BP category was labelled as described in the ACC/AHA Hypertension Guidelines.

2.4. PPG Signal Transformation

The output of CWT was using Python functions of pywt (ver. 1.1) and Matplotlib (ver. 3.3): x-axis representing time, y-axis representing scale (analogous to frequency), and z-axis showing coefficient value. Then, a scalogram is plotted as a smooth 2D image of time and frequency, and the amplitude of the frequency components is shown by varying the color or intensity of that point. Typically, dark blue colors represent the low amplitudes and bright yellow colors mean large amplitude coefficients. The PPG signal is transformed into different scalograms using Frequency B-Spline wavelet (fbsp1-15-1), Shannon wavelet (shan15-1), Complex Gaussian wavelet (cgau1), Morlet wavelet (morl), Mexican hat wavelet (mexh), and Gaussian wavelet (gaus1) for testing. The transformed colorful scalograms by different wavelets are seen in Figure 1. The list included the equations and parameters of these analytic wavelets:

Figure 1

The transformed colorful 2D scalograms by different wavelets.

Parameter explanation: m is an integer order parameter (≥1); F is bandwidth; and F is a wavelet center frequency: Parameter explanation: F is bandwidth; F is a wavelet center frequency; and the condition F > F/2 is sufficient to ensure that zero is not in the frequency support interval: Parameter explanation: the number “n” means vanishing moments, where diff denotes the symbolic derivative and C is a constant: Parameter explanation: the number “n” means vanishing moments, where diff denotes the symbolic derivative and C is such that the 2-norm of gaus (x, n) = 1.

2.5. Segment Length

The recordings divided into different segments by different units containing reference BPs for analysis are listed here, and all the images were converted to 224 ∗ 224 ∗ 3-pixel size: 0.8 s (segment 100): train 2799, validation 311. 1.2 s (segment 150): train 1866, validation 207. 1.6 s (segment 200): train 1399, validation 155. 2.0 s (segment-250): train 1118, validation 124. 2.4 s (segment-300): train 931, validation 103. 2.8 s (segment 350): train 798, validation 88. 3.2 s (segment 400): train 698, validation 77. 3.6 s (segment-450): train 620, validation 69. 4.0 s (segment 500): train 558, validation 62. The accuracy of each CWT using different segments for BP classification was illustrated as a line chart to compare which length of the segment was better.

2.6. Convolutional Neural Networks

Our proposed CNN is a deep learning tool applied to analyzing visual imagery (Figure 2). A classical CNN model consists of input and output layers, as well as several hidden layers [15]. The input is the CWT generated from the PPG signal that was sorted as 224 ∗ 224 ∗ 3-pixel image. Scalogram images transformed from MATLAB (Morse wavelet) were also tested on our CNN. The hidden layers including two convolution layers (C1 and C3), two pool layers (S2 and S4), and two fully connected layers (F5 and F6) define the core architecture of the network, where most of the computation and learning take place. The F7 (output layer) containing 1 neuron (sigmoid activation) carries on the work into a logistic function using sigmoid.

Figure 2

The layers of the proposed convolutional neural networks.

C1: 64 kernels of size (3 ∗ 3) with a stride setting of one and the same padding (ReLU activation). S2: max pooling (pool size is 2) with a stride of two and the same padding. C3: 128 kernels of size (5 ∗ 5) with a stride of one and the same padding (ReLU activation). S4: max pooling (pool size is 2) with a stride of two and the same padding. F5: 256 neurons (ReLU activation). F6: 128 neurons (ReLU activation). We also performed the transfer learning through pretrained CNN-GoogLeNet which can be directly applied to other computer vision identification. The transformed signals by CWT were also converted to 224 ∗ 224 ∗ 3 sized images and then ran on the CNN-GoogLeNet. It is aimed at comparing our work to the previous study using MATLAB Morse wavelet transformed scalogram [13] and CNN-GoogLeNet.

2.7. Accuracy Evaluation

All the accuracy of different CWTs and segments were calculated and shown on the line chart. The accuracy of the best three combinations testing on our proposed CNN was displayed using receiver-operating characteristic (ROC) curve analysis. Then, we applied our best match of CWT and segment using our proposed CNN and transfer learning on CNN-GoogLeNet [16] to compare with MATLAB scalogram. The accuracy comparison and overfitting evaluation were present through loss and accuracy curves.

3. Results

3.1. Data Overview

Normal BP: 1641 samples; elevated BP: 1739 samples; Stage I hypertension (I-HT): 2692 samples; Stage II hypertension (II-HT): 334 samples; normal BP: 104.81 ± 10.49/71.55 ± 3.94; abnormal BP: 132.22 ± 4.76/85.09 ± 6.92.

3.2. Comparison of Overall Accuracy

Table 1 shows all the testing accuracy of different CWTs and segments. Most of the accuracy was more than 70%; the lowest accuracy was 65%. The cgau1 with 2.4 s segment revealed the best performance with accuracy of 91%. Figure 3 shows the accuracy of different CWTs fluctuated up and down with different segments. The peaks of CWTs mostly appeared at the segments in the range of 250 (2.0 s) ∼ 300 (2.8 s).

Table 1

All the testing accuracy of different CWTs and segments to predict classification of blood pressure.

Accuracy (%)	100 (0.8 s)	150 (1.2 s)	200 (1.6 s)	250 (2.0 s)	300 (2.4 s)	350 (2.8 s)	400 (3.2 s)	450 (3.6 s)	500 (4.0 s)
No CWT	77	70	79	81	70	74	68	68	75
fbsp1-15-1	70	73	72	78	71	68	72	70	81
shan15-1	77	84	78	81	77	77	75	65	76
cgau1	73	84	81	87	90	85	77	70	78
morl	75	74	79	84	68	72	75	68	82
mexh	82	80	81	86	81	73	78	69	77
gaus1	78	79	82	86	85	82	74	69	77

Figure 3

The accuracies of different algorithms by matching CWTs with different segment lengths.

3.3. Accuracy Evaluation from ROC

The first three best combinations of CWT and segment were cgau1 and segment-300; gaus1 and segment-250; and mexh and segment-250. The accuracies were 90%, 86%, and 86%, respectively. The accuracy based on the testing set was close to the accuracy of the training set in cgau1, so there was no overfitting problem from the ROC (Figure 4).

Figure 4

The receiver-operating characteristic curves of cgau1 and segment-300; gaus1 and segment-250; and mexh and segment-250 for prediction of blood pressure classification.

3.4. The Cgau1 and Segment-300 Examples of Different BP Categories

The image examples of different BP categories transformed from cgau1 are shown in Figure 5. The images transformed from cgau1 and segment-300 can be distinguished visually by the naked eyes. The feature of the normal BP (110/72 mmHg) image is smaller peaks and bright yellow at the dexter base. The feature of elevated BP (128/78 mmHg) image is bright yellow at the center bottom of the left higher peaks. The features of I-HT and II-HT are similar with minor variations.

Figure 5

The image examples transformed from cgau1 for different blood pressure categories.

3.5. Comparison with Previous Work

We compared cgau1 and segment-300 to scalogram and segment-300 transformation previously used by other studies. The ROC (Figure 6) showed that the accuracy of cgau1 and segment-300 (90%) was better than that of MATLAB Morse and segment-300 (82%) in our proposed CNN.

Figure 6

The accuracy of cgau1 and segment-300 and MATLAB scalogram and segment-300 in our proposed CNN from the receiver-operating characteristic curves.

When the cgau1 and segment-300 and MATLAB Morse and segment-300 methods were tested on CNN-GoogLeNet by transfer learning, cgau1 and segment-300 also performed better than MATLAB Morse and segment-300. The loss and accuracy training process (Figure 7) showed that the accuracy and loss were 90.36% versus 84.5% and 0.23 versus 0.42, respectively. There was also no overfitting in both. It means that our best model was efficient in other CNN models and stable for predictions when new data were applied.

Figure 7

The loss and accuracy training process of cgau1 and segment-300 and MATLAB scalogram and segment-300 in CNN-GoogLeNet by transfer learning.

4. Discussion

PPG is a low-cost, miniature, and wearable optical biosensor that can be applied to cardiovascular monitoring, including the detection of blood oxygen saturation, heart rate, BP, and cardiac output [17, 18]. Waveform propagation and waveform morphology are the two main methods to detect BP from PPG [19]. The waveform propagation method requires multiple sensors, and the issue of synchronization of signals increases the difficulty and instability of BP analysis. Documented PPG and ECG signal data from MIMIC are not perfectly synchronized. Since the PPG signal produces pulse waveforms that are very similar to pressure waveforms generated by tonometry, there is clear evidence that the BP fluctuations are reflected in the PPG waveform morphology [20]. The PPG signal is a complex mixture with a coverage area of arteries, veins, and numerous capillaries [21]. A raw PPG signal provides much information about the circulatory system generated from pulsatile and nonpulsatile blood volume [22]. Then, it is not easy to extract the features of BP information from raw messy signals by waveform morphology. Slope transit time (STT) [23] and BD area [24] were correlated with BP but are far from the accurate prediction of BP. The CWT is an effective tool to expose the characteristics of different BP levels transformed from raw PPG signals. Figure 7 in our study clearly illustrated the features of different BP categories, and the transformed images are easy to recognize. Deep learning technology plays a pivotal role in image recognition [25] and is qualified for the work of classifying the CWT scalograms. The results in our study showed that except for fbsp1-15-1, the accuracy of CWTs was higher than that of no CWT when testing on our proposed CNN. CNN use relatively little preprocessing compared to other image classification algorithms, which has gained a lot of popularity in this field [26]. Liang et al. uses the MATLAB scalogram and pretrained CNN (GoogLeNet) to classify BP, and the three classification trials of normotension versus prehypertension, normotension versus hypertension, and normotension + prehypertension versus hypertension F1-scores were 80.52%, 92.55%, and 82.95%, respectively [13]. The accuracy of the MATLAB scalogram on our proposed CNN was 81.55%, which was similar on CNN-GoogLeNet. Cgau1 transform performed better compared with MATLAB default scalogram on both of our proposed CNN and CNN-GoogLeNet. These results indicated that different CWTs would affect the accuracy of BP classification via deep learning. Few studies reported whether the segment length could influence the accuracy of BP estimation via PPG. Most studies extracted a 5-second segment as a testing unit [13, 27]. The longer the segment is, the smaller the sample size it will make. The MIMIC dataset is finite, and the 5-second segment will reduce the sample size for testing and validation, so we used the segment units under 4 seconds to keep enough sample size for testing. Interestingly, the performance was quite dissimilar using different segment units. Most of the CWTs performed best on segment-250 (2.0 s) and segment-300 (2.4 s). The low accuracy appeared at segment-450 in all the CWTs. It tells us that a shorter length of the segment would lower the accuracy for prediction, but it does not mean the longer the better. The combination of appropriate CWT, segment length, and deep learning model will construct the optimal algorithm for accurate prediction. The combination of cgau1 CWT and segment-300 (2.4 s) in our work is the best solution using CNN for BP classification. The method performed better than MATLAB scalogram and segment-300 [13]. The accuracies are quite similar when tested on self-established CNN and CNN-GoogLeNet of transfer learning. Thus, our model is a feasible and promising solution for continuous cuffless BP monitoring on different platforms. If this low-cost, cuffless BP monitors can be developed, it is likely that conventional paradigms of BP measurement would be disrupted. Even some special medical conditions can also benefit from this device; for example, it is proven that cuffless BP measurement still shows high accuracy and reliability in arrhythmia patients [28] and children [29]. Although the PPG signal is altered in individuals suffering from obesity [30], it can be optimized based on multiple types of information fusion [31]. There are some limitations to our study. Firstly, we did not test all the wavelets on the MIMIC dataset, and cgau1 may be not the best CWT for BP classification. Other wavelets, such as Gabor [32] and Paul [33] are the potential to exceed the ability of cgau1 and segment-300 to predict BP estimation. Secondly, our model is not validated on new data and whether the accuracy will keep as high as 90% in real-life scenarios is still unknown. Thirdly, there is a “length effect” on the accuracy of BP assessment by CWT signals, as 2.0 s and 2.4 s were the optimal segments, but a longer segment unit did not improve the accuracy. The possible reason is that longer segments could reduce the amount of training data, which may influence the training result for deep learning. We need more data to confirm the length effect.

5. Conclusion

Type of CWT and segment length are the determinants for improving the accuracy to forecast blood pressure classification from PPG using deep learning. The cgau1 and segment-300 method performed better than the previously established MATLAB Morse scalogram and segment-300 approach on both of our proposed CNN and CNN-GoogLeNet. Thus, we established a new algorithm with high accuracy to predict blood pressure classification from PPG via matching of CWT type and segment length. Our study provides a promising solution that can be applied to the real-time processing of the PPG signal from the wearable device for rapid classification of BP.

22 in total

1. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals.

Authors: A L Goldberger; L A Amaral; L Glass; J M Hausdorff; P C Ivanov; R G Mark; J E Mietus; G B Moody; C K Peng; H E Stanley
Journal: Circulation Date: 2000-06-13 Impact factor: 29.690

2. Continuous estimation of systolic blood pressure using the pulse arrival time and intermittent calibration.

Authors: W Chen; T Kobayashi; S Ichikawa; Y Takeuchi; T Togawa
Journal: Med Biol Eng Comput Date: 2000-09 Impact factor: 2.602

Review 3. Photoplethysmography and its application in clinical physiological measurement.

Authors: John Allen
Journal: Physiol Meas Date: 2007-02-20 Impact factor: 2.833

Review 4. 2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA Guideline for the Prevention, Detection, Evaluation, and Management of High Blood Pressure in Adults: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines.

Authors: Paul K Whelton; Robert M Carey; Wilbert S Aronow; Donald E Casey; Karen J Collins; Cheryl Dennison Himmelfarb; Sondra M DePalma; Samuel Gidding; Kenneth A Jamerson; Daniel W Jones; Eric J MacLaughlin; Paul Muntner; Bruce Ovbiagele; Sidney C Smith; Crystal C Spencer; Randall S Stafford; Sandra J Taler; Randal J Thomas; Kim A Williams; Jeff D Williamson; Jackson T Wright
Journal: J Am Coll Cardiol Date: 2017-11-13 Impact factor: 24.094

Review 5. Blood Pressure Measurement and Treatment Decisions.

Authors: Kazuomi Kario; Lutgarde Thijs; Jan A Staessen
Journal: Circ Res Date: 2019-03-29 Impact factor: 17.367

6. Continuous Blood Pressure Estimation From Electrocardiogram and Photoplethysmogram During Arrhythmias.

Authors: ZengDing Liu; Bin Zhou; Ye Li; Min Tang; Fen Miao
Journal: Front Physiol Date: 2020-09-09 Impact factor: 4.566

7. The use of ambulatory blood pressure monitoring among Medicare beneficiaries in 2007-2010.

Authors: Daichi Shimbo; Shia T Kent; Keith M Diaz; Lei Huang; Anthony J Viera; Meredith Kilgore; Suzanne Oparil; Paul Muntner
Journal: J Am Soc Hypertens Date: 2014-09-18

8. Blood Pressure Estimation Using Photoplethysmography Only: Comparison between Different Machine Learning Approaches.

Authors: Syed Ghufran Khalid; Jufen Zhang; Fei Chen; Dingchang Zheng
Journal: J Healthc Eng Date: 2018-10-23 Impact factor: 2.682

9. Hypertension Assessment via ECG and PPG Signals: An Evaluation Using MIMIC Database.

Authors: Yongbo Liang; Zhencheng Chen; Rabab Ward; Mohamed Elgendi
Journal: Diagnostics (Basel) Date: 2018-09-10

10. Photoplethysmography and Deep Learning: Enhancing Hypertension Risk Stratification.

Authors: Yongbo Liang; Zhencheng Chen; Rabab Ward; Mohamed Elgendi
Journal: Biosensors (Basel) Date: 2018-10-26

1 in total

1. A Data-Driven Model with Feedback Calibration Embedded Blood Pressure Estimator Using Reflective Photoplethysmography.

Authors: Jia-Wei Chen; Hsin-Kai Huang; Yu-Ting Fang; Yen-Ting Lin; Shih-Zhang Li; Bo-Wei Chen; Yu-Chun Lo; Po-Chuan Chen; Ching-Fu Wang; You-Yin Chen
Journal: Sensors (Basel) Date: 2022-02-27 Impact factor: 3.576

1 in total