| Literature DB >> 32492902 |
Moajjem Hossain Chowdhury1, Md Nazmul Islam Shuzan1, Muhammad E H Chowdhury2, Zaid B Mahbub3, M Monir Uddin3, Amith Khandakar2,3, Mamun Bin Ibne Reaz4.
Abstract
Hypertension is a potentially unsafe health ailment, which can be indicated directly from the blood pressure (BP). Hypertension always leads to other health complications. Continuous monitoring of BP is very important; however, cuff-based BP measurements are discrete and uncomfortable to the user. To address this need, a cuff-less, continuous, and noninvasive BP measurement system is proposed using the photoplethysmograph (PPG) signal and demographic features using machine learning (ML) algorithms. PPG signals were acquired from 219 subjects, which undergo preprocessing and feature extraction steps. Time, frequency, and time-frequency domain features were extracted from the PPG and their derivative signals. Feature selection techniques were used to reduce the computational complexity and to decrease the chance of over-fitting the ML algorithms. The features were then used to train and evaluate ML algorithms. The best regression models were selected for systolic BP (SBP) and diastolic BP (DBP) estimation individually. Gaussian process regression (GPR) along with the ReliefF feature selection algorithm outperforms other algorithms in estimating SBP and DBP with a root mean square error (RMSE) of 6.74 and 3.59, respectively. This ML model can be implemented in hardware systems to continuously monitor BP and avoid any critical health conditions due to sudden changes.Entities:
Keywords: blood pressure; feature selection algorithm; machine learning; photoplethysmograph
Mesh:
Year: 2020 PMID: 32492902 PMCID: PMC7309072 DOI: 10.3390/s20113127
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1A typical photoplethysmograph (PPG) waveform with notch, systolic peak, and diastolic peak.
Figure 2Overall system block diagram.
Data summary.
| Physical Index | Numerical Data |
|---|---|
| Females | 115 (52%) |
| Age (years) | 57 ± 15 |
| Height (cm) | 161 ± 8 |
| Weight (kg) | 60 ± 11 |
| Body Mass Index (kg/m2) | 23 ± 4 |
| Systolic Blood Pressure (mmHg) | 127 ± 20 |
| Diastolic Blood Pressure (mmHg) | 71 ± 11 |
| Heart Rate (beats/min) | 73 ± 10 |
Figure 3Comparison of waveforms that are fit and unfit for the study. (a) Fit data; (b) unfit data.
Figure 4PPG signal. (a) Before normalization; (b) after normalization.
Figure 5Filtered signals overlaid on the raw PPG signal.
Figure 6Baseline correction of PPG waveform. (a) PPG waveform with the baseline wandering and fourth degree polynomial trend; (b) PPG waveform after detrending.
Figure 7Overview of feature extraction.
Figure 8Algorithm of notch detection.
Figure 9Demonstration of dicrotic notch detection for different age groups: Case 1 (26 years), 2 (45 years), and 3 (80 years). (a) Filtered PPG signal where we draw a line from systolic peak to diastolic peak; (b) subtract the line from the signal and find its minimum point; (c) initial notch detected; (d) adjust the notch using the fix index.
Figure 10Detection of the foot of a PPG waveform. (a) Filtered PPG signal; (b) second derivative of PPG along with derivation of the zone of interest based on moving average of acceleration plethysmogram (APG) and adaptive threshold; (c) foot of the signal detected.
Figure 11(a) Illustration of time-domain features in a PPG signal. (b) First and second derivatives of PPG signal.
Figure 12Frequency-domain representation of PPG signal with important features.
Twenty-four features from the PPG signal.
| Feature | Definition |
|---|---|
| 1. Systolic Peak | The amplitude of (‘x’) from PPG waveform |
| 2. Diastolic Peak | The amplitude of (‘y’) from PPG waveform |
| 3. Height of Notch | The amplitude of (‘z’) from PPG waveform |
| 4. Systolic Peak Time | The time interval from the foot of the waveform to the systolic peak (‘t1’) |
| 5. Diastolic Peak Time | The time interval from the foot of the waveform to the height of notch (‘t2’) |
| 6. Height of Notch Time | The time interval from the foot of the waveform to the diastolic peak (‘t3’) |
| 7. ∆T | The time interval from systolic peak time to diastolic peak time |
| 8. Pulse Interval | The distance between the beginning and the end of the PPG waveform (‘tpi’) |
| 9. Peak-to-Peak Interval | The distance between two consecutive systolic peaks (tpp) |
| 10. Pulse Width | The half-height of the systolic peak |
| 11. Inflection Point Area | The waveform is first split into two parts at the notch point. The area of the first part is A1 and the area of the second part is A2. The ratio of A1 and A2 is the inflection point area (‘A1/A2 ’) |
| 12. Augmentation Index | The ratio of diastolic and systolic peak amplitude (‘y/x’) |
| 13. Alternative Augmentation Index | The difference between systolic and diastolic peak amplitude divided by systolic peak amplitude (‘(x-y)/x’) |
| 14. Systolic Peak Output Curve | The ratio of systolic peak time to systolic peak amplitude (‘t1/x’) |
| 15. Diastolic Peak Downward Curve | The ratio of diastolic peak amplitude to the differences between pulse interval and height of notch time (‘y/ tpi-t3’) |
| 16. t1/tpp | The ratio of systolic peak time to the peak-to-peak interval of the PPG waveform |
| 17. t2/tpp | The ratio of notch time to the peak-to-peak interval of the PPG waveform |
| 18. t3/tpp | The ratio of diastolic peak time to the peak-to-peak interval of the PPG waveform |
| 19. ∆T/tpp | The ratio of ∆T to the peak-to-peak interval of the PPG waveform |
| 20. z/x | The ratio of the height of notch to the systolic peak amplitude |
| 21. t2/z | The ratio of the notch time to the height of notch |
| 22. t3/y | The ratio of the diastolic peak time to the diastolic peak amplitude |
| 23. x/(tpi-t1) | The ratio of systolic peak amplitude to the difference between pulse interval and systolic peak time |
| 24. z/(tpi-t2) | The ratio of the height of notch to the difference between pulse interval and notch time |
Seventeen width-related PPG features.
| Feature | Definition |
|---|---|
| 25. Width (25%) | The width of the waveform at 25% amplitude of systolic amplitude |
| 26. Width (75%) | The width of the waveform at 75% amplitude of systolic amplitude |
| 27. Width (25%)/t1 | The ratio of pulse width at 25% of systolic amplitude to systolic peak time |
| 28. Width (25%)/t2 | The ratio of pulse width at 25% of systolic amplitude to notch time |
| 29. Width (25%)/t3 | The ratio of pulse width at 25% of systolic amplitude to diastolic peak time |
| 30. Width (25%)/∆T | The ratio of pulse width at 25% of systolic amplitude to ∆T |
| 31. Width (25%)/tpi | The ratio of pulse width at 25% of systolic amplitude to pulse interval |
| 32. Width (50%)/t1 | The ratio of pulse width at 50% of systolic amplitude to systolic peak time |
| 33. Width (50%)/t2 | The ratio of pulse width at 50% of systolic amplitude to notch time |
| 34. Width (50%)/t3 | The ratio of pulse width at 50% of systolic amplitude to diastolic peak time |
| 35. Width (50%)/∆T | The ratio of pulse width at 50% of systolic amplitude to ∆T |
| 36. Width (50%)/tpi | The ratio of pulse width at 50% of systolic amplitude to pulse interval |
| 37. Width (75%)/t1 | The ratio of pulse width at 75% of systolic amplitude to systolic peak time |
| 38. Width (75%)/t2 | The ratio of pulse width at 75% of systolic amplitude to notch time |
| 39. Width (75%)/t3 | The ratio of pulse width at 75% of systolic amplitude to diastolic peak time |
| 40. Width (75%)/∆T | The ratio of pulse width at 75% of systolic amplitude to ∆T |
| 41. Width (75%)/tpi | The ratio of pulse width at 75% of systolic amplitude to pulse interval |
Sixteen features derived from the first and second derivative.
| Feature | Definition |
|---|---|
| 42. a1 | The first maximum peak from the first derivative of the PPG waveform |
| 43. ta1 | The time interval from the foot of the PPG waveform to the time at which a1 occurred |
| 44. a2 | The first maximum peak from the second derivative of the PPG waveform after a1 |
| 45. ta2 | The time interval from the foot of the PPG waveform to the time at which a2 occurred |
| 46. b1 | The first minimum peak from the first derivative of the PPG waveform after a1 |
| 47. tb1 | The time interval from the foot of the PPG waveform to the time at which b1 occurred |
| 48. b2 | The first minimum peak from the second derivative of the PPG waveform after a2 |
| 49. tb2 | The time interval from the foot of the PPG waveform to the time at which b2 occurred |
| 50. b2/a2 | The ratio of b2 to a2 |
| 51. b1/a1 | The ratio of first minimum peak of the first derivative after a1 to first maximum peak of the first derivative |
| 52. ta1/tpp | The ratio of ta1 to the peak-to-peak interval of the PPG waveform |
| 53. tb1/tpp | The ratio of tb1 to the peak-to-peak interval of the PPG waveform |
| 54. tb2/tpp | The ratio of tb2 to the peak-to-peak interval of the PPG waveform |
| 55. ta2/tpp | The ratio of ta2 to the peak-to-peak interval of the PPG waveform |
| 56. (ta1–ta2)/tpp | The ratio of the difference between ta1 and ta2 to the peak-to-peak interval of the PPG waveform |
| 57. (tb1–tb2)/tpp | The ratio of the difference between tb1 and tb2 to the peak-to-peak interval of the PPG waveform |
Eighteen demographic time-domain features.
| Feature | Definition |
|---|---|
| 58. Height/∆T | It is known as stiffness index |
| 59. Weight/∆T | The ratio of weight to ∆T |
| 60. BMI/∆T | The ratio of BMI to ∆T |
| 61. Height/t1 | The ratio of height to the systolic peak time |
| 62. Weight/t1 | The ratio of weight to the systolic peak time |
| 63. BMI/t1 | The ratio of BMI to the systolic peak time |
| 64. Height/t2 | The ratio of height to the notch time |
| 65. Weight/t2 | The ratio of weight to the notch time |
| 66. BMI/t2 | The ratio of BMI to the notch time |
| 67. Height/t3 | The ratio of height to the diastolic peak time |
| 68. Weight/t3 | The ratio of weight to the diastolic peak time |
| 69. BMI/t3 | The ratio of BMI to the diastolic peak time |
| 70. Height/tpi | The ratio of height to the pulse interval |
| 71. Weight/tpi | The ratio of weight to the pulse interval |
| 72. BMI/tpi | The ratio of BMI to the pulse interval |
| 73. Height/tpp | The ratio of height to the peak-to-peak interval |
| 74. Weight/tpp | The ratio of weight to the peak-to-peak interval |
| 75. BMI/tpp | The ratio of BMI to the peak-to-peak interval |
Sixteen frequency-domain features.
| Feature | Definition |
|---|---|
| 76. Peak-1 | The amplitude of the first peak from the fast Fourier transform of the PPG signal |
| 77. Peak-2 | The amplitude of the second peak from the fast Fourier transform of the PPG signal |
| 78. Peak-3 | The amplitude of the third peak from the fast Fourier transform of the PPG signal |
| 79. Freq-1 | The frequency at which the first peak from the fast Fourier transform of the PPG signal occurred |
| 80. Freq-2 | The frequency at which the second peak from the fast Fourier transform of the PPG signal occurred |
| 81. Freq-3 | The frequency at which the third peak from the fast Fourier transform of the PPG signal occurred |
| 82. A0–2 | Area under the curve from 0 to 2 Hz for the fast Fourier transform of the PPG signal |
| 83. A2–5 | Area under the curve from 2 to 5 Hz for the fast Fourier transform of the PPG signal |
| 84. A0–2/A2–5 | The ratio of the area under the curve from 0 to 2 Hz to the area under the curve from 2 to 5 Hz |
| 85. Peak-1/peak-2 | The ratio of the first peak to the second peak from the fast Fourier transform of the PPG signal |
| 86. Peak-1/peak-3 | The ratio of the first peak to the third peak from the fast Fourier transform of the PPG signal |
| 87. Freq-1/freq-2 | The ratio of the frequency at first peak to the frequency at second peak from the fast Fourier transform of the PPG signal |
| 88. Freq-1/freq-3 | The ratio of the frequency at first peak to the frequency at third peak from the fast Fourier transform of the PPG signal |
| 89. Maximum Frequency | The value of highest frequency in the signal spectrum |
| 90. Magnitude at Fmax | Signal magnitude at highest frequency |
| 91. Ratio of signal energy | Ratio of signal energy between |
Ten statistical features.
| Feature | Definition | Equation |
|---|---|---|
| 92. Mean | Sum of all data divided by the number of entries |
|
| 93. Median | Value that is in the middle of the ordered set of data | Odd numbers of entries: Median = middle data entry. |
| 94. Standard Deviation | Measure variability and consistency of the sample. | |
| 95. Percentile | The data value at which the percent of the value in the data set are less than or equal to this value. | 25th = ( |
| 75th = ( | ||
| 96. Mean Absolute Deviation | Average distance between the mean and each data value. | MAD = |
| 97. Inter Quartile Range (IQR) | The measure of the middle 50% of data. | IQR = Q3–Q1 |
| 98. Skewness | The measure of the lack of symmetry from the mean of the dataset. | g1 = |
| 99. Kurtosis | The pointedness of a peak in distribution curve, in other words it is the measure of sharpness of the peak of distribution curve. | K = |
| 100. Shannon’s Entropy | Entropy measures the degree of randomness in a set of data, higher entropy indicates a greater randomness, and lower entropy indicates a lower randomness. | H(x) = − |
| 101. Spectral Entropy | The normalized Shannon’s entropy that is applied to the power spectrum density of the signal. | SEN = |
Six demographic features.
| 102. Height | 103. Weight | 104. Gender | 105. Age | 106. BMI | 107. Heart rate |
Features chosen by the feature selection algorithms.
| Feature Selection Algorithms Used | Systolic Blood Pressure | Diastolic Blood Pressure |
|---|---|---|
| RELIEFF | 105. Age, | 105. Age, |
| FSCMRMR | 105. Age, | 103. Weight, |
| CFS | 69. BMI/t3, | 69. BMI/t3, |
Evaluation of the best performing algorithm for systolic blood pressure (SBP) and diastolic blood pressure (DBP).
| Selection Criteria | Performance Criteria | Systolic Blood Pressure | Diastolic Blood Pressure | ||
|---|---|---|---|---|---|
| GPR | Ensemble Trees | GPR | Ensemble Trees | ||
| Features from the literature | MAE | 12.27 | 12.68 | 8.31 | 8.82 |
| All features | MAE | 12.06 | 12.95 | 7.70 | 8.31 |
| ReliefF | MAE |
| 12.57 | 7.87 | 8.93 |
| FSCMRMR | MAE | 13.92 | 15.10 | 8.84 | 9.66 |
| CFS | MAE | 11.91 | 13.06 |
| 8.27 |
Figure 13Optimization of the Gaussian process regression (GPR) model during training.
Evaluation of the outperforming algorithms for estimating SBP and DBP after optimization.
| Selection Criteria | Performance Criteria | Systolic Blood Pressure | Diastolic Blood Pressure | ||
|---|---|---|---|---|---|
| Optimized GPR | Optimized Ensemble Trees | Optimized GPR | Optimized Ensemble Trees | ||
| Features from the literature | MAE | 6.79 | 12.43 | 4.49 | 8.17 |
| All features | MAE | 3.30 | 10.886 | 2.81 | 7.96 |
| ReliefF | MAE |
| 11.32 |
| 5.99 |
| FSCMRMR | MAE | 6.11 | 14.65 | 6.80 | 8.22 |
| CFS | MAE | 12.95 | 16.27 | 7.59 | 7.89 |
Figure 14Comparison of the predicted output vs. actual target for SBP estimation using different GPR: (a–c) Models without optimization, (d–f) models with optimization.
Figure 15Comparison of the predicted output vs. actual target for DBP estimation using different GPR: (a–c) Models without optimization, (d–f) models with optimization.
Comparison with related works in relation to dataset, methodology, and estimation error.
| Author | Method Used | Number of Subjects | Performance Criteria | Systolic Blood Pressure | Diastolic Blood Pressure |
|---|---|---|---|---|---|
| Kachuee et al. [ | SVM | MIMIC II (1000 subjects) | MAE | 12.38 | 6.34 |
| Kim et al. [ | Multiple nonlinear regression (MLP) | 180 recordings, 45 subjects | MAE | 5.67 | - |
| Kim et al. [ | Artificial neural network (ANN) | 180 recordings, 45 subjects | MAE | 4.53 | - |
| Cattivelli et al. [ | Proprietary algorithm | MIMIC database (34 recordings, 25 subjects) | MAE | - | - |
| Zhang et al. [ | Support vector machine (SVM) | 7000 samples from 32 patients | MAE | 11.64 | 7.62 |
| Zhang et al. [ | Neural network (nine input neurons) | 7000 samples from 32 patients | MAE | 11.89 | 8.83 |
| Zadi et al. [ | Autoregressive moving average (ARMA) models | 15 subjects | MAE | - | - |
| Slapničar et al. [ | Deep learning | MIMIC III database (510 subjects) | MAE | 9.43 | 6.88 |
| Su et al. [ | Deep learning | 84 subjects | MAE | - | - |
| This work | Gaussian process regression (GPR) | 222 recordings, 126 subjects | MAE |
|
|
* Deep learning algorithm on a small database.
Comparison of this paper results with the Association for the Advancement of Medical Instrumentation (AAMI) standard.
| MEAN (mmHg) | SD (mmHg) | Subject | ||
|---|---|---|---|---|
| AAMI [ | BP | ≤5 | ≤8 | ≥85 |
| This paper | SBP | 3.02 | 9.29 | 126 |
| DBP | 1.74 | 5.54 | 126 |
Comparison of this paper results with the British Hypertension Society (BHS) standard.
| ≤5 mmHg | ≤10 mmHg | ≤15 mmHg | ||
|---|---|---|---|---|
| BHS [ | Grade A | 60% | 85% | 95% |
| This paper | SBP | 69% | 76% | 92% |