| Literature DB >> 31569798 |
Abstract
In real world scenarios, the task of estimating heart rate (HR) using video plethysmography (VPG) methods is difficult because many factors could contaminate the pulse signal (i.e., a subjects' movement, illumination changes). This article presents the evaluation of a VPG system designed for continuous monitoring of the user's heart rate during typical human-computer interaction scenarios. The impact of human activities while working at the computer (i.e., reading and writing text, playing a game) on the accuracy of HR VPG measurements was examined. Three commonly used signal extraction methods were evaluated: green (G), green-red difference (GRD), blind source separation (ICA). A new method based on an excess green (ExG) image representation was proposed. Three algorithms for estimating pulse rate were used: power spectral density (PSD), autoregressive modeling (AR) and time domain analysis (TIME). In summary, depending on the scenario being studied, different combinations of signal extraction methods and the pulse estimation algorithm ensure optimal heart rate detection results. The best results were obtained for the ICA method: average RMSE = 6.1 bpm (beats per minute). The proposed ExG signal representation outperforms other methods except ICA (RMSE = 11.2 bpm compared to 14.4 bpm for G and 13.0 bmp for GRD). ExG also is the best method in terms of proposed success rate metric (sRate).Entities:
Keywords: assisted living; biomedicine; healthcare; heart rate estimation; human-computer interaction; image processing; video pletysmography
Mesh:
Year: 2019 PMID: 31569798 PMCID: PMC6806289 DOI: 10.3390/s19194205
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Face detection and tracking—algorithm outline.
Figure 2Example image frame with the region of interest (ROI) superimposed.
Figure 3HR estimation algorithm outline.
Figure 4An example of HR time-series plots for algorithm No.1 (PSD) and ExG signal representation: (a) video No.5; (b) video No.9.
Results of the delay estimation for selected algorithms.
| Algorithm | t0 [s] | t1 [s] | t2 = t0 − t1[s] |
|---|---|---|---|
| No.1 (PSD) | 13.4 | 3 | 10.4 |
| No.2 (AR) | 6.6 | 3 | 3.6 |
| No.3 (TIME) | 5.1 | 3 | 2.1 |
Recorded video sequences covered by the study.
| Video No. | Room Settings | Participant’s Details | Camera Parameters |
|---|---|---|---|
| 1 | room 1: artificial ceiling fluorescent light + natural light (dusk, medium lighting) from a one window on the left side + light from the one computer screen | participant 1: | camera-to-face distance ~50 cm, gain = 128, white balance off |
| 2 | room 1: artificial ceiling fluorescent light + natural light (dusk, medium lighting) from a one window on the left side + light from the one computer screen | participant 2: | camera-to-face distance ~50 cm, gain = 128, white balance off |
| 3 | room 2: daylight (cloudy, poor lighting): a one roof window on the left, and a second window in the back on the right + fluorescent lamps in the back (2 m) + ceiling fluorescent lamps + right-side table lamp + light from two computer screens | participant 3: | camera-to-face distance ~50 cm, gain = 128, white balance off |
| 4 | room 2: daylight (cloudy, medium lighting): a one roof window on the left, and a second window in the back on the right + fluorescent lamps in the back (2 m) + ceiling fluorescent lamps + light from two computer screens | participant 3: | camera-to-face distance ~50 cm, gain = 128, white balance on |
| 5 | room 3: daylight (sunny, strong lighting): a one window in the front + light from the one computer screen; | participant 3: | camera-to-face distance ~60 cm (computer screen slightly lower – user has to gaze slightly downwards), gain = 100, white balance on |
| 6 | room 4: nighttime, artificial light only (ceiling lamps, table lamps, led curtain lamps + light from the one computer screen); | participant 3: | camera-to-face distance ~50 cm (computer screen slightly lower – user has to gaze slightly downwards), gain = 128, white balance on |
| 7 | room 3: daylight (cloudy, medium lighting): a one window in the front + light from the one computer screen; | participant 4: | camera-to-face distance ~60 cm (computer screen slightly lower – user has to gaze slightly downwards), gain = 128, white balance on |
| 8 | room 2: daylight (cloudy, poor lighting): a one roof window on the left, and a second window in the back on the right + fluorescent lamps in the back (2 m) + light from two computer screens; | participant 3: | camera-to-face distance ~50 cm, gain = 100, white balance off |
| 9 | room 5: artificial ceiling fluorescent light + natural light (dusk, medium lighting) from a one window on the right side + right side bulb lamp + light from the one computer screen; | participant 5: | camera-to-face distance ~60 cm, gain = 128, white balance on |
Figure 5An example of video frames: (a) video No.5; (b) video No.9.
Duration of the recorded video sequences and selected parts (mm: ss).
| Video No. | Entire Video | Part 1 | Part 2 | Part 4 | Part 6 |
|---|---|---|---|---|---|
| video 1 | 05:33 | 01:00 | 00:25 | 00:32 | 01:01 |
| video 2 | 04:53 | 00:50 | 00:19 | 00:31 | 01:01 |
| video 3 | 05:30 | 00:55 | 00:24 | 00:55 | 01:04 |
| video 4 | 05:19 | 00:48 | 00:18 | 00:58 | 01:02 |
| video 5 | 05:48 | 01:00 | 00:26 | 01:07 | 01:02 |
| video 6 | 05:46 | 01:00 | 00:28 | 01:02 | 01:03 |
| video 7 | 05:56 | 00:57 | 00:32 | 01:07 | 01:01 |
| video 8 | 05:56 | 01:00 | 00:20 | 01:02 | 01:01 |
| video 9 | 05:44 | 01:01 | 00:34 | 01:12 | 00:20 |
The average illumination and standard deviation of accelerations for recorded video sequences and selected parts.
| Video No. | Entire Video | Part 1 | Part 2 | Part 4 | Part 6 | Entire Video | Part 1 | Part 2 | Part 4 | Part 6 |
|---|---|---|---|---|---|---|---|---|---|---|
| Average Illumination [lux] | std of Accelerations [G] | |||||||||
| video 1 | 72 | 78 | 77 | 62 | 69 | 0.134 | 0.021 | 0.010 | 0.011 | 0.006 |
| video 2 | 86 | 93 | 91 | 63 | 76 | 0.123 | 0.007 | 0.006 | 0.007 | 0.007 |
| video 3 | 40 | 44 | 43 | 39 | 35 | 0.112 | 0.011 | 0.010 | 0.009 | 0.008 |
| video 4 | 54 | 60 | 59 | 58 | 46 | 0.109 | 0.010 | 0.008 | 0.008 | 0.007 |
| video 5 | 950 | 1271 | 1048 | 806 | 816 | 0.118 | 0.007 | 0.008 | 0.008 | 0.008 |
| video 6 | 27 | 27 | 30 | 23 | 25 | 0.110 | 0.007 | 0.009 | 0.008 | 0.007 |
| video 7 | 152 | 146 | 140 | 113 | 170 | 0.125 | 0.013 | 0.018 | 0.011 | 0.008 |
| video 8 | 49 | 57 | 56 | 53 | 38 | 0.099 | 0.008 | 0.007 | 0.008 | 0.009 |
| video 9 | 106 | 108 | 107 | 103 | 99 | 0.102 | 0.009 | 0.011 | 0.017 | 0.011 |
Results of HR estimation, algorithm No. 1 (PSD)—comparison of signal extraction methods.
| Video No. | ||||||||
|---|---|---|---|---|---|---|---|---|
| G | GRD | ExG | ICA | G | GRD | ExG | ICA | |
| video 1 | 10.7 | 4.5 | 3.2 | 7.8 | 48% | 76% | 85% | 71% |
| video 2 | 16.5 | 15.2 | 15.7 | 14.7 | 27% | 41% | 39% | 40% |
| video 3 | 16.5 | 13.9 | 12.9 | 2.8 | 47% | 49% | 57% | 83% |
| video 4 | 10.1 | 11.6 | 7.3 | 3.2 | 45% | 55% | 65% | 78% |
| video 5 | 3.9 | 2.2 | 2.5 | 2.8 | 84% | 91% | 91% | 87% |
| video 6 | 14.1 | 13.5 | 11.1 | 7.2 | 43% | 42% | 65% | 62% |
| video 7 | 3.5 | 3.7 | 3.4 | 2.2 | 80% | 84% | 86% | 91% |
| video 8 | 6.8 | 17.9 | 17.2 | 4.0 | 61% | 19% | 25% | 79% |
| video 9 | 22.2 | 36.7 | 35.8 | 34.8 | 11% | 16% | 15% | 12% |
Results of HR estimation, algorithm No. 2 (AR)—comparison of signal extraction methods.
| Video No. | ||||||||
|---|---|---|---|---|---|---|---|---|
| G | GRD | ExG | ICA | G | GRD | ExG | ICA | |
| video 1 | 9.5 | 4.6 | 3.5 | 6.3 | 45% | 62% | 74% | 55% |
| video 2 | 17.0 | 11.1 | 11.6 | 13.0 | 30% | 45% | 44% | 37% |
| video 3 | 15.6 | 15.3 | 12.2 | 5.0 | 48% | 45% | 61% | 68% |
| video 4 | 8.6 | 12.3 | 7.0 | 5.7 | 57% | 57% | 69% | 47% |
| video 5 | 3.9 | 2.6 | 2.6 | 3.0 | 80% | 83% | 82% | 77% |
| video 6 | 19.4 | 17.2 | 15.4 | 11.7 | 23% | 27% | 46% | 35% |
| video 7 | 5.4 | 7.9 | 6.7 | 5.8 | 76% | 73% | 71% | 48% |
| video 8 | 15.6 | 20.6 | 19.3 | 4.8 | 53% | 28% | 34% | 63% |
| video 9 | 23.3 | 35.3 | 35.5 | 30.0 | 10% | 13% | 12% | 9% |
Results of HR estimation, algorithm No. 3 (TIME)—comparison of signal extraction methods.
| Video No. | ||||||||
|---|---|---|---|---|---|---|---|---|
| G | GRD | ExG | ICA | G | GRD | ExG | ICA | |
| video 1 | 16.8 | 4.3 | 2.8 | 7.4 | 49% | 70% | 83% | 51% |
| video 2 | 21.3 | 11.3 | 10.9 | 12.2 | 38% | 47% | 53% | 30% |
| video 3 | 20.7 | 18.4 | 17.7 | 6.3 | 28% | 25% | 36% | 43% |
| video 4 | 14.2 | 13.1 | 10.4 | 8.6 | 35% | 43% | 45% | 21% |
| video 5 | 16.7 | 2.5 | 2.6 | 5.5 | 51% | 88% | 89% | 52% |
| video 6 | 20.0 | 21.4 | 17.8 | 12.2 | 22% | 21% | 22% | 26% |
| video 7 | 9.1 | 9.6 | 10.9 | 7.8 | 50% | 47% | 48% | 18% |
| video 8 | 21.4 | 20.9 | 21.2 | 9.7 | 14% | 18% | 20% | 23% |
| video 9 | 16.2 | 34.0 | 34.5 | 29.1 | 18% | 13% | 11% | 14% |
Figure 6Comparison of signal extraction methods, algorithm No.1 (PSD): (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 7Comparison of signal extraction methods, algorithm No.2 (AR): (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 8Comparison of signal extraction methods, algorithm No.3 (TIME): (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
The Wilcoxon rank sum test results (p-values) for comparing different signal extraction methods.
| Comparison | ||||||
|---|---|---|---|---|---|---|
| PSD | AR | TIME | PSD | AR | TIME | |
| G vs GRD | 1.00 | 0.93 | 0.49 | 0.93 | 1.00 | 0.93 |
| G vs ExG | 0.73 | 0.49 | 0.34 | 0.39 | 0.55 | 0.49 |
| G vs ICA | 0.14 | 0.19 |
| 0.16 | 0.86 | 0.86 |
| GRD vs ExG | 0.67 | 0.67 | 0.80 | 0.67 | 0.60 | 0.73 |
| GRD vs ICA | 0.22 | 0.30 | 0.34 | 0.34 | 0.86 | 0.67 |
| ExG vs ICA | 0.30 | 0.39 | 0.39 | 0.55 | 0.60 | 0.34 |
Figure 9Comparison of signal extraction methods, algorithm No.1 (PSD), part 1: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 10Comparison of signal extraction methods, algorithm No.1 (PSD), part 2: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 11Comparison of signal extraction methods, algorithm No.1 (PSD), part 4: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 12Comparison of signal extraction methods, algorithm No.1 (PSD), part 6: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 13Comparison of signal extraction methods, algorithm No.2 (AR), part 1: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 14Comparison of signal extraction methods, algorithm No.2 (AR), part 2: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 15Comparison of signal extraction methods, algorithm No.2 (AR), part 4: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 16Comparison of signal extraction methods, algorithm No.2 (AR), part 6: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 17Comparison of signal extraction methods, algorithm No.3 (TIME), part 1: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 18Comparison of signal extraction methods, algorithm No.3 (TIME), part 2: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 19Comparison of signal extraction methods, algorithm No.3 (TIME), part 4: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
Figure 20Comparison of signal extraction methods, algorithm No.3 (TIME), part 6: (a) box plots for sRate; (b) box plots for RMSE. Blue lines—IQR range, red line—median value.
The Wilcoxon rank sum test results (p-values) for comparing different signal extraction methods and activities, algorithm No.1 (PSD).
| Comparison | ||||||||
|---|---|---|---|---|---|---|---|---|
| Part1 | Part2 | Part4 | Part6 | Part1 | Part2 | Part4 | Part6 | |
| G vs GRD | 0.80 | 0.49 | 1.00 | 0.93 | 1.00 | 0.44 | 0.60 | 0.80 |
| G vs ExG | 0.93 | 0.39 | 0.30 | 0.60 | 0.93 | 0.25 | 0.16 | 0.22 |
| G vs ICA | 0.44 | 0.09 | 0.34 |
| 0.30 | 0.16 | 0.45 |
|
| GRD vs ExG | 0.67 | 0.86 | 0.44 | 1.00 | 0.80 | 1.00 | 0.30 | 0.86 |
| GRD vs ICA | 0.22 | 0.73 | 0.39 | 0.22 | 0.26 | 0.75 | 0.67 | 0.26 |
| ExG vs ICA | 0.30 | 0.73 | 0.86 | 0.19 | 0.45 | 0.93 | 1.00 | 0.30 |
The Wilcoxon rank sum test results (p-values) for comparing different signal extraction methods and activities, algorithm No.2 (AR).
| Comparison | ||||||||
|---|---|---|---|---|---|---|---|---|
| Part1 | Part2 | Part4 | Part6 | Part1 | Part2 | Part4 | Part6 | |
| G vs GRD | 1.00 | 0.44 | 0.80 | 0.39 | 0.93 | 0.55 | 0.73 | 0.55 |
| G vs ExG | 1.00 | 0.34 | 0.39 | 0.22 | 1.00 | 0.19 | 0.22 | 0.26 |
| G vs ICA | 0.67 | 0.22 | 0.09 |
| 0.30 | 0.30 | 0.26 | 0.14 |
| GRD vs ExG | 0.93 | 0.73 | 0.30 | 0.67 | 0.86 | 0.49 | 0.14 | 0.60 |
| GRD vs ICA | 0.73 | 0.67 | 0.14 | 0.44 | 0.44 | 0.73 | 0.34 | 0.49 |
| ExG vs ICA | 1.00 | 0.80 | 0.26 | 0.73 | 0.34 | 0.73 | 0.60 | 1.00 |
The Wilcoxon rank sum test results (p-values) for comparing different signal extraction methods and activities, algorithm No.3 (TIME).
| Comparison | ||||||||
|---|---|---|---|---|---|---|---|---|
| Part1 | Part2 | Part4 | Part6 | Part1 | Part2 | Part4 | Part6 | |
| G vs GRD | 0.30 | 0.49 | 0.67 | 0.49 | 0.49 | 0.14 | 0.22 | 0.55 |
| G vs ExG | 0.34 | 0.80 | 0.22 | 0.49 | 0.60 | 0.17 | 0.39 | 0.34 |
| G vs ICA |
| 0.08 |
| 0.06 | 0.44 | 0.67 | 0.22 | 0.67 |
| GRD vs ExG | 0.86 | 0.80 | 0.55 | 0.80 | 0.86 | 0.93 | 1.00 | 0.73 |
| GRD vs ICA | 0.49 | 0.44 | 0.22 | 0.60 | 0.22 | 0.49 | 0.80 | 0.93 |
| ExG vs ICA | 0.44 | 0.30 | 0.60 | 0.86 | 0.22 | 0.30 | 0.86 | 0.55 |
The median sRate and RMSE for selected algorithms and signal extraction methods.
| Algorithm | ||||||||
|---|---|---|---|---|---|---|---|---|
| G | GRD | ExG | ICA | G | GRD | ExG | ICA | |
| PSD | 10.7 | 13.5 | 11.1 | 4.0 | 47% | 49% | 65% | 78% |
| AR | 15.6 | 12.3 | 11.6 | 5.8 | 48% | 45% | 61% | 48% |
| TIME | 16.8 | 13.1 | 10.9 | 8.6 | 35% | 43% | 45% | 26% |
| average | 14.4 | 13.0 | 11.2 | 6.1 | 43% | 46% | 57% | 51% |
The Wilcoxon rank sum test results (p-values) for comparing different algorithms.
| Comparison | ||||||||
|---|---|---|---|---|---|---|---|---|
| G | GRD | ExG | ICA | G | GRD | ExG | ICA | |
| PSD vs AR | 0.67 | 0.80 | 0.93 | 0.44 | 0.93 | 0.80 | 0.73 | 0.05 |
| PSD vs TIME |
| 0.73 | 0.67 | 0.11 | 0.26 | 0.44 | 0.26 |
|
| AR vs TIME | 0.16 | 0.80 | 0.86 | 0.26 | 0.22 | 0.49 | 0.55 | 0.06 |
The Pearson’s correlation values between the sRate and the average scene lighting.
| Algorithm | Correlation Value | p-Value | ||||||
|---|---|---|---|---|---|---|---|---|
| G | GRD | ExG | ICA | G | GRD | ExG | ICA | |
| PSD | 0.57 | 0.57 | 0.45 | 0.27 | 0.11 | 0.11 | 0.23 | 0.49 |
| AR | 0.56 | 0.61 | 0.46 | 0.47 | 0.11 | 0.08 | 0.22 | 0.20 |
| TIME | 0.50 |
| 0.62 | 0.51 | 0.17 | 0.03 | 0.08 | 0.16 |
The Pearson’s correlation values between the sRate and the standard deviation of the accelerations.
| Algorithm | Correlation Value | p-Value | ||||||
|---|---|---|---|---|---|---|---|---|
| G | GRD | ExG | ICA | G | GRD | ExG | ICA | |
| PSD | 0.27 |
|
| 0.24 | 0.49 | 0.02 | 0.04 | 0.53 |
| AR | 0.32 |
| 0.66 | 0.22 | 0.41 | 0.04 | 0.06 | 0.56 |
| TIME |
|
|
| 0.53 | 0.00 | 0.03 | 0.01 | 0.14 |