| Literature DB >> 30519506 |
Miha Finžgar1, Primož Podržaj1.
Abstract
BACKGROUND: Remote photoplethysmography (rPPG) is a promising optical method for non-contact assessment of pulse rate (PR) from video recordings. In order to implement the method in real-time applications, it is necessary for the rPPG algorithms to be capable of eliminating as many distortions from the pulse signal as possible.Entities:
Keywords: Biomedical monitoring; Heart rate; Image processing; Pulse rate; Remote photoplethymosgraphy; Signal processing
Year: 2018 PMID: 30519506 PMCID: PMC6267003 DOI: 10.7717/peerj.5859
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Performed tasks eliciting various emotions during the video recordings of the subjects in the MMSE-HR database.
| Task no. | Task identification | Task description | Target emotion |
|---|---|---|---|
| 1 | T1 | Listen to a funny joke (interview) | Happiness, amusement |
| 2 | T8 | Improvise a silly song | Embarrassment |
| 3 | T9 | Follow-up task similar to T8 | Embarrassment |
| 4 | T10 | Experience physical threat in dart game | Fear, nervousness |
| 5 | T11 | Submerge hand into ice water | Physical pain |
| 6 | T14 | Experience smelly odour | Disgust |
Note:
Adopted from Zhang et al. (2016).
Figure 1Framework of the proposed SB-CWT method.
Figure 2Schematic representation of the proposed SB-CWT method.
The illustration shows the entire procedure of extracting the pulse rate signal from facial recordings using the proposed SB-CWT. Notations above the arrows refer to the steps of the proposed algorithm. Notation s2 refers to the face detection, s3 to the ROI size adjustment, s4 to the spatial averaging of the pixel values inside the ROI in each frame, s5–7 to the signal inversion, windowing (l = 256) and temporal normalization, s8 to the continuous wavelet transform, s9–10 to the projection of wavelet coefficients on POS plane and alpha tuning, s11 to the application of weighting function to the wavelet coefficients, s12–13 to the inverse continuous wavelet transform with scale dependent thresholding and overlap-adding. The numbering of all the steps closely follows the numbering in the framework of the proposed algorithm depicted in Fig. 1. Notes: a.u. denotes the arbitrary units, R, G and B denote red, green and blue color channels, respectively. Colors of the signals in amplitude vs. time plots correspond to the RGB color channels. The output plot of s4 shows zero-mean raw RGB signals.
Comparison of the algorithmic steps of SB and the proposed SB-CWT.
| SB ( | SB-CWT | |
|---|---|---|
| Signal decomposition | FFT | CWT using generalized Morse wavelet |
| rPPG signal extraction | POS ( | POS ( |
| Weighting function | Ratio between the pulsatile amplitude and intensity variation amplitude | Scale-dependent energy distribution |
| Signal transformation back into the time domain | IFFT | ICWT |
Note:
CWT, continuous wavelet transform; FFT, fast Fourier transform; ICWT, inverse continuous wavelet transform; IFFT, inverse fast Fourier transform; POS, Plane-Orthogonal-to-Skin; rPPG, remote photoplethysmography.
Figure 3SNR comparison of the SB and the proposed SB-CWT method for four different window lengths (l).
(A) Results of the original proposed SB-CWT and state-of-the-art SB methods and (B) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals). The red lines in each box indicate the median SNR values, bottom and box borders indicate the 25th and 75th percentiles, the whiskers extend to the largest/smallest values not considered as outliers and the red plus signs mark the outliers.
Figure 4Scatter plots and regression line plots comparing the estimated pulse rates obtained by the proposed SB-CWT and state-of-the-art SB algorithms (PRrPPG) with the pulse rates obtained from the reference signal (PRref).
Pulse rates estimated from the rPPG signals (PRrPPG) are plotted against the pulse rates estimated from the reference pulse signals (PRref) for different window lengths and implementations of the algorithms: (A) Results of the original SB-CWT and SB methods for l = 32, (B) Results of the original SB-CWT and SB methods for l = 64, (C) Results of the original SB-CWT and SB methods for l = 128, (D) Results of the original SB-CWT and SB methods for l = 256, (E) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 32, (F) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 64, (G) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 128, (H) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 256. Each subplot shows the equations of the regression lines and coefficients of determination (r2).
Figure 5The Bland–Altman plots showing agreements between the estimated pulse rates obtained by the proposed SB-CWT and state-of-the-art SB algorithms (PRrPPG) and the pulse rates obtained from the reference signals (PRref).
Figure shows the relation between the pulse rates estimated from the rPPG signals (PRrPPG) and the reference signals (PRref) for different window lengths and implementations of the algorithms: (A) Results of the original SB-CWT and SB methods for l = 32, (B) Results of the original SB-CWT and SB methods for l = 64, (C) Results of the original SB-CWT and SB methods for l = 128, (D) Results of the original SB-CWT and SB methods for l = 256, (E) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 32, (F) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 64, (G) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 128, (H) Results of the modified SB-CWT and SB methods (without the weighting of the sub-band signals) for l = 256. The solid lines denote the mean differences. Between the upper and lower limits of the agreement (shown as dashed lines) lie 95% of all the differences.
Evaluation of the performance of the state-of-the-art SB and the proposed SB-CWT algorithms in terms of MAE, MPE and RMS for four different window lengths l.
| MAE (BPM) | MPE (%) | RMSE (BPM) | ||||
|---|---|---|---|---|---|---|
| SB ( | SB-CWT | SB ( | SB-CWT | SB ( | SB-CWT | |
| 22.2 | 5.7 | −28.7 | −7.1 | 222.9 | 54.0 | |
| 20.7 | 4.1 | −26.3 | −4.6 | 208.0 | 35.1 | |
| 19.6 | 3.0 | −24.6 | −3.2 | 196.2 | 23.9 | |
| 18.9 | 2.4 | −23.6 | −2.5 | 189.0 | 18.3 | |
Note:
MAE, mean absolute error; MPE, mean percentage error; RMSE, root mean square error.
Computational times of the state-of-the-art SB and the proposed SB-CWT algorithms for the facial recording of subject F005 performing task 11 (total of 1,552 frames or 62.08 s) for all window lengths.
| SB ( | 11.25 ± 0.16 | 11.43 ± 0.30 | 11.44 ± 0.17 | 11.32 ± 0.09 |
| SB-CWT | 16.51 ± 0.12 | 17.00 ± 0.19 | 17.48 ± 0.19 | 17.41 ± 0.28 |
Note:
Values are expressed as mean ± standard deviation and are calculated based on the computational times of five consecutive executions of the code.
List of publicly available datasets suitable for rPPG studies.
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
Notes:
BP, blood pressure; BVP, blood volume pulse; ECG, electrocardiography; EEG, electroencephalography; EMG, electromyography; EOG, electrooculography; F, female; GSR, galvanic skin response; IR, infrared; M, male; NIR, near-infrared; PPG, photoplethysmography; resp. rate, respiratory rate; RGB, red, green, blue color space; skin temp., skin temperature; SpO2, blood oxygen saturation; st. dev., standard deviation.
Videos are available for only 22 subjects.