| Literature DB >> 29018352 |
Antonio Cicone1, Hau-Tieng Wu2,3.
Abstract
Despite the population of the noninvasive, economic, comfortable, and easy-to-install photoplethysmography (PPG), it is still lacking a mathematically rigorous and stable algorithm which is able to simultaneously extract from a single-channel PPG signal the instantaneous heart rate (IHR) and the instantaneous respiratory rate (IRR). In this paper, a novel algorithm called deppG is provided to tackle this challenge. deppG is composed of two theoretically solid nonlinear-type time-frequency analyses techniques, the de-shape short time Fourier transform and the synchrosqueezing transform, which allows us to extract the instantaneous physiological information from the PPG signal in a reliable way. To test its performance, in addition to validating the algorithm by a simulated signal and discussing the meaning of "instantaneous," the algorithm is applied to two publicly available batch databases, the Capnobase and the ICASSP 2015 signal processing cup. The former contains PPG signals relative to spontaneous or controlled breathing in static patients, and the latter is made up of PPG signals collected from subjects doing intense physical activities. The accuracies of the estimated IHR and IRR are compared with the ones obtained by other methods, and represent the state-of-the-art in this field of research. The results suggest the potential of deppG to extract instantaneous physiological information from a signal acquired from widely available wearable devices, even when a subject carries out intense physical activities.Entities:
Keywords: de-shape short time Fourier transform; de-shape synchrosqueezing transform; instantaneous heart rate; instantaneous respiratory rate; photoplethysmography
Year: 2017 PMID: 29018352 PMCID: PMC5615790 DOI: 10.3389/fphys.2017.00701
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
Figure 1The flow chart of the proposed algorithm, deppG, to extract the instantaneous heart rate (IHR) and instantaneous respiratory rate (IRR) from the recorded PPG signal. A typical recorded PPG signal lasting for 20 s is shown on the left. The short time Fourier transform (STFT), and hence the spectrogram, of the input PPG signal are then evaluated. The intensity of the spectrogram at a point (t, ξ) in the time-frequency plane indicates how strong the signal oscillates at time t and frequency ξ. The dark curve around 1.6 Hz represents the IHR, while the gray curve around 3.2 Hz (and 4.8 Hz, 6.4 Hz, etc. The frequency axis above 4 Hz is not shown) comes from the non-sinusoidal oscillation of the cardiac activity. Similarly, the dark curve around 0.3 Hz represents the IRR, while the gray curve around 0.6 Hz comes from the non-sinusoidal oscillation of the respiratory activity. With the STFT and the spectrogram of the PPG signal, the nonlinear mask is then designed from the spectrogram and the phase function is determined from the STFT. The intensity of the phase function at a point (t, ξ) in the time-frequency plane indicates the angle of the complex value of the STFT at time t and frequency ξ, which ranges from 0 to 2π. By applying the nonlinear mask and the phase function of the STFT to the spectrogram, the spectrogram is improved and we obtain the de-shaped spectrogram. The darker curve around 1.6 Hz represents the IHR and the lighter curve around 0.3 Hz represents the IRR. The curves corresponding to the IHR and IRR are extracted from the de-shaped spectrogram, which are shown as the red and blue curves respectively superimposed on the de-shaped spectrogram.
Figure 2(A) The photoplethysmogram (PPG) signal of the dataset 0031_8 min in the Capnobase database lasting for 50 s. The signal is shifted by 1 to simulate the trend commonly encountered in real data. (B) The “windowed” PPG signal that is generated by multiplying the PPG signal by a Gaussian window centered at the 25-th second. (C) The power spectrum of the windowed PPG signal shown in (B). The fundamental frequency, 1.2Hz, and its multiples, 1.2 × 2, 1.2 × 3, …, etc, are indicated by the blue arrows. Note that the magnitude varies from peak to peak, which depends on the non-sinusoidal oscillation of the PPG signal. Also note that this power spectrum is the spectrogram at the 25-th second. (D) The 0.1 power of the power spectrum shown in (C). It is clear that the magnitudes of all peaks become more uniform after taking the fractional power. We could thus view it as a periodic function with the “period” exactly the same as the fundamental frequency of the PPG signal. (E) The spectrum of the (D) that contains the fundamental period information, which is the inverse of the fundamental frequency, of the PPG signal at the 25-th second. Specifically, that the peaks indicated by the red arrows located at 1/1.2, 2/1.2, …, etc, are associated with the fundamental period and its multiples. (F) The nonlinear mask determined by inverting the quefrency axis by sending a nonzero quefrency q to 1/q. We could clearly see that the peaks indicated by the red arrows in (E) become the peaks indicated by the green arrows (F). By a direct algebraic calculation, the peaks indicated by the green arrows correspond to the fundamental frequency, 1.2 Hz, and its divisions, 1.2/2, 1.2/3, …, etc.
Figure 3Top panel: a subinterval of the simulated photoplethysmogram (PPG) signal with the instantaneous heart rate (IHR) determined from 0009_8min in the Capnobase database and 20dB signal-to-noise ratio. a.u. means “arbitrary unit.” Middle panel: the spectrogram determined by the short time Fourier transform (STFT) is shown on the left hand side, the de-shaped spectrogram is shown in the middle, and the zoomed in de-shaped spectrogram is shown on the right hand side. The red arrow indicates the band associated with the fundamental frequency of the hemodynamic component, the green arrows indicate those bands associated with the multiples of the hemodynamic component, and the blue arrow indicates the band associated with the fundamental frequency of the respiratory component. Bottom panel: a further zoomed in de-shape spectrogram is shown on the left hand side and the middle, with the ground truth IHR superimposed in the middle, and the error between the estimated IHR and the ground truth IHR is shown on the right hand side. The orange arrows indicate the possible oversmoothing effect of the deppG algorithm. See the main context for the interpretation of this figure.
Summary of root mean square error (RMS) and mean absolute error (MAE) of the respiratory rate (RR) and heart rate (HR) estimation for the Capnobase benchmark database.
| Smart Fusion (Karlen et al., | 3.00 | 4.70 | 0.60 | 1.56 | 3.15 | 2.43 | 3.72 | n/a | n/a | n/a |
| CSD - 120s (Garde et al., | n/a | n/a | 0.27 | 0.95 | 6.20 | n/a | n/a | n/a | n/a | n/a |
| CSD - 60s (Garde et al., | n/a | n/a | n/a | 1.77 | n/a | n/a | n/a | n/a | n/a | n/a |
| PSD - 120s (Garde et al., | n/a | n/a | 1.20 | 3.18 | 11.30 | n/a | n/a | n/a | n/a | n/a |
| Garde et al., | n/a | n/a | 1.10 | 3.50 | 11.00 | n/a | n/a | n/a | n/a | n/a |
| Shelley et al., | n/a | n/a | 0.41 | 1.91 | 7.01 | n/a | n/a | n/a | n/a | n/a |
| Nakajima et al., | n/a | n/a | 0.59 | 7.47 | 10.60 | n/a | n/a | n/a | n/a | n/a |
| Zhang and Ding, | 2.44 | 4.08 | n/a | n/a | n/a | 1.52 | 2.73 | n/a | n/a | n/a |
| BCLA - (Zhu T. et al., | n/a | n/a | n/a | n/a | n/a | 1.97 | 0.40 | n/a | n/a | n/a |
| Dehkordi et al., | n/a | n/a | n/a | 0.39 | n/a | n/a | n/a | n/a | n/a | n/a |
| deppG vs. reference IRR | 1.39 | 1.87 | 0.38 | 0.73 | 1.70 | 0.94 | 1.37 | 0.22 | 0.50 | 0.83 |
| deppG-60s vs. reference ARR | 0.78 | 1.60 | 0.09 | 0.22 | 0.62 | 0.53 | 1.16 | 0.07 | 0.15 | 0.43 |
| Smart Fusion (Karlen et al., | n/a | n/a | 0.37 | 0.48 | 0.77 | n/a | n/a | n/a | n/a | n/a |
| CSD - 120s (Garde et al., | n/a | n/a | 0.34 | 0.76 | 1.45 | n/a | n/a | n/a | n/a | n/a |
| PSD - 120s (Garde et al., | n/a | n/a | 0.21 | 0.58 | 1.17 | n/a | n/a | n/a | n/a | n/a |
| Garde et al., | n/a | n/a | 0.20 | 0.35 | 0.59 | n/a | n/a | n/a | n/a | n/a |
| Wadehn et al., | n/a | n/a | n/a | n/a | n/a | 0.16 | 0.24 | n/a | n/a | n/a |
| deppG vs. reference IHR | 0.93 | 0.57 | 0.50 | 0.72 | 1.20 | 0.61 | 0.35 | 0.38 | 0.52 | 0.84 |
| deppG-60s vs. reference AHR | 0.23 | 0.49 | 0.07 | 0.09 | 0.19 | 0.15 | 0.29 | 0.05 | 0.07 | 0.12 |
Except deppG, the methods proposed so far in the literature do not focus on computing instantaneous rates, but average rates over a time window. n/a: not available. Std: standard deviation. Q.
Sliding window of 32s with 1s shifts.
Sliding window of 120s with 50% overlap. CSD: correntropy spectral density. PSD: power spectral density.
Sliding window of 60s with 50% overlap.
Sliding window of 82s.
No information is provided on the window size.
Sliding window of 32s with an increment of 3s. BCLA: Bayesian Continuous-Valued Label Aggregator.
The values reported are obtained excluding 4 datasets from the statistics due to contamination of their PPG or CO2 signals with artifacts for more than 50% of their duration. However, the authors do not specify which datasets have been removed. No information is provided regarding the window length.
Sliding window of 8s with 6s overlap.
The average absolute error (AAE, the same as the mean absolute error) and average absolute error percentage (AAEP) of the heart rate estimation for the ICASSP 2015 signal processing cup database.
| TROIKA - (Zhang et al., | 2.29 | 2.19 | 2.00 | 2.15 | 2.01 | 2.76 | 1.67 | 1.93 | 1.86 | 4.70 | 1.72 | 2.84 | 2.34 | 0.83 |
| (Schack et al., | 2.40 | 1.21 | 1.20 | 1.22 | 1.34 | 1.44 | 1.16 | 1.04 | 1.18 | 5.33 | 2.18 | 1.52 | 1.77 | 1.20 |
| SPECTRAP - (Sun and Zhang, | 1.18 | 2.42 | 0.86 | 1.38 | 0.92 | 1.37 | 1.53 | 0.64 | 0.60 | 3.65 | 0.92 | 1.25 | 1.39 | 0.86 |
| MICROST - (Zhu S. et al., | 2.93 | 3.06 | 2.03 | 2.29 | 2.64 | 2.58 | 1.97 | 1.77 | 1.87 | 3.81 | 1.91 | 4.07 | 2.58 | 0.77 |
| MISPT - (Murthy et al., | 1.58 | 1.80 | 0.58 | 0.99 | 0.74 | 0.93 | 0.73 | 0.45 | 0.41 | 3.60 | 0.88 | 0.68 | 1.11 | 0.89 |
| Zong and Jafari, | 1.05 | 0.98 | 1.26 | 1.33 | 0.66 | 0.77 | 0.41 | 0.47 | 0.35 | 3.49 | 0.50 | 1.52 | 1.07 | 0.86 |
| Frigo et al., | 2.11 | 1.89 | 1.01 | 1.08 | 0.61 | 1.66 | 0.54 | 0.59 | 0.54 | 4.12 | 1.15 | 2.83 | 1.51 | 1.09 |
| D'souza et al., | 3.93 | 3.30 | 2.81 | 2.07 | 0.90 | 2.50 | 0.83 | 1.08 | 0.75 | 3.68 | 1.65 | 3.60 | 2.26 | 1.21 |
| Zhang et al., | 2.06 | 3.59 | 0.92 | 1.54 | 0.97 | 1.64 | 2.25 | 0.63 | 0.62 | 4.62 | 1.30 | 1.80 | 1.83 | 1.21 |
| Mashhadi et al., | 1.72 | 1.33 | 0.90 | 1.28 | 0.93 | 1.41 | 0.61 | 0.88 | 0.59 | 3.78 | 0.85 | 0.71 | 1.25 | 0.87 |
| TROIKA - Zhang et al. - 25 Hz (Zhang et al., | 3.05 | 3.49 | 1.49 | 2.03 | 1.46 | 2.35 | 1.76 | 1.42 | 1.28 | 5.73 | 1.79 | 3.02 | 2.41 | 1.28 |
| Khan et al. - double channel - 25 Hz (Khan et al., | 1.64 | 0.81 | 0.57 | 1.44 | 0.77 | 1.06 | 0.63 | 0.47 | 0.52 | 2.94 | 1.05 | 0.91 | 1.07 | 0.69 |
| Khan et al. - single channel - 25 Hz (Khan et al., | 2.55 | 3.45 | 0.73 | 1.19 | 0.51 | 1.09 | 0.52 | 0.43 | 0.36 | 3.33 | 0.89 | 0.98 | 1.34 | 1.12 |
| deppG vs. reference IHR | 3.00 | 3.06 | 2.98 | 2.26 | 2.47 | 2.71 | 2.49 | 3.55 | 2.92 | 5.70 | 1.80 | 2.76 | 2.97 | 0.97 |
| deppG-IF vs. reference AHR | 1.30 | 0.52 | 0.47 | 1.41 | 0.47 | 0.75 | 0.68 | 0.51 | 0.30 | 3.72 | 0.96 | 0.60 | 0.97 | 0.93 |
| TROIKA - (Zhang et al., | 1.90 | 1.87 | 1.66 | 1.82 | 1.49 | 2.25 | 1.26 | 1.62 | 1.59 | 2.93 | 1.15 | 1.99 | 1.79 | 0.47 |
| SPECTRAP - (Sun and Zhang, | 1.04 | 2.33 | 0.66 | 1.31 | 0.74 | 1.14 | 1.36 | 0.55 | 0.52 | 2.27 | 0.65 | 1.02 | 1.13 | 0.61 |
| MICROST - (Zhu S. et al., | 2.55 | 2.94 | 1.60 | 1.89 | 1.80 | 2.03 | 1.49 | 1.50 | 1.64 | 2.39 | 1.31 | 2.76 | 1.99 | 0.54 |
| Frigo et al., | 1.71 | 1.56 | 0.88 | 1.00 | 0.46 | 1.37 | 0.42 | 0.52 | 0.48 | 2.75 | 0.74 | 1.86 | 1.15 | 0.72 |
| D'souza et al., | 3.02 | 3.02 | 2.20 | 1.96 | 0.67 | 2.09 | 0.63 | 0.93 | 0.62 | 2.29 | 1.06 | 2.60 | 1.76 | 0.93 |
| Zhang et al., | 1.66 | 3.50 | 0.73 | 1.41 | 0.72 | 1.24 | 1.55 | 0.53 | 0.51 | 2.83 | 0.84 | 1.25 | 1.40 | 0.92 |
| Mashhadi et al., | 1.5 | 1.3 | 0.75 | 1.2 | 0.69 | 1.2 | 0.5 | 0.8 | 0.5 | 2.4 | 0.6 | 0.5 | 1.00 | 0.56 |
| deppG | 2.85 | 3.01 | 2.50 | 1.93 | 1.89 | 2.33 | 2.01 | 3.22 | 2.60 | 3.84 | 1.23 | 2.17 | 2.47 | 0.70 |
| deppG-IF | 1.18 | 0.50 | 0.40 | 1.20 | 0.36 | 0.58 | 0.51 | 0.45 | 0.26 | 2.43 | 0.66 | 0.44 | 0.75 | 0.61 |
The unit for the heart rate (HR) is beats per minute. We compare results and statistics of previously developed method with the ones of deppG algorithm with and without a smoothing window of 8 s shifted of 2 s. Sbj: subject. std: standard deviation.
8s windows with 6s overlap
8s windows. The overlap is unknown
The authors compare their results with the ones of Zhang (.
This value has been recomputed using Matlab mean function and does not match the value reported in the original paper.
This value has been recomputed using Matlab std function and does not match the value reported in the original paper.
The authors did not mention in their work if they used windows or not
8s windows. The overlap is unknown. The authors do not specify if they are using truncated or untruncated datasets and if they are sampling at 25 or 125 Hz. In their table they compare their results with the ones obtained in the literature for both untruncated and 125 Hz sampled datasets and truncated and 25 Hz sampled ones. See Ref. Table II in Mashhadi et al. (.
8s windows with 6s overlap. Sampling rate at 25 Hz. These values are provided in Khan et al. (2015)
λ = 0.023
λ = 0.021
Figure 4Top row: the photoplethysmography signal of subject 9 in the training dataset of ICASSP 2015 signal processing cup. Second row: the spectrogram is shown on the left panel, the de-shaped spectrogram is shown in the middle panel, and the de-shaped spectrogram with the acceleration offset is shown on the right panel. The dominant curve indicated by the blue arrow in the de-shaped spectrogram with the acceleration offset is the instantaneous heart rate of the subject. On the other hand, the lighter curve indicated by the red arrow in the de-shaped spectrogram is directly related to the body swaying pattern. The heartbeat component displayed in the de-shaped spectrogram follow clearly the running pattern.