| Literature DB >> 35547575 |
Qunfeng Tang1,2, Zhencheng Chen1, Yanke Guo1, Yongbo Liang1, Rabab Ward2, Carlo Menon3, Mohamed Elgendi2,3.
Abstract
Electrocardiography and photoplethysmography are non-invasive techniques that measure signals from the cardiovascular system. While the cycles of the two measurements are highly correlated, the correlation between the waveforms has rarely been studied. Measuring the photoplethysmogram (PPG) is much easier and more convenient than the electrocardiogram (ECG). Recent research has shown that PPG can be used to reconstruct the ECG, indicating that practitioners can gain a deep understanding of the patients' cardiovascular health using two physiological signals (PPG and ECG) while measuring only PPG. This study proposes a subject-based deep learning model that reconstructs an ECG using a PPG and is based on the bidirectional long short-term memory model. Because the ECG waveform may vary from subject to subject, this model is subject-specific. The model was tested using 100 records from the MIMIC III database. Of these records, 50 had a circulatory disease. The results show that a long ECG signal could be effectively reconstructed from PPG, which is, to our knowledge, the first attempt in this field. A length of 228 s of ECG was constructed by the model, which was trained and validated using 60 s of PPG and ECG signals. To segment the data, a different approach that segments the data into short time segments of equal length (and that do not rely on beats and beat detection) was investigated. Segmenting the PPG and ECG time series data into equal segments of 1-min width gave the optimal results. This resulted in a high Pearson's correlation coefficient between the reconstructed 228 s of ECG and referenced ECG of 0.818, while the root mean square error was only 0.083 mV, and the dynamic time warping distance was 2.12 mV per second on average.Entities:
Keywords: cardiology; data science; digital health; electrocadiogram; intensive and critical care; vital sign analysis
Year: 2022 PMID: 35547575 PMCID: PMC9082149 DOI: 10.3389/fphys.2022.859763
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.755
FIGURE 1Systole and diastole in ECG and PPG. The “O″” stands for the onset in PPG.
FIGURE 2Flowchart of constructing an electrocardiogram (ECG) signal from a photoplethysmogram (PPG) signal. The bidirectional long short-term memory (BiLSTM) model is trained and validated for 1 minute to generate 3.8 min ECG signal.
FIGURE 3Extracts of experimental results in patients with and without circulatory disease. The red and black curves are the reconstruction ECG and reference ECG, respectively. The abbreviations r, rmse, and d are the Pearson’s correlation coefficient, root mean square error, and dynamic time warping distance, respectively. (I), (II), (III), and (IV) are the results of a subject with a circulatory disease in segments of one, two, three, and 4 seconds, respectively. (V), (VI), (VII), and (VIII) are the results of a subject without any circulatory disease in segments of one, two, three, and 4 seconds, respectively. The PPG is the corresponding segment used to generate the ECG signals.
FIGURE 4Comparison of reconstruction ECG and reference ECG with and without alignment. The abbreviations r , rmse , and d are calculated after aligning the reconstruction ECG and reference ECG, respectively.
FIGURE 5The optimal warping path of the reconstruction and reference ECGs using DTW. (I) shows an example of a 1-s segment. (II) shows the result of the whole 228-s test set for one subject. d is the DTW distance of the 228-s reconstruction and reference ECGs. And is d divided by the signal time length of 228 s.
The results of the test in different lengths. The abbreviations r, rmse, and d refer to the Pearson’s correlation coefficient (r), root mean square error rmse, and dynamic time warping distance between the reconstruction ECG and reference ECG, respectively. The r , rmse and d represent the values between the reconstruction ECG and reference ECG after stitching all the segments together. The abbreviations r , rmse , r , and rmse is calculated after aligning the reconstruction ECG and reference ECG in the cross-correlation. Finally, and represent d and d divided by the signal time length, respectively.
| Segment Length (seconds) | Dataset | Number of Segments |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
|
| Train | 48 | 0.929 ± 0.095 | 0.048 ± 0.034 | 0.926 ± 0.047 | 0.052 ± 0.027 | 2.087 ± 1.263 | 95.569 ± 44.683 |
| (0.963 ± 0.039) | (0.038 ± 0.024) | (0.928 ± 0.046) | (0.052 ± 0.026) | (2.087 ± 1.263) | (1.991 ± 0.931) | |||
| Validation | 12 | 0.840 ± 0.175 | 0.071 ± 0.047 | 0.838 ± 0.108 | 0.077 ± 0.037 | 2.319 ± 1.718 | 26.177 ± 13.084 | |
| (0.945 ± 0.065) | (0.045 ± 0.031) | (0.893 ± 0.066) | (0.063 ± 0.032) | (2.319 ± 1.718) | (2.181 ± 1.090) | |||
| Test | 228 | 0.782 ± 0.222 | 0.084 ± 0.061 | 0.774 ± 0.113 | 0.094 ± 0.046 | 2.432 ± 1.809 | 512.876 ± 253.379 | |
|
|
| (0.805 ± 0.095) | (0.088 ± 0.045) | (2.432 ± 1.809) | (2.249 ± 1.111) | |||
|
| Train | 24 | 0.930 ± 0.085 | 0.048 ± 0.031 | 0.929 ± 0.054 | 0.051 ± 0.026 | 4.109 ± 2.050 | 95.267 ± 42.718 |
| (0.956 ± 0.037) | (0.041 ± 0.022) | (0.931 ± 0.052) | (0.050 ± 0.026) | (2.054 ± 1.025) | (1.985 ± 0.890) | |||
| Validation | 6 | 0.809 ± 0.151 | 0.079 ± 0.046 | 0.806 ± 0.109 | 0.083 ± 0.040 | 4.559 ± 2.313 | 26.316 ± 11.799 | |
| (0.932 ± 0.064) | (0.050 ± 0.029) | (0.874 ± 0.077) | (0.068 ± 0.033) | (2.280 ± 1.157) | (2.193 ± 0.983) | |||
| Test | 114 | 0.791 ± 0.193 | 0.083 ± 0.050 | 0.788 ± 0.096 | 0.089 ± 0.039 | 4.643 ± 2.497 | 502.629 ± 217.656 | |
| (0.924 ± 0.078) | (0.052 ± 0.033) | (0.817 ± 0.071) | (0.084 ± 0.037) | (2.322 ± 1.249) | (2.205 ± 0.955) | |||
|
| Train | 16 | 0.920 ± 0.101 | 0.050 ± 0.034 | 0.919 ± 0.077 | 0.053 ± 0.030 | 6.323 ± 3.502 | 98.667 ± 51.403 |
| (0.947 ± 0.067) | (0.043 ± 0.026) | (0.920 ± 0.077) | (0.053 ± 0.030) | (2.108 ± 1.167) | (2.056 ± 1.071) | |||
| Validation | 4 | 0.826 ± 0.154 | 0.079 ± 0.044 | 0.826 ± 0.122 | 0.079 ± 0.040 | 4.559 ± 2.313 | 26.316 ± 11.799 | |
| (0.917 ± 0.082) | (0.054 ± 0.031) | (0.891 ± 0.091) | (0.063 ± 0.034) | (2.298 ± 1.220) | (2.243 ± 1.154) | |||
| Test | 76 | 0.783 ± 0.199 | 0.084 ± 0.049 | 0.788 ± 0.121 | 0.089 ± 0.038 | 6.998 ± 3.744 | 510.932 ± 243.621 | |
| (0.911 ± 0.095) | (0.055 ± 0.034) | (0.813 ± 0.088) | (0.084 ± 0.036) | (2.333 ± 1.248) | (2.241 ± 1.069) | |||
|
| Train | 12 | 0.926 ± 0.082 | 0.049 ± 0.030 | 0.925 ± 0.054 | 0.052 ± 0.025 | 7.970 ± 3.926 | 93.664 ± 42.493 |
| (0.950 ± 0.043) | (0.043 ± 0.023) | (0.926 ± 0.052) | (0.051 ± 0.025) | (1.993 ± 0.982) | (1.951 ± 0.885) | |||
| Validation | 3 | 0.835 ± 0.140 | 0.075 ± 0.042 | 0.834 ± 0.119 | 0.076 ± 0.039 | 8.523 ± 4.018 | 25.015 ± 11.053 | |
| (0.920 ± 0.064) | (0.053 ± 0.030) | (0.904 ± 0.065) | (0.059 ± 0.031) | (2.131 ± 1.005) | (2.085 ± 0.921) | |||
| Test | 57 | 0.792 ± 0.186 | 0.083 ± 0.048 | 0.790 ± 0.103 | 0.088 ± 0.037 | 8.769 ± 4.330 | 483.404 ± 207.199 | |
| (0.910 ± 0.086) | (0.055 ± 0.032) |
|
| (2.192 ± 1.082) |
| |||
Comparison of this paper and other papers in the subject-specific model. The ‘NR’ stands for not reported.
| Segmentation Method | Data Used | The Training Segment Length per Subject (minutes) | Performance | ||||||
| Test to Training Ratio: 0.25 | Test to Training Ratio: 4.75 | ||||||||
|
|
|
|
|
|
| ||||
| This paper (subject-specific) | Seconds | MIMIC III | 0.8 | 0.904 | 0.059 | 2.085 | 0.818 | 0.083 | 2.120 |
| Beat-based model | Beat | TBME-RR | 6.4 | 0.984 | NR | NR | NR | NR | NR |
| MIMIC III | 4 | 0.940 | NR | NR | NR | NR | NR | ||
| Self-collected: 2 subjects | 24 and 33.6 | 0.904 | NR | NR | NR | NR | NR | ||
| XDJDL model | Beat | MIMIC III | 12 | 0.88 | NR | NR | NR | NR | NR |