| Literature DB >> 35237576 |
Andrea Rozo1,2, Jonathan Moeyersons1, John Morales1, Roberto Garcia van der Westen3, Lien Lijnen4, Christophe Smeets5, Sjors Jantzen6, Valerie Monpellier7, David Ruttens4,5, Chris Van Hoof8,9,10, Sabine Van Huffel1, Willemijn Groenendaal3, Carolina Varon1,2.
Abstract
Changes in respiratory rate have been found to be one of the early signs of health deterioration in patients. In remote environments where diagnostic tools and medical attention are scarce, such as deep space exploration, the monitoring of the respiratory signal becomes crucial to timely detect life-threatening conditions. Nowadays, this signal can be measured using wearable technology; however, the use of such technology is often hampered by the low quality of the recordings, which leads more often to wrong diagnosis and conclusions. Therefore, to apply these data in diagnosis analysis, it is important to determine which parts of the signal are of sufficient quality. In this context, this study aims to evaluate the performance of a signal quality assessment framework, where two machine learning algorithms (support vector machine-SVM, and convolutional neural network-CNN) were used. The models were pre-trained using data of patients suffering from chronic obstructive pulmonary disease. The generalization capability of the models was evaluated by testing them on data from a different patient population, presenting normal and pathological breathing. The new patients underwent bariatric surgery and performed a controlled breathing protocol, displaying six different breathing patterns. Data augmentation (DA) and transfer learning (TL) were used to increase the size of the training set and to optimize the models for the new dataset. The effect of the different breathing patterns on the performance of the classifiers was also studied. The SVM did not improve when using DA, however, when using TL, the performance improved significantly (p < 0.05) compared to DA. The opposite effect was observed for CNN, where the biggest improvement was obtained using DA, while TL did not show a significant change. The models presented a low performance for shallow, slow and fast breathing patterns. These results suggest that it is possible to classify respiratory signals obtained with wearable technologies using pre-trained machine learning models. This will allow focusing on the relevant data and avoid misleading conclusions because of the noise, when designing bio-monitoring systems.Entities:
Keywords: data augmentation; machine learning; respiratory monitoring; signal quality; transfer learning
Year: 2022 PMID: 35237576 PMCID: PMC8884147 DOI: 10.3389/fbioe.2022.806761
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
FIGURE 1Location of the electrodes of the wearable device (A) Electrode placement for the COPD dataset. The electrodes were placed symmetrically from the midsternal line and only the right side is represented (B) Electrode placement for the bariatric surgery dataset.
FIGURE 2Controlled breathing protocol followed by a BS patient. Each of the breathing types are specified as: spontaneous (Sp), chest (Ch), shallow (Sh), abdominal (Ab), slow (Sl) and fast (Fa) (a.u.) stands for arbitrary units.
FIGURE 3Example of different labels for quality annotation (A) Label 1—Excellent signal quality (B) Label 2—Good signal quality (C) Label 3—Average signal quality (D) Label 4—Bad signal quality (E) Label 5—Bad reference quality. In all plots, black line corresponds to the bio-impedance signal and the gray line to the reference signal (a.u.) stands for arbitrary units.
FIGURE 4Example of the data augmentation methods. Each row presents one method (A) Original signal (B) Mirrored signal with respect to y axis (C) mirrored signal with respect to x axis. (D) Modulating signal used for the second method (E) normalized modulated signal (F) Resulting segment after removing the 10% of the points (shaded areas at each end of the signal) (G) resampled signal with slower breathing rate (H) Second original signal to be concatenated to the first one (I) resampled signal with faster breathing rate. In (B) (C) (E) and (G), the gray line shows the original signal. In (I) the dashed gray line corresponds to the point in which the signals are concatenated; the left side corresponds to the signal in (A), the right side to the signal in (H) (a.u.) stands for arbitrary units.
Overview of the datasets, indicating the number of segments in each class. The suffix corresponds to the type of breathing imposed during the respiratory protocol. On the left, the total number of segments before data augmentation (DA). On the right, the segments after DA.
| Group | Before DA | After DA | ||||
|---|---|---|---|---|---|---|
| Clean | Noisy | Total | Clean | Noisy | Total | |
| COPD | 1,118 | 353 | 1,471 | 2072 | 2081 | 4,153 |
| BS-Sp | 202 | 240 | 442 | 808 | 807 | 1,615 |
| BS-Ch | 181 | 317 | 498 | 724 | 722 | 1,446 |
| BS-Sh | 181 | 305 | 486 | 724 | 722 | 1,446 |
| BS-Ab | 208 | 256 | 464 | 832 | 832 | 1,664 |
| BS-Sl | 83 | 216 | 299 | 332 | 330 | 662 |
| BS-Fa | 26 | 83 | 109 | 104 | 104 | 208 |
Performance of the machine learning models for each of the sub-groups of the BS dataset. The results are presented as median AUC (25th percentile—75th percentile) (%). On the left side, the results for the SVM. On the right side, the results for the CNN. The best results (i.e., higher AUC) for each model for each breathing type are in bold.
| — | SVM | CNN | ||||
|---|---|---|---|---|---|---|
| Original | DA | TL + DA | Original | DA | TL + DA | |
| Sp | 91.03 | 66.92 |
| 93.43 |
| 97.92**,* |
| [88.29–93.10] | [63.35–70.41] | [91.03–94.85] | [92.49–93.81] | [96.69–98.81] | [97.24–98.21] | |
| Ch | 84.80 | 70.84 |
| 90.56 |
| 96.10*** |
| [83.24–91.88] | [67.31–73.18] | [86.51–91.40] | [89.54–93.08] | [95.94–96.70] | [95.68–96.29] | |
| Sh | 79.68 | 52.18 |
| 83.86 | 84.84 |
|
| [73.89–83.04] | [49.47–59.64] | [79.34–88.06] | [80.06–89.73] | [82.88–89.45] | [87.89–92.82] | |
| Ab | 81.70 | 47.90 |
| 91.74 | 94.74** |
|
| [77.78–86.18] | [43.55–51.21] | [83.45–87.77] | [91.07–93.92] | [93.01–96.23] | [93.5–97.25] | |
| Sl | 85.65 | 71.73 |
| 78.12 |
| 89.26** |
| [79.41–86.46] | [66.52–74.34] | [79.20–83.92] | [72.89–86.83] | [87.77–93.51] | [87.23–93.05] | |
| Fa | 69.09 | 61.25 |
| 73.02 |
| 83.41* |
| [59.64–75.69] | [45.86–70.56] | [65.71–76.64] | [71.82–77.78] | [83.00–87.50] | [74.74–92.85] | |
Breathing patterns: Sp, spontaneous; Ch, chest; Sh, shallow; Ab, abdominal; Sl, slow; Fa, fast.
Significant results compared to the Original model, *p < 0.05, **p < 0.01, ***p ≪ 0.01.
Significant results compared to the DA, model, + p < 0.05, ++ p ≪ 0.01.
FIGURE 5ROC of the SVM models for each of the sub-groups of the BS dataset (A) Spontaneous breathing (Sp) (B) Chest breathing (Ch) (C) Shallow breathing (Sh) (D) Abdominal breathing (Ab) (E) Slow breathing (Sl) (F) Fast breathing (Fa). The curves present the median ROC for each model, in black the original, in dark gray the SVM-DA and in light gray the SVM-TL + DA. The dotted line corresponds to the random guess. It is observed that in all the breathing types the DA model presents a worse behavior than the other models, and that the TL + DA model shows a similar behavior to the Original one.
FIGURE 6ROC of the CNN models for each of the sub-groups of the BS dataset (A) Spontaneous breathing (Sp) (B) Chest breathing (Ch) (C) Shallow breathing (Sh) (D) Abdominal breathing (Ab) (E) Slow breathing (Sl) (F) Fast breathing (Fa). The curves present the median ROC for each model, in black the original, in dark gray the CNN-DA and in light gray the CNN-TL + DA. The dotted line corresponds to the random guess. It is observed that the models DA and TL + DA exhibit a better behavior than the Original one, while behaving similar to each other.
Performance of the machine learning models for each of the sub-groups of the BS dataset. The results are presented as median accuracy (25th percentile—75th percentile) (%). On the left side, the results for the SVM. On the middle, the results for the CNN. On the right side, the results using the heuristic method in Charlton et al. (2021). The best results of each model that performed better than the heuristic approach for each breathing type are in bold.
| — | SVM | CNN | Heuristic | ||||
|---|---|---|---|---|---|---|---|
| Original | DA | TL + DA | Original | DA | TL + DA | ||
| Sp | 83.13 | 65.63 |
| 87.41* | 81.21 |
| 84.03 |
| [81.82–84.91] | [58.57–70.59] | [85.64–89.53] | [86.33–88.1] | [79.43–84.35] | [91.61–93.20] | [82.35–86.32] | |
| Ch | 82.10 | 66.25 |
| 85.72* | 73.20 |
| 80.97 |
| [80.60–84.55] | [64.08–68.37] | [81.87–85.20] | [82.43–87.41] | [71.59–74.88] | [88.32–89.68] | [79.05–83.58] | |
| Sh |
| 70.31 | 77.71 | 74.66 | 65.01 |
| 79.26 |
| [69.88–84.21] | [59.04–77.68] | [71.78–83.26] | [71.33–79.47] | [61.00–66.43] | [80.04–85.00] | [71.08–87.50] | |
| Ab | 72.21 | 61.13 | 75.57 | 85.61* | 70.62 |
| 76.51 |
| [69.49–76.79] | [57.63–66.27] | [72.00–76.76] | [83.46–87.23] | [69.12–73.01] | [85.66–89.62] | [68.18–79.52] | |
| Sl |
| 72.76 | 69.93 | 76.79 | 63.02 |
| 73.82 |
| [77.33–85.14] | [67.39–82.43] | [67.26–72.48] | [73.03–80.21] | [60.98–66.50] | [79.07–86.19] | [72.55–78.57] | |
| Fa |
| 69.62 | 72.35 |
| 52.42 | 75.39 | 69.62 |
| [61.90–87.50] | [61.90–83.33] | [68.35–76.92] | [71.43–89.29] | [42.11–61.70] | [63.89–83.64] | [61.90–83.33] | |
Breathing patterns: Sp, spontaneous; Ch, chest; Sh, shallow; Ab, abdominal; Sl, slow; Fa, fast.
Significant results compared to the Heuristic method, *p < 0.05.
Performance of the machine learning models for each of the sub-groups of the BS dataset. The results are presented as median sensitivity (25th percentile—75th percentile) (%). On the left side, the results for the SVM. On the middle, the results for the CNN. On the right side, the results using the heuristic method in Charlton et al. (2021). The best results of each model that performed better than the heuristic approach for each breathing type are in bold.
| — | SVM | CNN | Heuristic | ||||
|---|---|---|---|---|---|---|---|
| Original | DA | TL + DA | Original | DA | TL + DA | ||
| Sp | 63.92 | 13.42 | 83.48 | 76.32 | 61.26 |
| 85.90 |
| [61.11–68.75] | [8.57–19.44] | [80.43–89.42] | [71.43–76.81] | [57.65–67.86] | [89.13–93.37] | [81.58–91.30] | |
| Ch | 55.72 | 9.29 |
| 68.07 | 48.12 |
| 64.17 |
| [46.67–59.52] | [6.82–11.11] | [80.00–88.89] | [63.64–74.14] | [40.00–49.60] | [87.22–90.69] | [57.78–70.37] | |
| Sh | 35.61 | 2.17 |
| 46.49 | 34.45 |
| 52.78 |
| [33.33–47.22] | [0.00–5.56] | [74.22–87.14] | [41.94–48.48] | [31.05–35.00] | [74.48–86.57] | [45.71–66.67] | |
| Ab | 42.64 | 10.91 |
| 77.62* | 46.50 |
| 62.50 |
| [40.91–43.75] | [0.00–12.50] | [71.88–81.58] | [74.32–82.09] | [44.64–49.32] | [84.33–90.82] | [55.56–64.00] | |
| Sl | 34.31 | 8.69 |
| 35.57 | 24.55 |
| 39.09 |
| [28.57–42.86] | [0.00–17.65] | [58.33–68.75] | [33.33–42.11] | [18.42–28.41] | [76.14–83.62] | [33.33–41.18] | |
| Fa | 21.11* | 5.00 |
| 35.42 | 10.36 |
| 10.56 |
| [14.29–33.33] | [0.00–13.33] | [75.00–100.00] | [28.57–40.00] | [10.00–12.50] | [47.50–84.38] | [0.00–14.29] | |
Breathing patterns: Sp, spontaneous; Ch, chest; Sh, shallow; Ab, abdominal; Sl, slow; Fa, fast.
Significant results compared to the Heuristic method, *p < 0.05.
Performance of the machine learning models for each of the sub-groups of the BS dataset. The results are presented as median specificity (25th percentile—75th percentile) (%). On the left side, the results for the SVM. On the middle, the results for the CNN. On the right side, the results using the heuristic method in Charlton et al. (2021). The best results of each model that performed better than the heuristic approach for each breathing type are in bold.
| — | SVM | CNN | Heuristic | ||||
|---|---|---|---|---|---|---|---|
| Original | DA | TL + DA | Original | DA | TL + DA | ||
| Sp |
| 100.00 | 88.37* | 97.10* |
| 93.72* | 84.98 |
| [91.89–98.33] | [100.00–100.00] | [84.62–94.62] | [96.30–97.59] | [99.29–99.61] | [92.53–95.24] | [78.13–85.42] | |
| Ch |
| 100.00 | 81.60 | 94.44* |
| 90.35 | 90.98 |
| [96.88–98.39] | [100.00–100.00] | [79.75–84.38] | [93.91–96] | [98.47–100.00] | [87.94–93.51] | [88.71–92.19] | |
| Sh |
| 100.00 | 70.54 | 96.74* |
| 87.33 | 89.53 |
| [95.00–100.00] | [98.73–100] | [67.57–74.32] | [95.71–97.41] | [98.9–99.62] | [85.61–88.5] | [88.68–92.31] | |
| Ab |
| 100.00 | 72.53 | 91.15* |
| 87.62* | 83.68 |
| [96.30–100.00] | [100.00–100.00] | [70.10–77.65] | [90.16–94.20] | [97.17–99.61] | [86.57–90.71] | [81.25–88.89] | |
| Sl |
| 100.00 | 74.26 | 91.48 | 100.00 | 87.17 | 89.29 |
| [97.14–100.00] | [100.00–100.00] | [68.82–81.82] | [89.58–93.90] | [100.00–100.00] | [83.53–89.36] | [88.24–95.24] | |
| Fa | 100.00 | 100.00 | 55.84 | 100.00 | 100.00 | 85.52 | 100.00 |
| [100.00–100.00] | [100.00–100.00] | [50.00–73.91] | [96.30–100.00] | [100.00–100.00] | [78.26–92.59] | [100.00–100.00] | |
Breathing patterns: Sp, spontaneous; Ch, chest; Sh, shallow; Ab, abdominal; Sl, slow; Fa, fast.
Significant results compared to the Heuristic method, *p < 0.05.