| Literature DB >> 35808449 |
Josep Noguer1, Ivan Contreras1, Omer Mujahid1, Aleix Beneyto1, Josep Vehi1,2.
Abstract
In this paper, we present a methodology based on generative adversarial network architecture to generate synthetic data sets with the intention of augmenting continuous glucose monitor data from individual patients. We use these synthetic data with the aim of improving the overall performance of prediction models based on machine learning techniques. Experiments were performed on two cohorts of patients suffering from type 1 diabetes mellitus with significant differences in their clinical outcomes. In the first contribution, we have demonstrated that the chosen methodology is able to replicate the intrinsic characteristics of individual patients following the statistical distributions of the original data. Next, a second contribution demonstrates the potential of synthetic data to improve the performance of machine learning approaches by testing and comparing different prediction models for the problem of predicting nocturnal hypoglycemic events in type 1 diabetic patients. The results obtained for both generative and predictive models are quite encouraging and set a precedent in the use of generative techniques to train new machine learning models.Entities:
Keywords: artificial intelligence; blood glucose; deep learning; generative model; type 1 diabetes
Mesh:
Substances:
Year: 2022 PMID: 35808449 PMCID: PMC9269743 DOI: 10.3390/s22134944
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Graphical representation of the generator architecture.
Figure 2Graphical representation of the discriminator architecture.
Figure 3Graphical representation of the nocturnal hypoglycemia classifier.
Total number of instances for the ten patients of the Barcelona data set. Class 0 is defined as a sleep period without hypoglycemia and Class 1 is defined as a sleep period with hypoglycemia.
| Patient ID | Total Instances | Class 1 | Class 0 |
|---|---|---|---|
| B1 | 85 | 6 (7%) | 79 (93%) |
| B2 | 77 | 7 (9%) | 70 (91%) |
| B3 | 94 | 22 (23%) | 72 (77%) |
| B4 | 92 | 3 (3%) | 89 (97%) |
| B5 | 82 | 6 (7%) | 76 (93%) |
| B6 | 88 | 7 (8%) | 81 (92%) |
| B7 | 103 | 31 (30%) | 72 (70%) |
| B8 | 78 | 15 (19%) | 63 (81%) |
| B9 | 110 | 13 (12%) | 97 (88%) |
| B10 | 107 | 20 (19%) | 87 (81%) |
Total number of instances for the six patients of the Ohio data set. Class 0 is defined as a sleep period without hypoglycemia and Class 1 is defined as a sleep period with hypoglycemia.
| Patient ID | Total Instances | Class 1 | Class 0 |
|---|---|---|---|
| O1 | 34 | 1 (3%) | 33 (97%) |
| O2 | 47 | 6 (13%) | 41 (87%) |
| O3 | 40 | 3 (7%) | 37 (93%) |
| O4 | 39 | 8 (21%) | 31 (79%) |
| O5 | 52 | 3 (6%) | 49 (94%) |
| O6 | 45 | 6 (13%) | 39 (87%) |
Ranges of values where a patient’s blood glucose can be found.
| Label | Range (mg/dL) |
|---|---|
| L2 Hypo | CGM < 54 |
| L1 Hypo | 54 ≤ CGM < 70 |
| TIR | 70 ≤ CGM < 180 |
| Hyper | 180 ≤ CGM |
Hardware specifications.
| CPU | Intel Core i7-4770 CPU 3.40 GHz |
| Power Source | Corsair TX 750 M 750 Watt |
| GPU | Nvidia GeForce Titan Xp Pascal 12 GB GDDR5X |
| Mother Board | Asus Z87-A |
| RAM Memory | DDR3 16 GB |
Results obtained by comparing real and synthetic data for patients in the Barcelona cohort. All p-values have been accepted. as they pass the acceptance threshold of 0.05.
| Patient | Hyper | TIR | L1 Hypo | L2 Hypo | Mean | JS | Variance | Z-Value | SD | |
|---|---|---|---|---|---|---|---|---|---|---|
| B1 | Real | 48.80 | 45.21 | 3.74 | 2.26 | 187.20 | 0.23 | −1.34 | 0.40 | 74.67 |
| Syn. | 37.36 | 54.02 | 5.67 | 2.95 | 164.73 | 0.21 | −1.37 | 0.59 | 58.59 | |
| B2 | Real | 28.79 | 61.94 | 5.01 | 4.26 | 147.95 | 0.19 | −0.95 | 0.54 | 47.80 |
| Syn. | 43.57 | 50.10 | 3.59 | 2.74 | 180.08 | 0.18 | −1.56 | 0.72 | 56.69 | |
| B3 | Real | 35.08 | 56.69 | 5.24 | 2.99 | 158.78 | 0.21 | −1.30 | 0.51 | 56.76 |
| Syn. | 38.67 | 54.74 | 4.05 | 2.54 | 171.32 | 0.18 | −1.74 | 0.77 | 54.35 | |
| B4 | Real | 44.72 | 47.80 | 4.07 | 3.40 | 175.15 | 0.22 | −0.95 | 0.38 | 69.61 |
| Syn. | 38.02 | 54.13 | 4.65 | 3.20 | 170.66 | 0.20 | −1.25 | 0.54 | 66.39 | |
| B5 | Real | 39.52 | 54.94 | 4.29 | 1.24 | 163.54 | 0.19 | −1.14 | 0.46 | 55.67 |
| Syn. | 45.13 | 51.54 | 2.35 | 0.98 | 184.42 | 0.18 | −1.42 | 0.60 | 59.73 | |
| B6 | Real | 51.37 | 41.11 | 3.21 | 4.31 | 185.68 | 0.22 | −1.29 | 0.66 | 59.43 |
| Syn. | 37.81 | 49.83 | 5.19 | 7.18 | 163.71 | 0.22 | −1.29 | 0.66 | 59.43 | |
| B7 | Real | 35.40 | 50.53 | 7.21 | 6.85 | 153.87 | 0.25 | −1.30 | 0.46 | 63.71 |
| Syn. | 27.70 | 59.22 | 6.55 | 6.53 | 146.27 | 0.22 | −1.41 | 0.54 | 54.17 | |
| B8 | Real | 32.84 | 56.30 | 7.06 | 3.80 | 154.30 | 0.23 | −1.26 | 0.46 | 61.44 |
| Syn. | 36.86 | 56.75 | 4.16 | 2.23 | 168.57 | 0.20 | −1.56 | 0.54 | 63.66 | |
| B9 | Real | 46.01 | 45.59 | 4.85 | 3.55 | 176.19 | 0.24 | −1.27 | 0.42 | 69.30 |
| Syn. | 33.94 | 54.01 | 5.47 | 6.58 | 157.40 | 0.22 | −1.56 | 0.66 | 57.23 | |
| B10 | Real | 36.59 | 47.59 | 6.30 | 9.52 | 160.35 | 0.28 | −1.08 | 0.29 | 81.94 |
| Syn. | 31.48 | 50.62 | 6.40 | 11.50 | 151.59 | 0.26 | −1.40 | 0.44 | 74.35 | |
Results obtained by comparing real and synthetic data for patients in the Ohio cohort. All p-values have been accepted, as they pass the acceptance threshold of 0.05.
| Patient | Hyper | TIR | L1 Hypo | L2 Hypo | Mean | JS | Variance | Z-Value | SD | |
|---|---|---|---|---|---|---|---|---|---|---|
| O1 | Real | 39.27 | 56.92 | 2.74 | 1.07 | 167.12 | 0.21 | −1.13 | 0.32 | 62.70 |
| Syn. | 32.91 | 61.88 | 4.57 | 0.64 | 163.45 | 0.20 | −1.92 | 0.50 | 59.84 | |
| O2 | Real | 25.72 | 72.13 | 1.85 | 0.30 | 148.04 | 0.17 | −1.14 | 0.33 | 43.13 |
| Syn. | 20.14 | 67.88 | 7.60 | 4.38 | 132.59 | 0.15 | −1.90 | 0.85 | 34.86 | |
| O3 | Real | 60.30 | 38.60 | 1.03 | 0.07 | 195.18 | 0.14 | −0.81 | 0.57 | 51.19 |
| Syn. | 50.99 | 47.41 | 1.29 | 0.31 | 196.22 | 0.13 | −2.12 | 1.07 | 48.59 | |
| O4 | Real | 25.13 | 67.62 | 5.12 | 2.14 | 144.20 | 0.21 | −1.32 | 0.33 | 53.73 |
| Syn. | 29.83 | 60.96 | 5.59 | 3.62 | 151.29 | 0.20 | −1.49 | 0.54 | 56.82 | |
| O5 | Real | 38.61 | 60.66 | 0.56 | 0.17 | 167.99 | 0.15 | −1.06 | 0.43 | 43.11 |
| Syn. | 54.98 | 44.10 | 0.78 | 0.13 | 195.53 | 0.14 | −1.07 | 0.68 | 49.08 | |
| O6 | Real | 31.16 | 65.06 | 3.19 | 0.59 | 153.16 | 0.19 | −1.16 | 0.38 | 50.15 |
| Syn. | 28.86 | 61.93 | 5.56 | 3.65 | 152.75 | 0.16 | −1.43 | 0.88 | 41.71 | |
Figure 4Heat maps representing distances of real and synthetic samples of Barcelona patients.
Figure 5Representation of the obtained metrics related to the amount of days used to augment the real data set.
Results of a nocturnal hypoglycemia classifier trained with real and augmented data sets for patients of both Barcelona and Ohio cohorts.
| Patient | ACC | SEN | SP |
| MCC | |
|---|---|---|---|---|---|---|
| B1 | Real | 87.5 | 30.0 | 90.2 | 52.0 | 0.17 |
| Aug. | 88.8 | 78.5 | 84.0 | 81.2 | 0.51 | |
| B2 | Real | 83.2 | 27.9 | 84.8 | 48.6 | 0.12 |
| Aug. | 89.7 | 62.0 | 88.4 | 74.0 | 0.47 | |
| B3 | Real | 74.4 | 50.9 | 74.4 | 61.6 | 0.31 |
| Aug. | 81.6 | 58.4 | 82.9 | 69.5 | 0.50 | |
| B5 | Real | 88.6 | 33.2 | 88.0 | 54.0 | 0.20 |
| Aug. | 80.5 | 51.0 | 79.7 | 63.7 | 0.26 | |
| B6 | Real | 87.0 | 24.7 | 90.3 | 47.3 | 0.13 |
| Aug. | 77.4 | 81.5 | 75.6 | 78.5 | 0.35 | |
| B7 | Real | 73.3 | 57.0 | 71.4 | 63.8 | 0.38 |
| Aug. | 78.9 | 66.2 | 75.1 | 70.5 | 0.52 | |
| B8 | Real | 79.4 | 66.0 | 80.3 | 72.8 | 0.46 |
| Aug. | 81.3 | 68.7 | 79.0 | 73.6 | 0.52 | |
| B9 | Real | 77.1 | 44.9 | 76.5 | 58.6 | 0.21 |
| Aug. | 79.9 | 71.7 | 78.1 | 74.8 | 0.41 | |
| B10 | Real | 79.0 | 55.3 | 80.4 | 66.7 | 0.39 |
| Aug. | 84.6 | 84.0 | 81.1 | 82.5 | 0.62 | |
| O2 | Real | 82.8 | 50.9 | 84.0 | 65.4 | 0.35 |
| Aug. | 76.9 | 70.0 | 73.8 | 71.9 | 0.38 | |
| O4 | Real | 66.1 | 36.0 | 73.2 | 51.3 | 0.09 |
| Aug. | 65.9 | 71.9 | 65.2 | 68.5 | 0.31 | |
| O6 | Real | 79.3 | 24.6 | 84.4 | 45.5 | 0.09 |
| Aug. | 84.6 | 62.5 | 85.9 | 73.3 | 0.45 | |
| Med. | Real | 79.4 | 40.4 | 82.2 | 56.3 | 0.21 |
| Aug. | 80.9 | 69.3 | 79.3 | 73.5 | 0.46 | |
Comparison between state-of-the-art results and the obtained with the augmented data set, presented as medians and interquartile ranges.
| State-of-the-Art | Aug. | |
|---|---|---|
| SEN | 75 (70–80) | 69 (62–74) |
| SP | 65 (55–74) | 79 (76–83) |
|
| 70 (62–77) | 74 (70–76) |