| Literature DB >> 32784179 |
Ashenafi Zebene Woldaregay1, Ilkka Kalervo Launonen2, David Albers3,4, Jorge Igual5, Eirik Årsand1, Gunnar Hartvigsen1.
Abstract
BACKGROUND: Semisupervised and unsupervised anomaly detection methods have been widely used in various applications to detect anomalous objects from a given data set. Specifically, these methods are popular in the medical domain because of their suitability for applications where there is a lack of a sufficient data set for the other classes. Infection incidence often brings prolonged hyperglycemia and frequent insulin injections in people with type 1 diabetes, which are significant anomalies. Despite these potentials, there have been very few studies that focused on detecting infection incidences in individuals with type 1 diabetes using a dedicated personalized health model.Entities:
Keywords: decision support techniques; infection detection; outbreak detection system; self-recorded health data; syndromic surveillance; type 1 diabetes
Mesh:
Year: 2020 PMID: 32784179 PMCID: PMC7450372 DOI: 10.2196/18912
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Equipments used in the self-management of diabetes.
| Patients | Self-management | ||||
|
| BGa | Insulin administration | Diet | Body weight (kg) | HbA1cb (%) |
| Subject 1 | Finger pricks recorded in the Diabetes Diary mobile app and Dexcom CGMc | Insulin Pen (multiple bolus and 1-time basal in the morning) recorded in the Diabetes Diary mobile app | Carbohydrate in grams recorded in the Diabetes Diary mobile app; level 3 (advanced carb counting) | 83 | 6.0 |
| Subject 2 | Finger pricks recorded in the Spike mobile app and Dexcom G4 CGMc | Insulin Pen (multiple bolus [Humalog] and 1-time basal [Toujeo] before bed) recorded in the Spike mobile app | Carbohydrate in grams recorded in the Spike mobile app; level 3 (advanced carb counting) | 77 | 7.3 |
| Subject 3 | Enlite (Medtronic) CGMc and Dexcom G4 | Medtronic MinMed G640 insulin pump (basal rates profile [Fiasp] and multiple bolus [Fiasp]) | Carbohydrate in grams recorded in pump information; level 3 (advanced carb counting) | 70 | 6.2 |
aBG: blood glucose.
bHbA1c: hemoglobin A1c.
cCGM: continuous glucose monitoring.
Figure 1Daily scatter plot of average blood glucose levels versus total insulin (bolus) to total carbohydrate ratio for a specific regular or normal patient year without any infection incidences.
Figure 4Hourly scatter plot of average blood glucose levels versus total insulin (bolus) to total carbohydrate ratio for a specific patient year with an infection incidence (flu).
Average (SD) of area under the receiver operating characteristic curve, specificity, F1-score for the raw data set (without smoothing), and different sample size. Fraction=0.01.
| Models | 1 month | 2 months | 3 months | 4 months | |||||||||
|
| AUCa, mean (SD) | Specificity, mean (SD) | F1, mean (SD) | AUC, mean (SD) | Specificity, mean (SD) | F1, mean (SD) | AUC, mean (SD) | Specificity, mean (SD) | F1, mean (SD) | AUC, mean (SD) | Specificity, mean (SD) | F1, mean (SD) | |
|
| |||||||||||||
|
| SVDDb | 90.7 (8.8) | 71.7 (7.7) | 73.6 (5.5) | 93.4 (6.2) | 81.7 (5.0) | 87.4 (8.1) | 96.4 (2.9) | 87.8 (3.3) | 91.3 (6.0) | 94.6 (3.7) | 81.7 (5.0) | 90.0 (4.6) |
|
| IncSVDDc | 90.4 (8.9) | 66.7 (7.5) | 72.7 (4.9) | 91.8 (5.9) | 66.7 (7.5) | 84.4 (3.2) | 95.8 (2.9) | 70.0 (7.1) | 85.4 (1.2) | 93.7 (3.6) | 55 (10.7) | 81.0 (2.7) |
|
| V-SVMd | 93.1 (6.0) | 63 (10.6) |
| 96.5 (2.3) | 81.9 (4.7) |
| 97.9 (1.5) | 88.9 (0.0) |
| 96.2 (2.3) | 83.3 (0.0) |
|
|
| NNf | 74.2 (9.3) | 38.3 (7.7) | 61.0 (4.7) | 89.5 (9.3) | 20.0 (6.7) | 70.0 (4.6) | 90.1 (6.6) | 11.1 (18) | 69.2 (3.8) | 92.8 (3.3) | 33.3 (0.0) | 75.1 (0.4) |
|
| MSTg | 89.4 (8.1) | 50.0 (0.0) | 62.7 (6.6) | 95.4 (5.6) | 61.7 (7.7) | 82.3 (5.9) | 96.6 (2.7) | 68.9 (4.5) | 83.6 (4.7) | 94.1 (2.8) | 55.0 (7.7) | 80.6 (2.3) |
|
| |||||||||||||
|
| Gaussian | 90.6 (7.1) | 60.0 (8.2) | 68.8 (8.4) | 95.4 (4.6) | 70.0 (6.7) | 85.3 (4.6) | 97.3 (2.5) | 80.0 (4.5) | 89.2 (3.3) | 95.5 (3.2) | 66.7 (0.0) | 84.5 (2.0) |
|
| MOGh | 88.1 (9.9) | 80.1 (17.3) | 67.8 (16.4) | 93.1 (7.1) | 75.8 (14.8) | 82.5 (10.1) | 95.6 (3.4) | 80.2 (7.5) | 86.0 (6.7) | 93.7 (3.9) | 68.7 (11.6) | 84.2 (5.7) |
|
| MCDi Gaussian | 89.0 (8.5) | 55.0 (7.7) | 66.4 (9.0) | 94.0 (4.6) | 68.3 (5.0) | 84.6 (6.3) | 97.0 (2.7) | 80.0 (4.5) | 89.9 (2.4) | 94.5 (3.2) | 65.0 (5.0) | 84.0 (3.2) |
|
| Parzen | 89.0 (9.2) | 70.0 (6.7) | 70.7 (5.9) | 94.6 (4.9) | 83.3 (0.0) | 87.9 (6.3) | 97.2 (2.4) | 88.9 (0.0) | 90.5 (5.9) | 95.2 (2.9) | 83.3 (0.0) | 88.9 (3.3) |
|
| Naïve Parzen | 90.1 (7.6) | 55 (10.7) | 65.0 (5.0) | 95.7 (3.9) | 76.7 (8.2) | 87.2 (3.5) | 98.3 (1.4) | 88.9 (0.0) |
| 96.8 (2.1) | 83.3 (0.0) | 90.7 (2.0) |
|
| K-NNj | 91.8 (6.9) | 50.0 (0.0) | 66.0 (2.0) | 95.6 (3.1) | 81.7 (5.0) |
| 97.9 (1.6) | 88.9 (0.0) | 93.5 (3.7) | 97.0 (2.2) | 83.3 (0.0) |
|
|
| LOFk | 88.5 (6.1) | 66.7 (7.5) |
| 97.0 (1.9) | 71.7 (7.7) | 86.1 (2.4) | 96.8 (2.8) | 78.9 (3.3) | 88.7 (2.8) | 92.6 (4.8) | 50.0 (0.0) | 79.3 (2.6) |
|
| |||||||||||||
|
| PCAl | 87.8 (11.9) | 50.0 (7.5) | 62.4 (8.5) | 93.5 (6.2) | 51.7 (5.0) | 78.2 (4.1) | 93.6 (4.7) | 60 (10.2) | 81.8 (4.4) | 91.3 (5.2) | 46.7 (6.7) | 78.7 (2.3) |
|
| Auto-encoder | 82.2 (12.0) | 57.9 (15.3) | 64.7 (12.0) | 88.2 (9.5) | 61.6 (14.0) | 81.4 (7.1) | 93.4 (5.7) | 74.4 (11) | 86.4 (5.9) | 88.4 (8.8) | 61.3 (14.3) | 82.7 (5.7) |
|
| SOMm | 86.9 (9.4) | 78.3 (13.3) | 66.7 (16.9) | 92.8 (7.3) | 64.2 (12.4) | 80.9 (7.0) | 95.8 (3.7) | 80.1 (6.3) | 86.9 (5.5) | 92.2 (4.1) | 76.5 (9.0) | 87.5 (4.5) |
|
| K-means | 91.8 (6.9) | 65.0 (9.0) |
| 96.0 (2.4) | 83.3 (0.0) |
| 97.6 (1.6) | 88.9 (0.0) |
| 96.2 (2.2) | 83.3 (0.0) |
|
aAUC: area under the receiver operating characteristic curve.
bSVDD: support vector data description.
cIncSVDD: incremental support vector data description.
dV-SVM: one-class support vector machine.
eItalicized values indicates the top performing models.
fNN: nearest neighbor.
gMST: minimum spanning tree.
hMOG: mixture of Gaussian.
iMCD: minimum covariance determinant.
jK-NN: K-nearest neighbor.
kLOF: local outlier factor.
lPCA: principal component analysis.
mSOM: self-organizing maps.
Average of area under the receiver operating characteristic curve, specificity, and F1-score for smoothed version of the data with a 2-day moving average filter and different sample size. Fraction=0.01.
| Models | 1 month | 2 months | 3 months | 4 months | |||||||||
|
| AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | |
|
| |||||||||||||
|
| SVDDb | 99.6 (1.3) | 100 (0.0) | 93.6 (15.2) | 100 (0.0) | 100 (0.0) | 94.8 (10.1) | 100 (0.0) | 100 (0.0) | 97.0 (4.1) | 100 (0.0) | 100 (0.0) | 96.9 (4.0) |
|
| IncSVDDc | 99.6 (1.3) | 100 (0.0) | 93.6 (15.2) | 100 (0.0) | 100 (0.0) | 97.1 (6.3) | 100 (0.0) | 100 (0.0) | 97.6 (4.1) | 100 (0.0) | 100 (0.0) | 98.3 (2.8) |
|
| V-SVMd | 100 (0.0) | 99.5 (2.9) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
|
|
| NNf | 98.1 (3.9) | 58.3 (15.4) | 72.3 (9.9) | 86.9 (12.5) | 16.7 (22.4) | 70.5 (5.3) | 88.1 (6.5) | 54.4 (22.5) | 80.0 (8.6) | 92.4 (5.3) | 8.3 (17.1) | 69.0 (4.8) |
|
| MSTg | 98.5 (2.4) | 85.0 (5.0) | 85.5 (2.1) | 99.7 (0.8) | 100 (0.0) | 97.1 (6.3) | 99.9 (0.4) | 97.8 (4.5) | 97.2 (4.0) | 99.7 (0.8) | 100 (0.0) | 97.0 (7.9) |
|
| |||||||||||||
|
| Gaussian | 100 (0.0) | 98.3 (5.0) | 92.1 (15.2) | 100 (0.0) | 100 (0.0) | 97.1 (6.3) | 99.8 (0.7) | 100 (0.0) | 97.6 (4.1) | 99.4 (1.7) | 100 (0.0) | 97.0 (7.9) |
|
| MOGh | 98.6 (3.2) | 99.8 (1.7) | 88.5 (16.8) | 99.6 (1.2) | 100 (0.0) | 92.2 (11.1) | 99.7 (0.7) | 99.8 (1.4) | 94 (10.3) | 99.3 (2.0) | 99.9 (1.2) | 94.4 (11.8) |
|
| MCDi Gaussian | 98.9 (2.2) | 91.7 (8.4) |
| 100 (0.0) | 100 (0.0) |
| 99.5 (1.1) | 96.7 (5.1) | 96.6 (5.9) | 99.4 (1.7) | 88.3 (7.7) | 92.0 (6.8) |
|
| Parzen | 99.6 (1.3) | 100 (0.0) | 87.7 (17.0) | 100 (0.0) | 100 (0.0) | 95.1 (8.0) | 100 (0.0) | 100 (0.0) | 94.6 (9.8) | 99.9 (0.4) | 100 (0.0) | 94.6 (12.3) |
|
| Naïve Parzen | 99.2 (2.5) | 100 (0.0) | 94.7 (11.1) | 100 (0.0) | 100 (0.0) | 93.8 (11.0) | 99.6 (1.1) | 100 (0.0) | 97.5 (5.0) | 100 (0.0) | 100 (0.0) |
|
|
| K-NNj | 98.1 (3.9) | 68.3 (5.0) | 75.2 (4.3) | 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
|
|
| LOFk | 98.6 (2.9) | 75.0 (13.5) | 80.2 (10.8) | 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) | 96.9 (5.0) | 99.7 (0.8) | 100 (0.0) | 97.4 (7.9) |
|
| |||||||||||||
|
| PCAl | 98.9 (2.2) | 85.0 (5.0) |
| 99.2 (1.3) | 85.0 (5.0) |
| 98.6 (1.9) | 88.9 (0.0) | 92.2 (6.0) | 97.8 (2.2) | 83.3 (0.0) | 89.1 (9.7) |
|
| Auto-encoder | 97.4 (6.0) | 89.1 (13.0) | 86.0 (14.2) | 98.5 (3.2) | 94.5 (9.6) | 91.8 (9.4) | 99.2 (2.4) | 93.7 (10.2) | 93.7 (8.3) | 98.6 (3.8) | 94.4 (9.5) | 93.7 (9.7) |
|
| SOMm | 99.3 (1.9) | 99.9 (1.2) | 84.7 (19.8) | 99.8 (0.7) | 100 (0.0) | 91.4 (9.6) | 99.9 (0.3) | 100 (0.0) | 95.2 (7.9) | 99.6 (1.3) | 100 (0.0) | 93.4 (12.1) |
|
| K-means | 99.2 (2.5) | 85.0 (11.7) | 87.0 (10.4) | 100 (0.0) | 100 (0.0) | 97.1 (6.3) | 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
|
aAUC: area under the receiver operating characteristic curve.
bSVDD: support vector data description.
cIncSVDD: incremental support vector data description.
dV-SVM: one-class support vector machine.
eItalicized values indicates the top performing models.
fNN: nearest neighbor.
gMST: minimum spanning tree.
hMOG: mixture of Gaussian.
iMCD: minimum covariance determinant.
jK-NN: K-nearest neighbor.
kLOF: local outlier factor.
lPCA: principal component analysis.
mSOM: self-organizing maps.
Average (SD) of area under the receiver operating characteristic curve, specificity, F1-score for the smoothed version of the data with a 48-hour moving average filter and different sample size. Fraction=0.01.
| Models | 1 month | 2 months | 3 months | 4 months | |||||||||
|
| AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | |
|
| |||||||||||||
|
| SVDDb | 97.6 (1.9) | 83.2 (3.4) | 85.8 (1.7) | 97.8 (1.2) | 85.7 (5.0) | 90.5 (9.6) | 97.7 (1.2) | 90.4 (5.1) | 94.2 (2.9) | 98.1 (0.9) | 91.0 (3.7) | 96.8 (0.9) |
|
| IncSVDDc | 97.4 (1.9) | 84.5 (2.8) | 86.8 (1.9) | 97.7 (1.2) | 86.7 (2.0) | 93.9 (1.0) | 97.5 (1.2) | 88.5 (1.5) | 96.0 (1.1) | 97.9 (0.9) | 88.9 (1.2) |
|
|
| V-SVMd | 98.1 (2.1) | 84.5 (1.1) |
| 99.0 (1.1) | 92.6 (0.0) |
| 99.5 (0.6) | 93.8 (0.5) |
| 99.4 (0.4) | 94.2 (0.0) | 97.1 (1.3) |
|
| NNf | 84.8 (6.0) | 75.9 (4.5) | 74.8 (6.0) | 89.3 (2.2) | 76.5 (4.1) | 87.1 (3.3) | 89.0 (4.0) | 77.5 (3.9) | 89.3 (4.4) | 90.2 (4.7) | 77.5 (3.8) | 91.4 (6.4) |
|
| MSTg | 90.5 (3.1) | 85.4 (3.9) | 67.6 (14.5) | 94.4 (2.0) | 85.7 (4.0) | 85.1 (7.0) | 94.7 (2.4) | 88.8 (3.5) | 87.8 (8.5) | 95.8 (2.2) | 88.8 (3.0) | 90.9 (5.9) |
|
| |||||||||||||
|
| Gaussian | 98.1 (2.2) | 79.8 (4.9) | 83.9 (2.7) | 99.5 (0.9) | 90.1 (1.7) | 95.2 (1.8) | 99.6 (0.7) | 92.9 (1.3) | 97.1 (2.5) | 99.5 (0.5) | 92.2 (1.0) | 97.7 (1.1) |
|
| MOGh | 95.8 (3.6) | 82.7 (4.3) | 83.7 (5.0) | 98.3 (1.5) | 86.2 (2.7) | 92.3 (2.7) | 98.7 (1.4) | 88.7 (4.6) | 94.7 (3.5) | 98.6 (1.6) | 88.2 (3.1) | 95.3 (3.2) |
|
| MCDi Gaussian | 98.6 (2.1) | 75.3 (6.9) | 81.3 (2.5) | 99.6 (0.9) | 89.6 (1.9) | 95.0 (1.8) | 99.6 (0.7) | 92.5 (1.8) | 97.0 (2.3) | 99.6 (0.4) | 92.0 (1.2) | 97.7 (1.1) |
|
| Parzen | 91.9 (2.9) | 93.6 (2.0) | 63.4 (16.5) | 96.2 (2.3) | 94.4 (2.0) | 81.6 (10.2) | 96.6 (2.6) | 94.8 (1.7) | 84.2 (9.5) | 97.4 (2.2) | 95.6 (1.2) | 87.9 (7.1) |
|
| Naïve Parzen | 94.8 (3.7) | 76.4 (5.6) | 77.6 (7.9) | 98.7 (1.2) | 85.2 (3.3) | 91.8 (2.9) | 99.1 (1.1) | 89.1 (3.8) | 94.8 (2.5) | 98.9 (0.9) | 89.7 (2.4) | 96.2 (1.6) |
|
| K-NNj | 97.1 (3.4) | 78.8 (2.0) |
| 99.1 (1.0) | 92.9 (0.7) |
| 99.6 (0.4) | 93.8 (0.7) |
| 99.5 (0.3) | 94.0 (0.6) |
|
|
| LOFk | 96.9 (3.5) | 78.3 (3.0) | 84.2 (2.4) | 99.2 (1.1) | 91.9 (0.9) |
| 99.6 (0.5) | 93.7 (0.8) | 97.3 (2.1) | 99.5 (0.4) | 93.1 (0.4) | 97.8 (1.2) |
|
| |||||||||||||
|
| PCAl | 97.1 (3.4) | 63.9 (8.8) | 75.4 (0.3) | 99.4 (1.2) | 76.4 (6.6) | 90.2 (1.1) | 99.1 (1.3) | 75.1 (6.8) | 92.4 (1.1) | 98.9 (1.2) | 69.1 (4.1) | 93.1 (0.8) |
|
| Auto-encoder | 92.0 (4.8) | 79.5 (7.6) | 78.9 (8.3) | 96.2 (2.6) | 83.1 (7.2) | 91.1 (3.9) | 96.3 (3.2) | 84.3 (7.7) | 92.7 (5.0) | 96.7 (3.0) | 84.0 (8.0) | 94.6 (4.4) |
|
| SOMm | 94.1 (2.3) | 82.2 (3.3) | 82.6 (4.9) | 95.6 (1.1) | 82.9 (3.1) | 91.6 (1.9) | 94.8 (2.3) | 83.4 (5.8) | 92.3 (4.1) | 95.5 (1.9) | 84.1 (3.8) | 94.3 (3.8) |
|
| K-means | 97.3 (3.2) | 80.9 (2.5) |
| 98.9 (1.1) | 92.6 (0.7) |
| 99.3 (0.6) | 92.9 (0.7) |
| 99.4 (0.4) | 94.1 (0.2) |
|
aAUC: area under the receiver operating characteristic curve.
bSVDD: support vector data description.
cIncSVDD: incremental support vector data description.
dV-SVM: one-class support vector machine.
eItalicized values indicates the top performing models.
fNN: nearest neighbor.
gMST: minimum spanning tree.
hMOG: mixture of Gaussian.
iMCD: minimum covariance determinant.
jK-NN: K-nearest neighbor.
kLOF: local outlier factor.
lPCA: principal component analysis.
mSOM: self-organizing maps.
Average performance of each model across all the infection cases for the daily raw data set (without smoothing) and different sample sizes. Fraction=0.01.
| Models | 1 month | 2 months | 3 months | 4 months | |||||||||
|
| AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | |
|
| |||||||||||||
|
| SVDDb | 87.1 (11) | 66.0 (13.5) | 74.8 (9.5) | 91.7 (7.3) | 61.7 (10.6) | 84.1 (5.5) | 93.3 (4.6) | 67.3 (10.5) | 86.2 (4.4) | 91.4 (4.3) | 61.7 (10.6) |
|
|
| IncSVDDd | 85.2 (11) | 63.0 (4.6) | 74.7 (10.4) | 90.5 (8.5) | 57.9 (11) |
| 92.8 (5.1) | 62.8 (10.9) | 84.9 (3.2) | 90.8 (4.4) | 55.0 (11.7) | 83.5 (3.7) |
|
| V-SVMe | 91.5 (8.0) | 55.7 (7.0) |
| 92.2 (5.1) | 60.6 (5.0) | 82.8 (4.5) | 94.2 (3.8) | 66.9 (6.1) |
| 93.8 (4.1) | 63.1 (11.9) | 84.5 (5.1) |
|
| NNf | 73.4 (12) | 31.3 (6.5) | 65.0 (5.4) | 72.1 (11.9) | 25.0 (9.6) | 75.7 (3.7) | 70.8 (11.2) | 8.6 (17.6) | 72.0 (4.7) | 70.0 (9.0) | 16.0 (14.4) | 75.7 (3.4) |
|
| MSTg | 82.4 (8.7) | 52.1 (0.0) | 71.2 (6.1) | 82.6 (9.1) | 50.4 (9.0) | 82.0 (5.1) | 84.0 (6.3) | 56.2 (9.3) | 82.9 (3.5) | 84.2 (6.6) | 50.0 (11.4) | 82.6 (2.7) |
|
| |||||||||||||
|
| Gaussian | 91.5 (9.9) | 56.9 (7.7) | 72.9 (7.8) | 93.6 (6.1) | 58.8 (10.9) | 84.0 (4.0) | 95.1 (4.3) | 65.3 (10.6) | 86.3 (3.2) | 95.0 (3.5) | 57.9 (10.3) | 84.6 (3.2) |
|
| MOGh | 89.9 (12) | 69.2 (11.9) | 71.3 (14.3) | 91.7 (6.1) | 64.1 (14.0) | 83.8 (6.8) | 94.0 (4.4) | 67.0 (11.4) | 85.0 (5.6) | 94.5 (3.7) | 61.6 (12.6) | 84.9 (5.1) |
|
| MCDi Gaussian | 90.8 (9.1) | 54.0 (5.5) |
| 93.1 (6.0) | 58.0 (8.1) | 84.1 (4.3) | 95.3 (4.2) | 65.3 (10.6) | 86.4 (3.0) | 94.8 (3.5) | 57.9 (10.6) | 84.9 (3.0) |
|
| Parzen | 89.7 (10) | 59.6 (8.3) | 70.6 (9.4) | 91.7 (6.5) | 62.1 (10.3) | 83.9 (5.3) | 93.9 (5.0) | 68.7 (11.2) | 85.6 (5.4) | 94.3 (3.8) | 66.1 (12.7) | 86.1 (3.8) |
|
| Naïve Parzen | 88.1 (8.7) | 54.2 (6.5) | 69.1 (9.6) | 90.2 (7.1) | 60.4 (11.2) | 83.7 (4.9) | 91.9 (5.5) | 66.5 (12.8) | 86.6 (4.4) | 92.8 (4.7) | 64.6 (10.0) |
|
|
| K-NNj | 91.1 (7.8) | 52.9 (5.1) | 71.6 (7.9) | 91.6 (5.0) | 61.1 (11.3) |
| 94.8 (4.8) | 66.9 (11.2) |
| 95.0 (3.8) | 62.1 (10.3) |
|
|
| LOFk | 89.2 (8.9) | 56.3 (3.9) | 73.0 (8.6) | 92.4 (6.0) | 59.2 (11.1) | 84.9 (2.8) | 94.0 (4.8) | 64.4 (11.4) | 86.2 (2.8) | 93.7 (4.3) | 53.8 (10.3) | 83.8 (2.5) |
|
| |||||||||||||
|
| PCAl | 87.6 (8.8) | 58.8 (4.6) | 73.7 (8.3) | 90.2 (6.4) | 55.0 (6.8) | 82.7 (4.5) | 91.4 (4.9) | 59.7 (6.2) | 84.1 (3.2) | 90.5 (4.5) | 53.8 (7.2) | 83.6 (2.9) |
|
| Auto-encoder | 83.6 (14) | 58.3 (17.7) | 71.0 (12.5) | 84.6 (12.5) | 53.1 (20.0) | 82.1 (7.0) | 88.4 (10.0) | 57.7 (21.5) | 83.3 (6.8) | 88.5 (10.6) | 52.3 (21.0) | 83.2 (5.8) |
|
| SOMm | 85.6 (12) | 63.4 (10.3) | 72.7 (11.7) | 87.6 (7.2) | 57.1 (10.2) | 81.6 (5.8) | 93.5 (5.4) | 64.4 (8.5) | 84.8 (4.0) | 94.7 (4.0) | 59.0 (5.8) | 85.0 (3.1) |
|
| K-means | 94.2 (7.6) | 57.2 (7.6) |
| 93.7 (6.2) | 62.2 (10.5) |
| 96.0 (4.4) | 67.6 (10.3) |
| 95.8 (3.9) | 62.1 (10.3) |
|
aAUC: area under the receiver operating characteristic curve.
bSVDD: support vector data description.
cItalicized values indicates the top performing models.
dIncSVDD: incremental support vector data description.
eV-SVM: one-class support vector machine.
fNN: nearest neighbor.
gMST: minimum spanning tree.
hMOG: mixture of Gaussian.
iMCD: minimum covariance determinant.
jK-NN: K-nearest neighbor.
kLOF: local outlier factor.
lPCA: principal component analysis.
mSOM: self-organizing maps.
Average performance of each model across all the infection cases for the hourly data set with smoothing and different sample size. Fraction=0.01.
| Models | 1 month | 2 months | 3 months | 4 months | |||||||||
|
| AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | |
|
| |||||||||||||
|
| SVDDb | 97.4 (2.9) | 89.0 (3.4) | 89.4 (7.1) | 97.4 (1.8) | 86.7 (4.4) | 91.5 (10.9) | 97.2 (2.6) | 80.1 (5.5) | 93.5 (3.4) | 97.6 (1.7) | 81.8 (5.3) | 94.6 (6.0) |
|
| IncSVDDc | 97.1 (2.9) | 87.7 (2.7) | 89.5 (5.9) | 97.2 (1.8) | 86.4 (2.8) | 93.6 (4.8) | 97.0 (2.7) | 76.2 (6.3) | 93.2 (2.6) | 97.4 (1.7) | 79.0 (4.8) |
|
|
| V-SVMe | 98.1 (2.0) | 85.5 (0.6) |
| 98.9 (1.4) | 89.8 (0.2) |
| 98.7 (1.4) | 86.4 (0.4) |
| 99.0 (0.9) | 89.2 (0.3) |
|
|
| NNf | 93.2 (7.8) | 92.0 (2.4) | 83.9 (12.0) | 94.4 (2.5) | 88.4 (3.4) | 90.9 (5.3) | 93.3 (2.8) | 83.0 (3.7) | 92.0 (4.2) | 94.0 (2.8) | 82.9 (3.6) | 94.0 (4.0) |
|
| MSTg | 96.1 (2.6) | 94.4 (2.2) | 72.9 (18.5) | 97.3 (1.4) | 94.2 (2.1) | 86.1 (11.0) | 96.1 (2.1) | 93.5 (1.9) | 90.2 (7.3) | 97.0 (1.4) | 93.6 (1.7) | 92.6 (5.0) |
|
| |||||||||||||
|
| Gaussian | 98.4 (1.6) | 91.2 (2.6) | 89.6 (12.5) | 99.3 (0.9) | 92.3 (1.7) | 95.7 (4.9) | 98.8 (1.3) | 88.1 (4.0) | 95.9 (2.7) | 99.2 (0.7) | 89.8 (3.1) |
|
|
| MOGh | 97.5 (3.0) | 91.7 (3.2) | 87.8 (13.3) | 98.9 (1.2) | 90.9 (2.7) | 94.0 (6.3) | 98.2 (2.0) | 85.4 (6.6) | 94.2 (4.1) | 98.5 (1.5) | 88.0 (4.9) | 96.0 (3.1) |
|
| MCDi Gaussian | 98.5 (1.5) | 89.9 (3.7) |
| 99.5 (0.9) | 92.2 (92.2) |
| 98.9 (1.1) | 87.9 (3.3) |
| 99.2 (0.7) | 90.4 (3.4) |
|
|
| Parzen | 96.4 (2.6) | 97.8 (1.1) | 59.9 (18.9) | 98.0 (1.6) | 97.7 (1.1) | 79.5 (14.5) | 97.2 (2.3) | 96.4 (1.2) | 85.1 (10) | 98.1 (1.6) | 96.7 (1.1) | 88.6 (7.1) |
|
| Naïve Parzen | 96.4 (3.0) | 87.5 (3.5) | 85.1 (10.9) | 98.7 (1.5) | 89.2 (2.8) | 92.8 (7.5) | 96.0 (2.3) | 90.8 (2.6) | 95.0 (4.1) | 98.2 (1.6) | 90.0 (1.8) | 96.2 (2.8) |
|
| K-NNj | 97.6 (2.9) | 91.1 (1.6) | 87.6 (13.6) | 99.0 (1.4) | 92.4 (2.4) | 94.5 (6.6) | 98.4 (1.4) | 92.6 (1.4) | 95.7 (4.8) | 98.7 (1.1) | 93.3 (1.3) |
|
|
| LOFk | 96.9 (2.9) | 91.2 (1.6) | 86.2 (13.0) | 97.4 (1.8) | 89.8 (4.8) | 93.1 (4.9) | 95.0 (3.0) | 85.2 (4.6) | 92.9 (4.8) | 95.8 (1.7) | 85.3 (4.7) | 94.7 (3.2) |
|
| |||||||||||||
|
| PCAl | 97.4 (3.2) | 78.2 (6.1) | 82.5 (10.9) | 94.8 (3.8) | 77.6 (4.5) | 90.9 (3.6) | 92.6 (4.2) | 72.4 (3.8) | 92.5 (1.9) | 93.4 (3.2) | 71.1 (2.5) | 93.9 (1.1) |
|
| Auto-encoder | 95.4 (5.3) | 88.7 (9.5) | 86.1 (13.1) | 96.9 (3.2) | 87.1 (9.9) | 92.8 (6.4) | 95.0 (5.3) | 79.3 (14.5) | 93.1 (4.8) | 95.9 (4.3) | 80.3 (14.4) | 95.0 (3.6) |
|
| SOMm | 95.9 (2.9) | 91.6 (2.6) | 86.1 (14.4) | 95.7 (1.7) | 87.6 (4.1) | 92.7 (5.7) | 93.9 (3.5) | 79.1 (10.9) | 92.3 (4.5) | 96.0 (2.5) | 87.5 (7.0) | 96.1 (3.2) |
|
| K-means | 97.1 (3.9) | 89.7 (6.7) |
| 98.6 (1.7) | 91.1 (4.2) |
| 98.5 (1.5) | 92.3 (2.9) |
| 98.9 (1.0) | 93.9 (1.3) |
|
aAUC: area under the receiver operating characteristic curve.
bSVDD: support vector data description.
cIncSVDD: incremental support vector data description.
dItalicized values indicates the top performing models.
eV-SVM: one-class support vector machine.
fNN: nearest neighbor.
gMST: minimum spanning tree.
hMOG: mixture of Gaussian.
iMCD: minimum covariance determinant.
jK-NN: K-nearest neighbor.
kLOF: local outlier factor.
lPCA: principal component analysis.
mSOM: self-organizing maps.
Average performance of each model across all the infection cases for the daily smoothed data set (with filter) and different sample size. Fraction=0.01.
| Models | 1 month | 2 months | 3 months | 4 months | ||||||||||||
|
| AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | AUCa, mean (SD) | Specificity | F1 | ||||
|
| ||||||||||||||||
|
| SVDDb | 99.9 (0.7) | 100 (0.0) | 94.1 (14.2) | 100 (0.0) | 100 (0.0) | 96.1 (7.6) | 100 (0.0) | 100 (0.0) | 96.5 (6.5) | 100 (0.0) | 100 (0.0) | 97.9 (3.9) | |||
|
| 99.9 (0.7) | 100 (0.0) | 94.1 (14.2) | 100 (0.0) | 100 (0.0) | 96.9 (6.5) | 100 (0.0) | 100 (0.0) | 97.3 (5.9) | 100 (0.0) | 100 (0.0) | 98.6 (2.9) | ||||
|
| V-SVMd | 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| |||
|
| NNf | 90.1 (14.5) | 40.0 (30.5) | 69.5 (13.2) | 88.9 (9.9) | 33.1 (22.6) | 78.4 (6.8) | 89.2 (7.9) | 33.6 (14.6) | 77.7 (5.3) | 90.5 (6.8) | 23.5 (18.6) | 77.1 (5.7) | |||
|
| MSTg | 98.9 (3.6) | 85 (6.1) | 86.7 (9.4) | 99.8 (0.7) | 96.7 (3.4) | 95.1 (6.2) | 99.9 (0.2) | 98.9 (4.1) | 98.0 (3.5) | 99.9 (0.5) | 100 (0.0) | 98.0 (5.4) | |||
|
| ||||||||||||||||
|
| Gaussian | 99.2 (5.1) | 92.6 (9.0) | 87.2 (15.2) | 99.5 (2.5) | 96.7 (7.5) | 94.8 (10.4) | 99.9 (0.4) | 100 (0.0) | 98.1 (4.9) | 99.8 (0.8) | 100 (0.0) | 98.3 (5.9) | |||
|
| MOGh | 98.8 (5.4) | 92.9 (8.6) | 85.2 (17.1) | 99.4 (2.6) | 97.0 (5.4) | 92.1 (11.6) | 99.9 (0.4) | 99.9 (0.7) | 95.4 (7.8) | 99.8 (1.0) | 99.9 (0.6) | 96.4 (7.7) | |||
|
| MCDi Gaussian | 98.4 (5.6) | 86.6 (8.8) | 86.6 (11.9) | 99.3 (2.7) | 90.0 (8.7) | 93.4 (8.1) | 99.8 (0.5) | 99.2 (2.6) | 98.0 (5.3) | 99.8 (0.9) | 97.1 (3.9) | 97.0 (5.5) | |||
|
| Parzen | 99.2 (3.5) | 100 (0.0) | 90.8 (16.4) | 99.9 (0.4) | 100 (0.0) | 93.7 (9.8) | 100 (0.0) | 100 (0.0) | 93.6 (8.9) | 99.9 (0.3) | 100 (0.0) | 95.8 (8.2) | |||
|
| Naïve Parzen | 99.8 (1.2) | 100 (0.0) | 94.4 (14.6) | 100 (0.0) | 100 (0.0) | 96.1 (7.9) | 99.9 (0.5) | 100 (0.0) | 97.4 (5.6) | 100 (0.0) | 100 (0.0) | 98.2 (4.2) | |||
|
| K-NNj | 99.5 (2.0) | 91.6 (3.6) |
| 99.9 (0.4) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| |||
|
| LOFk | 99.6 (1.5) | 93.3 (7.3) | 92.4 (10.6) | 99.9 (0.5) | 99.2 (3.4) | 97.1 (7.3) | 99.9 (0.2) | 98.6 (2.8) | 97.4 (4.5) | 99.9 (0.4) | 100 (0.0) | 98.2 (5.9) | |||
|
| ||||||||||||||||
|
| PCAl | 93.8 (6.7) | 82.0 (7.3) | 83.8 (10.4) | 91.3 (4.3) | 77.9 (7.3) | 89.3 (8.7) | 88.7 (5.9) | 76.3 (8.6) | 89.5 (5.3) | 90.7 (3.6) | 76.2 (8.6) | 89.0 (6.9) | |||
|
| Auto-encoder | 97.0 (8.1) | 91.6 (14.6) | 87.7 (16.0) | 98.1 (5.4) | 92.6 (15.3) | 92.0 (10.7) | 98.6 (4.6) | 92.8 (14.8) | 94.0 (8.3) | 98.7 (4.0) | 92.7 (15.8) | 94.9 (7.7) | |||
|
| SOMm | 99.1 (3.2) | 99.9 (0.6) | 85.2 (20.5) | 99.8 (0.7) | 100 (0.0) | 88.9 (16.1) | 99.9 (0.2) | 100 (0.0) | 94.6 (8.0) | 99.8 (0.6) | 100 (0.0) | 95.9 (8.1) | |||
|
| K-means | 99.8 (1.2) | 96.2 (6.0) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| 100 (0.0) | 100 (0.0) |
| |||
aAUC: area under the receiver operating characteristic curve.
bSVDD: support vector data description.
cIncSVDD: incremental support vector data description.
dV-SVM: one-class support vector machine.
eItalicized values indicates the top performing models.
fNN: nearest neighbor.
gMST: minimum spanning tree.
hMOG: mixture of Gaussian.
iMCD: minimum covariance determinant.
jK-NN: K-nearest neighbor.
kLOF: local outlier factor.
lPCA: principal component analysis.
mSOM: self-organizing maps.
Average area under the receiver operating characteristic curve, specificity, and F1-score for both with and without smoothed versions of the data. The parameters kd and kh represent the optimal number of nearest neighbors for the daily and hourly cases, respectively.
| Frequencies, density-based methods | ||||||||||||||
|
| Pre-pro | Models (threshold) | 1st case of infection (kd=30, kh=240) | 2nd case of infection (kd=30, kh=240) | 3rd case of infection (kd=30, kh=240) | 4th case of infection (kd=30, kh=240) | ||||||||
|
|
|
| AUCa | Specific | F1 | AUCa | Specific | F1 | AUCa | Specific | F1 | AUCa | Specific | F1 |
|
| ||||||||||||||
|
| Without filter | LOFb (T1=2.4, T2=1.2, T3=1.45, T4=1.8)c | 75.0 | 50.0 |
| 90.0 | 100 | 67.4 | 92.1 | 66.7 |
| 98.2 | 100 |
|
|
|
| COFd (T1=1.4, T2=1.3, T3=1.4, T4=1.4) | 82.1 | 66.7 | 72.6 | 97.4 | 100 |
| 75.2 | 66.7 | 67.6 | 96.7 | 100 | 71.8 |
|
| With filter | LOFb (T1=1.7, T2=1.6, T3=1.95, T4=2.2) | 99.0 | 100 |
| 99.2 | 100 |
| 100 | 100 |
| 99.9 | 100 | 94.7 |
|
|
| COFd
| 97.6 | 100 | 76.6 | 97.9 | 100 | 77.6 | 99.5 | 100 | 88.8 | 100 | 100 |
|
|
| ||||||||||||||
|
|
| LOFb (T1=1.4, T2=1.3, T3=1.35, T4=1.5) | 98.0 | 86.0 |
| 95.5 | 100 |
| 94.3 | 91.4 |
| 85.2 | 72.6 |
|
|
|
| COFd (T1=1.2, T2=1.1, T3=, T4=1.1) | 92.4 | 88.4 |
| 77.0 | 66.0 | 62.5 | 90.3 | 82.7 | 74.6 | 82.6 | 82.2 | 63.7 |
aAUC: area under the receiver operating characteristic curve.
bLOF: local outlier factor.
cTk: threshold for the kth month.
dCOF: connectivity-based outlier factor.
Figure 5Plot of models’ average computational time for the training phase. The x-axis depicts the sample size, and each label stands for total sample size divided by 24. The y-axis depicts the computational time required by each model. Gauss: Gaussian; IncSVDD: incremental support vector data description; K-NN: K-nearest neighbor; LOF: local outlier factor; MCD: minimum covariance determinant; MOG: mixture of Gaussian; MST: minimum spanning tree; NN: nearest neighbor; NParzen: naïve Parzen; PCA: principal component analysis; SOM: self-organizing maps; SVDD: support vector data description; V-SVM: one-class support vector machine.
Figure 6Plot of models’ average computational time for the testing phase. The x-axis depicts the sample size, and each label stands for total sample size divided by 24. The y-axis depicts the computational time required by each model. Gauss: Gaussian; IncSVDD: incremental support vector data description; K-NN: K-nearest neighbor; LOF: local outlier factor; MCD Gauss: Gaussian: SOM: self-organizing maps; MOG: mixture of Gaussian; MST: minimum spanning tree; NN: nearest neighbor; NParzen: naïve Parzen; PCA: principal component analysis; SVDD: support vector data description; V-SVM: one-class support vector machine.
Figure 7Quadrants of wellness in people with type 1 diabetes. The figure depicts the 4 possible scenarios of different parameters: carbohydrate action, insulin action, physical activity action, and abnormality because of metabolic change such as infection and stress. BG: blood glucose; PA: physical activity.
Figure 8Average performance (F1-score) of each model across all the infection cases. AE: auto-encoder; Gauss: Gaussian; IncSVDD: incremental support vector data description; K-NN: K-nearest neighbor; LOF: local outlier factor; MCD: minimum covariance determinant; MOG: mixture of Gaussian; MST: minimum spanning tree; NN: nearest neighbor; NP: naïve Parzen; PCA: principal component analysis; SOM: self-organizing maps; SVDD: support vector data description; V-SVM: one-class support vector machine.
Rough estimation of average training and testing time required by the different classifiers.
| Methods | Training time, mean (SD) | Testing time, mean (SD) | |||
|
| |||||
|
| SVDDa | 105.2 (2.03) | 0.008 (0.002) | ||
|
| IncSVDDb | 0.05 (0.16) | 2.41 (0.83) | ||
|
| K-means | 0.0047 (0.0014) | 0.0032 (0.0010) | ||
|
| Gaussian | 0.0055 (0.0032) | 0.0032 (0.0012) | ||
|
| MOGc | 0.076 (0.018) | 0.0036 (0.0011) | ||
|
| MCDd Gaussian | 0.27 (0.075) | 0.0034 (0.0015) | ||
|
| SOMe | 21.62 (5.91) | 0.0033 (0.00087) | ||
|
| K-NNf | 0.51 (0.11) | 0.52 (0.12) | ||
|
| Parzen | 2.02 (0.41) | 0.21 (0.052) | ||
|
| Naïve Parzen | 4.02 (0.82) | 0.40 (0.10) | ||
|
| LOFg | 1.15 (0.28) | 1198.05 (323.07) | ||
|
| NNh | 151.34 (22.52) | 0.18 (0.024) | ||
|
| MSTi | 2.39 (0.31) | 1.24 (0.19) | ||
|
| PCAj | 0.046 (0.20) | 0.0031 (0.00086) | ||
|
| Auto-encoder | 0.65 (0.094) | 0.017 (0.0034) | ||
|
| V-SVMk | 0.32 (0.024) | 0.035 (0.0066) | ||
|
| |||||
|
| LOFl | N/Am | 0.2 (0.0) | ||
|
| COFn | N/A | 82.8 (1.5) | ||
aSVDD: support vector data description.
bIncSVDD: incremental support vector data description.
cMOG: mixture of Gaussian.
dMCD: minimum covariance determinant.
eSOM: self-organizing maps.
fK-NN: K-nearest neighbor.
gLOF: local outlier factor.
hNN: nearest neighbor.
iMST: minimum spanning tree.
jPCA: principal component analysis.
kV-SVM: one-class support vector machine.
lLOF: local outlier factor.
mN/A: not applicable.
nCOF: connectivity-based outlier factor.