| Literature DB >> 31694629 |
Wonju Seo1, You-Bin Lee2, Seunghyun Lee1, Sang-Man Jin3, Sung-Min Park4.
Abstract
BACKGROUND: For an effective artificial pancreas (AP) system and an improved therapeutic intervention with continuous glucose monitoring (CGM), predicting the occurrence of hypoglycemia accurately is very important. While there have been many studies reporting successful algorithms for predicting nocturnal hypoglycemia, predicting postprandial hypoglycemia still remains a challenge due to extreme glucose fluctuations that occur around mealtimes. The goal of this study is to evaluate the feasibility of easy-to-use, computationally efficient machine-learning algorithm to predict postprandial hypoglycemia with a unique feature set.Entities:
Keywords: Diabetes; Hypoglycemia; Machine-learning approach; Risk prediction
Mesh:
Substances:
Year: 2019 PMID: 31694629 PMCID: PMC6833234 DOI: 10.1186/s12911-019-0943-4
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Clinical characteristics of enrolled study subjects
| Type-1 diabetes (55 three-day CGM datasets in 52 patients) | Type-2 diabetes (52 three-day CGM datasets in 52 patients) | |
|---|---|---|
| Age (year) | 40.0 (29.0-52.0) | 63.5 (54.3-68.0) |
| Sex (male:female) | 21:34* | 21:31 |
| Body weight (kg) | 60.48 (52.35-69.41) | 60.75 (54.60-70.37) |
| BMI (kg/ | 22.85 ±3.26 | 24.60 ±2.62 |
| Duration of diabetes (years) | 11.0 (6.0-18.0) | 19.0 (13.3-25.0) |
| Insulin therapy (with insulin therapy: without insulin therapy) | 55*:0 | 43:9 |
| Insulin regimen basal:intermediate-acting: premix:MDI:CSII | 3:1:6:44:1 | 20:3:11:9:0 |
| Daily insulin dose (IU/day) | 42.3 ±17.7 | 28.6 ±18.1 |
| Daily insulin dose per body weight (IU/day/kg) | 0.68 (0.53-0.82) | 0.50 (0.30-0.60) |
| eGFR (ml/min/1.73 | 83.05 (71.98-96.95) | 70.40 (51.30-82.50) |
| End stage renal disease [n (%)] | 4 (7.3) | 2 (3.8) |
| Liber cirrhosis [n (%)] | 2 (3.6) | 0 (0.0) |
| Heart failure with reduced ejection fraction [n (%)] | 0 (0.0) | 1 (1.9) |
| Pancreatic resection [n (%)] | 2 (3.6)† | 0 (0.0) |
| Acute infection [n (%)] | 0 (0.0) | 1 (1.9) |
| Pregnancy [n (%)] | 1 (1.8) | 0 (0.0) |
| Hemoglobin A1C (%) | 7.94 ±1.13 | 8.31 ±1.32 |
| C-peptide (ng/mL) | 0.02 (0.02-0.15) | 1.46 (0.80-2.44) |
Continuous variables with normal distributions are expressed as mean ± standard deviation, whereas continuous variables with non-normal distributions were expressed as median (interquartile range)
*Three female patients on insulin therapy were included twice because they participated twice
One of these two patients underwent total pancreatectomy; the other went through Whipple’s operation. Abbreviations: CGM, continuous glucose monitoring; BMI, body mass index; MDI, multiple daily injections; CSII, continuous subcutaneous insulin infusion; eGFR, estimated glomerular filtration rate
Fig. 1Representative CGM time-series data to show different reactions of selected patients’ glucose levels after meals. Blue line: CGM time-series data points; red line and transparent red box: CGM data point <3.9 mmol/L (70 mg/dL); magenta filled circle: CGM data point at the meal; red filled circle: peak CGM data point after the meal; green filled circle: CGM data point at the time of prediction. Clinical explanations: a No peak of CGM data point could occur because the patient ate a small amount of carbohydrates in the meal. b Low peak after the meal, then rapid fall in glucose could occur because patient ate a small amount of carbohydrates in the meal. c Steep peak, then rapid fall in glucose could occur when the patient ate foods rich in carbohydrate with high glycemic index or injected rapid-acting insulin later than he or she should have. d A rapid fall and then no peak after the meal could occur when the insulin injected before the previous meal is still active (insulin on board)
Fig. 2The three features and the 30-min prediction horizon. Blue line: CGM time-series data points; red line: CGM data point <3.9 mmol/L (70 mg/dL); magenta filled circle: CGM data point at the meal; red filled circle: peak CGM data point after the meal; green filled circle: CGM data point at the time of prediction; black arrow: rate of increase in glucose (RIG); red arrow: glucose rate of change (GRC); transparent yellow box: observational window; transparent green box: the 30-min prediction horizon
Fig. 3Flowchart of the proposed approach including data-preprocessing, and how to train RF, SVM-LN, SVM-RBF, KNN, and LR. Since the 5-fold cross-subject validation was used, training and testing models were repeated by 5 times. For each iteration, each model’s result and tuned hyper-parameters was saved
Average and standard deviation of metrics of models with 5-fold cross-subject validation
| Model | Sen (%,SD) | Spe (%,SD) | F1 score (SD) | AUC (SD) | NH (SD) | FAR (SD) | DT (min,SD) | |
| RF | 89.6 | 91.3 | 0.543 | 0.966 | 36.4 | 30.2 | 0.704 | 25.5 |
| (2.78) | (2.03) | (0.053) | (0.007) | (11.0) | (8.42) | (0.035) | (1.97) | |
| SVM | 93.3 | 88.2 | 0.487 | 0.967 | 36.4 | 29.2 | 0.777 | 25.8 |
| -LN | (1.70) | (2.83) | (0.046) | (0.007) | (11.0) | (8.30) | (0.034) | (2.12) |
| SVM | 89.9 | 88.8 | 0.487 | 0.952 | 36.4 | 29.4 | 0.760 | 25.2 |
| -RBF | (8.65) | (2.96) | (0.062) | (0.014) | (11.0) | (9.20) | (0.038) | (3.22) |
| KNN | 88.5 | 89.4 | 0.492 | 0.917 | 36.4 | 29.6 | 0.779 | 25.8 |
| (1.93) | (2.09) | (0.054) | (0.012) | (11.0) | (8.73) | (0.038) | (3.76) | |
| LR | 93.6 | 87.9 | 0.484 | 0.967 | 36.4 | 29.6 | 0.772 | 25.0 |
| (2.25) | (2.95) | (0.047) | (0.007) | (11.0) | (8.71) | (0.037) | (2.87) |
With the 5-fold cross-subject validation, average metrics were calculated using Eq 6, 7, 9, and 10 on test set,q=1,2,3,4,5. Since there should be at least two consecutive predictions of a hypoglycemia alert value to make an alarm, we excluded hypoglycemic events occurring immediately after meals. Abbreviation: RF, random forest; SVM-LN, support vector machine with a linear kernel; SVM-RBF, support vector machine with a radial basis function; KNN, K-nearest neighbor; LR, logistic regression; SD, standard deviation; Sen, sensitivity; Spe, specificity; AUC, the area under the ROC curve; NH, the number of hypoglycemic events; FAR, false alarm rate; DT, detection time.
Fig. 4ROC curves for different models. In each iteration of the 5-fold cross-subject validation, the hyper-parameters of the models were determined by the grid search method. a ROC curves of RF. b ROC curves of SVM-LN. c ROC curves of SVM-RBF. d ROC curves of KNN. e ROC curves of LR. Each colored dashed line represents the ROC curve of each fold. The red dash-dot line indicates a random prediction (i.e., AUC = 0.5)