| Literature DB >> 35415447 |
Taiyu Zhu1, Kezhi Li1, Jianwei Chen1, Pau Herrero1, Pantelis Georgiou1.
Abstract
Diabetes is a chronic disease affecting 415 million people worldwide. People with type 1 diabetes mellitus (T1DM) need to self-administer insulin to maintain blood glucose (BG) levels in a normal range, which is usually a very challenging task. Developing a reliable glucose forecasting model would have a profound impact on diabetes management, since it could provide predictive glucose alarms or insulin suspension at low-glucose for hypoglycemia minimisation. Recently, deep learning has shown great potential in healthcare and medical research for diagnosis, forecasting and decision-making. In this work, we introduce a deep learning model based on a dilated recurrent neural network (DRNN) to provide 30-min forecasts of future glucose levels. Using dilation, the DRNN model gains a much larger receptive field in terms of neurons aiming at capturing long-term dependencies. A transfer learning technique is also applied to make use of the data from multiple subjects. The proposed approach outperforms existing glucose forecasting algorithms, including autoregressive models (ARX), support vector regression (SVR) and conventional neural networks for predicting glucose (NNPG) (e.g. RMSE = NNPG, 22.9 mg/dL; SVR, 21.7 mg/dL; ARX, 20.1 mg/dl; DRNN, 18.9 mg/dL on the OhioT1DM dataset). The results suggest that dilated connections can improve glucose forecasting performance efficiently.Entities:
Keywords: Continuous glucose monitor (CGM); Deep learning; Diabetes; Dilated recurrent neural network; Glucose forecasting
Year: 2020 PMID: 35415447 PMCID: PMC8982716 DOI: 10.1007/s41666-020-00068-2
Source DB: PubMed Journal: J Healthc Inform Res ISSN: 2509-498X
Fig. 1The architecture of the proposed BG forecast model using DRNN layers. There are some missing gaps or outliers, which are shown in orange in the diagram that need to be corrected by pre-processing
Fig. 2The pre-processed BG training data on Jan 2. Interpolation fills up a small missing interval that the 10 zeros between 7:35 and 8:25, and median filter removes the outliers and spikes on the curve [23]
Fig. 3The unrolled structure of vanilla RNN cells
RMSE performance for traditional LSTM and different DRNN models
| Model | Traditional LSTM | DRNN (Vanilla) | DRNN (LSTM) | DRNN (GRU) |
|---|---|---|---|---|
| Simulated dataset | 9.2∗∗ | 7.8 | 7.9 | 7.9 |
| OhioT1DM dataset | 21.0∗∗ | 18.9 | 20.2∗ | 19.9 |
| Number of parameters | 4( | 4( | 3( |
∗p ≤ 0.05∗∗p ≤ 0.01
Fig. 4An illustration of dilated connections of multi-layer RNN architecture. The dilation in three layers increases exponentially from 1 to 4
Fig. 5An illustration of making batches with real examples from the clinical dataset. Sliding down the windows, we obtain multiple entries in a batch
DRNN hyperparameters
| Parameter | Value |
|---|---|
| Length of sequences | 12 timesteps |
| Input channels | [G, I, M, T] |
| Cell type | Vanilla RNN |
| Dilation | [1, 2, 4] |
| Hidden node dimension in each layer | 32 |
| RMSprop learning rate | 0.001 |
| RMSprop decay rate | 0.9 |
| Batch size | 512 |
Prediction performance for the 10 simulated T1DM subjects
| Method | DRNN | NNPG | SVR | ARX |
|---|---|---|---|---|
| RMSE (mg/dl) | 7.8 ± 0.6 | 13.1 ± 1.2∗∗ | 11.9 ± 1.4∗∗ | 11.3 ± 0.8∗∗ |
| MARD (%) | 4.8 ± 0.6 | 7.2 ± 1.1∗∗ | 6.1 ± 0.8∗∗ | 6.8 ± 0.9∗∗ |
| 0.4 ± 0.3 | 9.3 ± 1.8∗∗ | 6.8 ± 1.6∗∗ | 4.8 ± 1.5∗∗ |
∗p ≤ 0.05∗∗p ≤ 0.01
Fig. 6The forecasting curves for virtual adult 1 in one-day period. It is an average scenario from the testing dataset
Prediction performance for the 6 clinical T1DM subjects
| Subject | 559 | 563 | 570 | 575 | 588 | 591 | Avg ± SD |
|---|---|---|---|---|---|---|---|
| DRNN | |||||||
| RMSE (mg/dl) | 18.6 | 18.0 | 15.3 | 22.7 | 17.6 | 21.1 | 18.9 ± 2.6 |
| MARD (%) | 8.6 | 8.0 | 5.6 | 10.3 | 7.9 | 11.9 | 8.7 ± 2.2 |
| 4.8 | 6.3 | 2.2 | 12.3 | 8.6 | 14.1 | 8.1 ± 4.1 | |
| NNPG | |||||||
| RMSE (mg/dl) | 23.3 | 21.2 | 19.0 | 27.4 | 21.9 | 24.9 | 22.9 ± 2.9∗∗ |
| MARD (%) | 10.2 | 9.4 | 7.1 | 13.4 | 9.7 | 14.8 | 10.8 ± 2.8∗∗ |
| 12.8 | 16.4 | 11.8 | 21.4 | 13.6 | 22.6 | 16.4 ± 4.2∗∗ | |
| SVR | |||||||
| RMSE (mg/dl) | 23.5 | 18.3 | 20.4 | 23.6 | 20.6 | 23.5 | 21.7 ± 1.9∗ |
| MARD (%) | 9.8 | 8.3 | 6.2 | 10.2 | 8.4 | 12.6 | 9.2 ± 2.2∗ |
| 11.4 | 15.8 | 10.3 | 20.4 | 12.3 | 21.4 | 15.3 ± 4.3∗∗ | |
| ARX | |||||||
| RMSE (mg/dl) | 18.7 | 19.6 | 16.8 | 23.6 | 19.5 | 22.4 | 20.1 ± 2.5∗ |
| MARD (%) | 8.0 | 8.4 | 6.0 | 10.6 | 8.3 | 12.0 | 8.9 ± 4.0 |
| 8.4 | 13.4 | 8.6 | 18.1 | 11.8 | 21.3 | 13.6 ± 4.8∗∗ | |
∗p ≤ 0.05 ∗∗p ≤ 0.01
Fig. 7The prediction curves for clinical subject 570 on Jan 21 from the testing dataset. There is a missing interval between 17:10 and 18:00, which contains 10 BG measurements