Literature DB >> 35813300

A Clinical Decision Support System for Diabetes Patients with Deep Learning: Experience of a Taiwan Medical Center.

Ting-Ying Chien1,2,3, Hsien-Wei Ting3,4, Chih-Fang Chen5, Cheng-Zen Yang1,3, Chong-Yi Chen1.   

Abstract

Background: Diabetes mellitus (DM) is a major public health problem worldwide. It involves dysfunction of blood sugar regulation resulting from insulin resistance, inadequate insulin secretion, or excessive glucagon secretion.
Methods: This study collated 971,401 drug usage records of 51,009 DM patients. These data include patient identification code, age, gender, outpatient visiting dates, visiting code, medication features (included items, doses, and frequencies of drugs), HbA1c results, and testing time. We apply a random forest (RF) model for feature selection and implement a regression model with the bidirectional long short-term memory (Bi-LSTM) deep learning architecture. Finally, we use the root mean square error (RMSE) as the evaluation index for the prediction model.
Results: After data cleaning, the data included 8,729 male and 9,115 female cases. Metformin was the most important feature suggested by the RF model, followed by glimepiride, acarbose, pioglitazone, glibenclamide, gliclazide, repaglinide, nateglinide, sitagliptin, and vildagliptin. The model performed better with the past two seasons in the training data than with additional seasons. Further, the Bi-LSTM architecture model performed better than support vector machines (SVMs). Discussion &
Conclusion: This study found that Bi-LSTM models is a well kernel in a CDSS which help physicians' decision-making, and the increasing the number of seasons will negative impact the performance. In addition, this study found that the most important drug is metformin, which is recommended as first-line treatment OHA in various situations for DM patients. © The author(s).

Entities:  

Keywords:  Bidirectional Long Short-Term Memory (Bi-LSTM); Clinical Decision Support System (CDSS); Deep Learning; Diabetes Mellitus; Electronic Health Record; Oral Hypoglycemic Agents

Mesh:

Substances:

Year:  2022        PMID: 35813300      PMCID: PMC9254376          DOI: 10.7150/ijms.71341

Source DB:  PubMed          Journal:  Int J Med Sci        ISSN: 1449-1907            Impact factor:   3.642


Introduction

Diabetes mellitus (DM) is a major public health problem worldwide. It involves dysfunction of blood sugar regulation resulting from insulin resistance, inadequate insulin secretion, or excessive glucagon secretion 1. There are two types of DM. Type 1 DM is usually due to an autoimmune disorder and involves the destruction of pancreatic beta-cells. Type 2 DM is caused by impairment of glucose regulation due to the malfunction of pancreatic beta cells or insulin resistance 1. Treatment using oral hypoglycemic agents (OHA) for type 2 DM may have negative side effects, such as hypoglycemia. Therefore, it is crucial to ensure the safety and efficacy of OHA usage 1-7. Clinical decision support systems (CDSSs), which integrate electronic health records (EHRs) and expert knowledge, have improved the decision-making of physicians and medical caregivers 8-12. Many methods have been developed for CDSSs, including linear/logistic regression, support vector machines (SVMs), decision trees, random forest (RF), rough sets, and trajectory methods 13-19. Glycated hemoglobin (HbA1c) is extensively studied in these approaches because it is a good indicator of DM control. DM patients with higher Hba1c measures are more likely to experience renal diseases, macrovascular events, cardiovascular diseases, retinopathies, skin ulceration/gangrene, and high mortality 3. A well-controlled HbA1c value plays an important role in DM management. CDSSs have been shown to be effective in supporting HbA1c control. For example, O'Connor et al. showed that the HbA1c of DM patients significantly improve when the physicians use a CDSS compared with when physicians do not use a CDSS (p < 0.01). Moreover, 94% of physicians using the CDSS were satisfied for this application and physicians continued to use the CDSS for more than one year without research funding support 20. Recently, deep learning methods have dramatically improved different fields of medical care and research 21, 22. They have also been used as the core methods to build the CDSS 23, 24. For example, convolutional neural networks (CNNs) are used to process image data and recurrent neural networks (RNNs) are used for sequential pattern problems 23, 25. Sun et al. proposed a method to predict blood sugar levels at four intervals, namely 15, 30, 45, and 60 minutes, using the long short-term memory (LSTM) model and the bidirectional-LSTM (Bi-LSTM) model 26. Therefore, we devise a CDSS using a Bi-LSTM model with HbA1c as the outcome index for managing OHA usage. The structure of the proposed CDSS, the LSTM model, and the Bi-LSTM model are shown in Figure 1.
Figure 1

Structure of the clinical decision support system (CDSS), Long short time model (LSTM) and bidirectional-LSTM (Bi-LSTM) model. CDSS: clinical decision support system; EHR: electronic health records; Bi-LSTM: Bidirectional long short-term memory network model.

Materials and Methods

We collated 971,401 drug usage records of 51,009 diabetes mellitus (DM) patients from January 2012 to December 2014 (12 seasons) and 313,165 laboratory records of 74,792 DM patients in a medical center from January 2012 to June 2015 (14 seasons). These data included patient identification code, age, gender, outpatient visiting dates, visiting code, medication features (included items, doses, and frequencies of drugs), HbA1c results, and testing time. The data were combined and cleansed. Twelve seasons of data and 17,844 DM patients were included in this study. The data were evaluated with five-fold cross-validation (training data = 80% and testing data = 20%) (Figure 2). We applied an RF model with mean square error (MSE) for feature selection where higher mean decrease MSE indicated more important parameters 27-29. OHA dosages and codes were collected. This study was approved by the Institutional Review Board (IRB) of the MacKay Memorial Hospital (IRB approval number: 15MMHIS143e).
Figure 2

Data management flowchart. R: visiting times; P: patient case numbers; Q: seasonal times; Lab. data: laboratory data.

We implemented the Bi-LSTM structure using PyTorch running on two personal computers equipped, respectively, with Ubuntu 16.04 and Ubuntu 18.04. The GPU environments were NVIDIA GeForce GTX 980 and GTX 1080 Ti. We used Grid search to adjust the model parameters: epoch (100); batch size (32, 64, 128); hidden layer neuron numbers of Bi-LSTM (32, 64, 128); dropout rate (0, 0.2) between Bi-LSTM; optimizers AMSBound 30 and Adadelda 31; and learning rate (AMSBound used 0.001 and the Adadelda used 1.0). We used early stopping (patience = 30) and L2 normalization (weight decay = 0.0005) to solve the problem of overfitting and applied gradient clipping (clip norm = 5) to prevent exploding gradient problems. We also implemented a support vector regression (SVR) model using Rgtsvm in R to compare with the Bi-LSTM model 18. To evaluate the models, we used root mean square error (RMSE) (), sensitivity (), specificity (), and the Matthews correlation coefficient (MCC) (). We applied Pearson's chi-squared test and the student t-test for data analysis. The statistical analysis was conducted using SPSS version 19.0 (SPSS Inc., Chicago, IL, USA). Statistical significance was defined as p < 0.05.

Results

Of the included 17,844 cases, 8729 (49.0%) were male and 9115 (51.0%) were female. The mean age was 62.3 years old (SD = 11.9) overall, 60.4 (SD = 11.8) for males, and 64.2 (11.7) for females. The 45 to 64 year old age had the most cases (8,507 cases), followed by those aged above 65(6,966 cases). The mean Hba1c was 7.6% (SD = 1.7). There were 13,346 cases with Hba1c higher than 6.5% and 4,498 cases whose Hba1c were less than 6.5% (Table 1).
Table 1

Demographic data of training cases

SexMaleFemaleAll
Case number (%)8,729 (49.0)9,115 (51.0)17,844
Mean age (SD)60.4 (11.8)64.2 (11.7)62.3 (11.9)
Age rank
<25 years old222042
25-44 years old7323731,105
45-64 years old4,5313,9768,507
≥ 65 years old2,8824,0846,966
Unknown age5626621,224
Mean Hba1c (SD)7.6% (1.8)7.6% (1.7)7.6% (1.7)
Hba1c ≤ 6.5% (%)2,337 (26.8%)2,161 (23.7%)4,498 (25.2%)
Hba1c > 6.5% (%)6,392 (73,2%)6,954 (76,3%)13,346 (74.8%)
The data included 11 types of drugs. Compound drugs glimepiride (25,719), pioglitazone (25,720), and vildagliptin (25,726) were combined with metformin dosages of 500, 850, and 1000 mg, respectively. Nateglinide had dosages of 60 and 120 mg. The most important feature was metformin with mean decreased MSE = 171.8, followed by glimepiride (156.6), acarbose (151.8), pioglitazone (148.1), glibenclamide (143.7), gliclazide (114,1), repaglinide (93.3), nateglinide (80.6), sitagliptin (74.0), and vildagliptin (21.9) (Table 2).
Table 2

Oral hypoglycemic agents (OHA) mean decreased mean square error (MSE), dosages, and codes

OHA₸ Mean Decrease MSEDosage
AllMaleFemale≥65 y/o<65 y/o
Metformin171.8108.69117.6194.56114.81500mg
#Glimepiride156.677.13100.0089.7272.882mg
Acarbose151.898.8155.4480.2049.18100mg
#Pioglitazone148.124.3129.5712.1426.6515mg
Glibenclamide143.733.7873.2852.0163.915mg
Gliclazide114.197.9694.6096.9582.9430mg
Repaglinide93.342.0050.4037.4151.211mg
§Nateglinide80.68.7520.6519.3923.1060mg, 120mg
Sitagliptin74.055.3376.4391.8745.17100mg
#Vildagliptin71.144.9535.6444.7342.0450mg
Linagliptin21.98.924.85-0.382.755mg

# These three compound drugs are all combined with metformin. Glimepiride, pioglitazone, and vildagliptin have 500, 850, and 1000 mg of metformin added, respectively.

§ Nateglinide has two dosages: 60 and 120 mg.

₸ MSE: mean square error.

This study treated every season as ground truth from 2013 Q1 to 2015 Q1 and constructed nine datasets, each having a different sample size. For example, the dataset of 2014 Q4 had 12,677 and 3169 cases as training and test samples, respectively. Using other data as independent factors, we designed three kinds of models. The first used two seasons of data to predict drug usage of the third season. For example, model 9 (2015 Q1) used 2014 Q3 and 2014 Q4 to predict 2015 Q1. The other two types of model used three/four seasons to predict the drugs of the fourth/fifth seasons. This study also evaluated differences in Hba1c between seasons. For example, we calculated the differences in mean Hba1c between 2015 Q1 and 2014 Q4 (0.87%), 2014 Q3 (0.98%) and 2014 Q2 (1.09%). We found that longer time distances had greater differences in Hba1c (Table 3).
Table 3

Research design and Hba1c differences between first and last seasons in each model. There are nine datasets. The models use two/three/four seasons to predict the drugs for the third/fourth/fifth seasons.

DatasetGround truthTwo seasonsThree seasonsFour seasonsTraining sampleTesting sample
Time periodHBa1 differenceTime periodHBa1 differenceTime periodHBa1 difference
92015 Q12014 Q3 -2014 Q40.872014 Q2 -2014 Q40.982014 Q1 -2014 Q41.09123343084
82014 Q42014 Q2 -2014 Q30.882014 Q1 -2014 Q31.022013 Q4 -2014 Q31.10126773169
72014 Q32014 Q1 -2014 Q20.922013 Q4 -2014 Q21.022013 Q3 -2014 Q21.1486262156
62014 Q22013 Q4 -2014 Q10.902013 Q3 -2014 Q11.072013 Q2 -2014 Q11.23123623090
52014 Q12013 Q3 -2013 Q40.992013 Q2 -2013 Q41.172013 Q1 -2013 Q41.25124743119
42013 Q42013 Q2 -2013 Q31.082013 Q1 -2013 Q31.172012 Q4 -2013 Q31.30122663066
32013 Q32013 Q1 -2013 Q21.022012 Q4 -2013 Q21.182012 Q3 -2013 Q21.30118262957
22013 Q22012 Q4 -2013 Q11.032012 Q3 -2013 Q11.192012 Q2 -2013 Q11.30114422861
12013 Q12012 Q3 -2012 Q41.032012 Q2 -2012 Q41.182012 Q1 -2012 Q41.3181552039
We compared Bi-LSTM and SVM in the two-, three-, and four-season models. The RMSE of both two-season models (Bi-LSTM = 1.05±0.17 and. SVM = 1.05±0.07) was the best, followed by the three-season models (Bi-LSTM = 1.10±0.25 and. SVM = 1.12±0.03) and four-season models (Bi-LSTM = 1.09±0.21 and SVM = 1.16±0.04). The sensitivity of the SVM models were not significantly different to each other (two seasons: 0.88±0.03, three seasons: 0.88±0.02, four seasons: 0.89±0.02). The sensitivity of the Bi-LSTM models gradually decreased non-significantly (two seasons: 0.83±0.16, three seasons: 0.80±0.21, four seasons: 0.77±0.23), but performed worse than the SVM models. The specificity of the SVM models gradually decreased as the included seasons increased (two seasons: 0.68±0.05, three seasons: 0.64±0.05, four seasons: 0.59±0.04). The specificity of the Bi-LSTM models was not significant for any approach (two seasons: 0.69±0.32, three seasons: 0.71±0.31, four seasons: 0.71±0.30). According to the MCC evaluation, there were no significant differences between the six models. The two-season SVM model (0.39±0.06) had the shortest run time, followed by the three-season (0.47±0.09) and four-season (0.52±0.06) SVM models. The Bi-LSTM models had significantly longer run times (two seasons: 3.39±0.64, three seasons: 5.18±1.07, four seasons: 5.36±1.17) than the SVM models (Table 4).
Table 4

Comparison of Bi-LSTM and SVM models with different seasons

Seasonal modelEvaluation indexSVMBi-LSTM
Two seasonsRMSE1.05±0.071.05±0.17
Sensitivity0.88±0.030.83±0.16
Specificity0.68±0.050.69±0.32
MCC0.57±0.050.56±0.15
Duration (Hours)0.39±0.063.39±0.64
Three seasonsRMSE1.12±0.031.10±0.25
Sensitivity0.88±0.020.80±0.21
specificity0.64±0.050.71±0.31
MCC0.54±0.050.55±0.15
Duration (Hours)0.47±0.095.18±1.07
Four seasonsRMSE1.16±0.041.09±0.21
sensitivity0.89±0.020.77±0.23
specificity0.59±0.040.71±0.30
MCC0.50±0.040.54±0.12
Duration (Hours)0.52±0.065.36±1.17

Bi-LSTM: Bidirectional long short-term model. SVM: support vector machine.

Root mean square error (RMSE) =; Sensitivity = ; Specificity = ; Matthews correlation coefficient (MCC) = .

Discussion & Conclusion

Studies have found that higher Hba1c is linked with increased risk of complications in DM patients 3. A physician-pharmacist collaboration is useful for OHA adjustment to manage Hba1c owing to physician knowledge and experience 32. However, it is a challenge to leverage the knowledge of these experts. The current study found that Bi-LSTM models performed better than SVM models for a CDSS to support physicians' decision-making related to OHA adjustment for DM patients. Many CDSS and classification models have used SVM and other artificial intelligence technologies 18, 33-36. We also found that increasing the number of seasons used in the prediction negatively impacted accuracy and RMSE. Physicians reference the most recent Hba1c value to adjust OHA dosage and may choose to maintain the dosage if the Hba1c value is only slightly higher than 7% for the first time. Thus, it is not necessary to reference three or more seasons of Hba1c data, as validated by our experimental results. We calculated the importance of these drugs and used RF to translate this information into the CDSS 27-29, 33, 35. The most important drug was metformin, which is recommended as first-line treatment OHA in various situations for DM patients. This drug improves lipids and inflammatory markers and reduces cardiovascular events, but may be contraindicated for patients with mild to moderate chronic kidney disease. Recent research indicates that metformin requires caution in these kinds of DM patients 5. Sulfonylureas is an important DM drug. We found that the glimepiride (2nd) glibenclamide (5th), and gliclazide (6th) are also important OHA for DM patients 2, 4. Acarbose was also found to have some gastrointestinal adverse effects, which were similar to metformin 6. Pioglitazone is an important DM drug that may reduce Hba1c and improve both metabolic syndrome and nonalcoholic fatty liver disease/nonalcoholic steatohepatitis 7. It also has a side effect of weight loss for some patients, which is sometimes treated as a benefit. Glimepiride, pioglitazone, and vildagliptin were all combined with metformin (Table 2). This study has some limitations and areas for extension. Yanase et al. reported that low HbA1c is linked with frailty and suspected malnutrition in elderly type 2 DM patients 37. Around 25% of cases in this study had Hba1c < 6.5%, indicating that the DM control was too strict for some patients, potentially leading to malnutrition or hypoglycemia. Our CDSS defined “good control” as Hba1c ≤ 7. Although our approach worked well, future versions could be enhanced by considering 6.5 ≤ Hba1c ≤ 7.
  31 in total

1.  Decision model for acute appendicitis treatment with decision tree technology--a modification of the Alvarado scoring system.

Authors:  Hsien-Wei Ting; Jing-Tang Wu; Chien-Lung Chan; Shoei-Loong Lin; Min-Hsiung Chen
Journal:  J Chin Med Assoc       Date:  2010-08       Impact factor: 2.743

Review 2.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

Review 3.  Deep learning with convolutional neural network in radiology.

Authors:  Koichiro Yasaka; Hiroyuki Akai; Akira Kunimatsu; Shigeru Kiryu; Osamu Abe
Journal:  Jpn J Radiol       Date:  2018-03-01       Impact factor: 2.374

Review 4.  A comparative review on sleep stage classification methods in patients and healthy individuals.

Authors:  Reza Boostani; Foroozan Karimzadeh; Mohammad Nami
Journal:  Comput Methods Programs Biomed       Date:  2016-12-10       Impact factor: 5.428

5.  Impact of electronic health record clinical decision support on diabetes care: a randomized trial.

Authors:  Patrick J O'Connor; Joann M Sperl-Hillen; William A Rush; Paul E Johnson; Gerald H Amundson; Stephen E Asche; Heidi L Ekstrom; Todd P Gilmer
Journal:  Ann Fam Med       Date:  2011 Jan-Feb       Impact factor: 5.166

Review 6.  Diabetes Mellitus Review.

Authors:  Meg Blair
Journal:  Urol Nurs       Date:  2016 Jan-Feb

Review 7.  Comparison of glucose lowering effect of metformin and acarbose in type 2 diabetes mellitus: a meta-analysis.

Authors:  Shuyan Gu; Jihao Shi; Zhiliu Tang; Monika Sawhney; Huimei Hu; Lizheng Shi; Vivian Fonseca; Hengjin Dong
Journal:  PLoS One       Date:  2015-05-11       Impact factor: 3.240

8.  Clinical Decision Support Systems for Drug Allergy Checking: Systematic Review.

Authors:  Laura Légat; Sven Van Laere; Marc Nyssen; Stephane Steurbaut; Alain G Dupont; Pieter Cornu
Journal:  J Med Internet Res       Date:  2018-09-07       Impact factor: 5.428

9.  Random forests for feature selection in QSPR Models - an application for predicting standard enthalpy of formation of hydrocarbons.

Authors:  Ana L Teixeira; João P Leal; Andre O Falcao
Journal:  J Cheminform       Date:  2013-02-11       Impact factor: 5.514

10.  Exploration of Medical Trajectories of Stroke Patients Based on Group-Based Trajectory Modeling.

Authors:  Ting-Ying Chien; Mei-Lien Lee; Wan-Ling Wu; Hsien-Wei Ting
Journal:  Int J Environ Res Public Health       Date:  2019-09-18       Impact factor: 3.390

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.