| Literature DB >> 35434723 |
Ashwini Tuppad1, Shantala Devi Patil1.
Abstract
Type 2 diabetes has recently acquired the status of an epidemic silent killer, though it is non-communicable. There are two main reasons behind this perception of the disease. First, a gradual but exponential growth in the disease prevalence has been witnessed irrespective of age groups, geography or gender. Second, the disease dynamics are very complex in terms of multifactorial risks involved, initial asymptomatic period, different short-term and long-term complications posing serious health threat and related co-morbidities. Majority of its risk factors are lifestyle habits like physical inactivity, lack of exercise, high body mass index (BMI), poor diet, smoking except some inevitable ones like family history of diabetes, ethnic predisposition, ageing etc. Nowadays, machine learning (ML) is increasingly being applied for alleviation of diabetes health burden and many research works have been proposed in the literature to offer clinical decision support in different application areas as well. In this paper, we present a review of such efforts for the prevention and management of type 2 diabetes. Firstly, we present the medical gaps in diabetes knowledge base, guidelines and medical practice identified from relevant articles and highlight those that can be addressed by ML. Further, we review the ML research works in three different application areas namely-(1) risk assessment (statistical risk scores and ML-based risk models), (2) diagnosis (using non-invasive and invasive features), (3) prognosis (from normoglycemia/prior morbidity to incident diabetes and prognosis of incident diabetes to related complications). We discuss and summarize the shortcomings or gaps in the existing ML methodologies for diabetes to be addressed in future. This review provides the breadth of ML predictive modeling applications for diabetes while highlighting the medical and technological gaps as well as various aspects involved in ML-based diabetes clinical decision support.Entities:
Keywords: Clinical decision support; Diabetes prediction; Machine learning; Pre-diabetes; Risk assessment
Year: 2022 PMID: 35434723 PMCID: PMC9006199 DOI: 10.1007/s43674-022-00034-y
Source DB: PubMed Journal: Adv Comput Intell ISSN: 2730-7794
American Diabetes Association (ADA) diagnostic criteria for pre-diabetes and diabetes based on FPG, OGTT and HbA1C blood tests
| Diagnostic test | Pre-diabetes | Diabetes |
|---|---|---|
| Fasting plasma glucose (FPG) | 100 mg/dl ≤ FPG < 126 mg/dl (impaired fasting glucose) | FPG ≥ 126 mg/dl |
| Oral glucose tolerance test (OGTT) | 140 mg/dl ≤ 2 h BG < 200 mg/dl (impaired glucose tolerance) | 2 h BG ≥ 200 mg/dl |
| HbA1C | 5.7% ≤ HbA1C < 6.5% | HbA1C ≥ 6.5% |
Fig. 1Flow diagram of the process adopted for the search and scrutiny of research articles for the review
Fig. 2General architecture of ML-based clinical decision support for disease prevention and management
Fig. 3Classification of ML applications in clinical decision support
Popular diabetes risk scores proposed in the literature with their characteristics
| Authors | Risk score | Study population | Risk factors | Performance |
|---|---|---|---|---|
| Mohan et al. ( | Indian Diabetes Risk Score | CURES population (size = 2350) | Age, abdominal obesity, family history of diabetes and physical activity | Sensitivity = 72.5% Specificity = 60.1% |
| Aekplakorn et al. ( | Australian Diabetes Risk Score | AusDiab dataset (size = 6060) | Age, sex, ethnicity, parental history of diabetes, history of high blood glucose, use of antihypertensive medications, smoking, physical inactivity and waist circumference | Sensitivity = 74% Specificity = 67.7% |
| Schulze et al. ( | Thai Risk Score | Thai cohort (size = 2677) | Age, sex, BMI, waist circumference, history of hypertension, history of diabetes in parents or siblings | Sensitivity = 77% Specificity = 60% |
| Hippisley-Cox et al. ( | QDScore (England and Wales) | QResearch Database Version 19 (size = 2,540,753) | Age, sex, ethnicity, body mass index, smoking status, family history of diabetes, Townsend deprivation score (socioeconomic indicator), treated hypertension, cardiovascular disease, and current use of corticosteroids | Variance = 51.53% in women, 48.16% in men Discrimination, |
| Chen et al. ( | Finnish Diabetes Risk Score | Drawn randomly from National Population Register (size = 4746) | Age, BMI, waist circumference, use of blood pressure medication, History of high blood glucose, physical activity, daily consumption of vegetables, fruits, or berries | Sensitivity = 78% Specificity = 77% |
| Gray et al. ( | German Diabetes Risk Score | [EPIC]-Potsdam study (size = 25,167) | Age, waist circumference, Height, history of Hypertension, Intake of red meat, whole-grain bread, coffee, alcohol consumption, physical activity, smoking | Sensitivity = 82.8% Specificity = 72.2% |
| Pei et al. ( | Leicester Diabetes Risk Score | ADDITION-Leicester study (size = 6186) | Age, sex, ethnicity, BMI, waist circumference, first degree family history of type 2 diabetes, antihypertensive medication or high blood pressure diagnosis | Sensitivity = 72.1% Specificity = 54.1% |
Summary of ML methodologies for diabetes diagnosis using non-invasive features
| Authors | Non-invasive data source | Modality/device used | Dataset size | Features used | ML classifier | Performance |
|---|---|---|---|---|---|---|
| Pei et al. ( | Physical Examination Reports | 4205 | Age, gender, body mass index (BMI), hypertension, history of cardiovascular disease or stroke, family history of diabetes, physical activity, work stress, and salty food preference | J48 decision tree, AdaBoostM1, Sequential Minimal Optimization, Bayes Net, Naïve Bayes | Accuracy—0.9503 by J48 decision tree | |
| Carter et al. ( | Toenails | Inductively coupled plasma mass spectrometry | 40 | Concentration of 22 elements found in toenails, age, gender, smoking history | Random Forests | AUC—0.90 |
| Nirala et al. ( | Toenails | Photoplethysmography | 141 | Derivates of PPG signal waveforms | Support Vector Machine | Accuracy—97.87% |
| Zhang et al. ( | Tongue images | TDA-1 digital tongue instrument | 296 | Texture and color of tongue body and coating | Support Vector Machine | Accuracy—78.77% |
| Samant and Agarwal ( | Iris images | Through I-SCAN-2 Iris Scanner device | 338 | Statistical, textural and discrete wavelet transforms | Random Forests | Accuracy—89.63% |
| Zhang et al. ( | Retrospective Cohort data collected by survey | – | 37,730 | Demographic, medical and family history, anthropometric, lifestyle and diet predictors | Joint bagging boosting with stacking algorithm | AUC—0.885 |
| Spänig et al. ( | Anamnesis and non-invasive physiological sensors | Speech synthesis and recognition system, ultrasonic sensor | 4814 | Age, gender, BMI, height, weight, waist circumference | Deep neural network and Support Vector Machine | AUC—0.84 and 0.703 by DNN (with and without HbA1C, respectively) |
| Farran et al. ( | Questionnaire data | – | 1837 | Age, sex, BMI, pre-existing hypertension, family history of hypertension | Logistic Regression, k-Nearest Neighbor (kNN), Support Vector Machine | AUC—0.83, 0.82 and 0.79 by SVM (for 3-, 5-, 7-year period respectively) |
Summary of ML methodologies for diabetes diagnosis using invasive, lab-based clinical features
| Authors | Dataset size | Features | Feature selection technique | ML classifier | Performance |
|---|---|---|---|---|---|
| Zheng et al. ( | 300 | Demographic, self-reported symptoms and complications, diagnosis notes, Medication, Lab test parameters, disease diagnosis codes | By feature construction and correlation-based feature summarization | k-Nearest Neighbor, Naïve Bayes, J48, Random Forests, Support Vector Machine and Logistic Regression | Accuracy—0.99 by LR |
| Lee and Kim ( | 11,937 | Age, height, weight, BMI, FPG, SBP, DBP, TG, weight, patient phenotypes, anthropometric indices | – | Naïve Bayes and Logistic Regression | Accuracy—0.661, 0.735 by LR in men and women, respectively |
| Dinh et al. ( | 21,091 | Demographic, physical examination, dietary intake, lab test and questionnaire-based features | XGBoost Ensemble classifier | Random Forest, gradient boosted tree, logistic regression, support vector machine and their weighted ensemble model | Accuracy—95.7, 84.4% by XGBoost for diabetes and pre-diabetes, respectively |
| Bernardini et al. ( | 2433 | Demographic (age, gender), monitoring (BP), clinical features (pathologies, exemptions, exam and drug prescriptions) | – | Sparse balanced support vector machine | AUC—81.43% |
| Hayashi and Yukita ( | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | – | J48graft decision tree with recursive rule extraction algorithm | Accuracy—83.83% |
| Severeyn et al. ( | 188 | Insulin, blood glucose values obtained from 5-sample OGTT | – | K-means clustering | AUC—0.76 |
| Abbas et al. ( | 1492 | Demographic features (age), insulin and plasma glucose before and after OGTT | Minimum redundancy maximum relevance | Support Vector Machine | Mean accuracy—96.80% |
| Bernardini et al. ( | 2433 | Demographic (age, gender), monitoring (systolic and diastolic BP), laboratory features | – | Multiple Instance Boosting Algorithm | Accuracy—0.83, 0.70 with and without triglyceride-glucose index, respectively |
| Casanova et al. ( | 3633 | Demographic, anthropometric, blood markers, medical history, ECG data | – | Random Forests | AUC—0.82 |
| Akula et al. ( | 9948 | Age, gender, systolic and diastolic blood pressure, height, weight, BMI | – | Weighted average ensemble model | Accuracy—85% |
| Rahman et al. ( | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | Boruta wrapper | Convolutional long short-term memory | Accuracy—97.26% |
| Nguyen et al. ( | 9948 | Demographic (age, sex), physical examination (body mass index (BMI) and blood pressure), physical, diagnosis codes, medication, lab variables | Manual feature engineering | Wide and deep learning ensemble model | Accuracy—84.28%, AUC—84.13% |
| Deng et al. ( | 40 | Demographics (age, gender), body compositions (body mass, height, body mass index), hormone levels, blood glucose data | – | Recurrent neural networks, gated convolutional neural networks, self-attention networks with transfer learning | Accuracy—89% AUC—0.86 |
| Kumar et al. (2020) | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | Extremely randomized trees | Deep neural network model | Accuracy—98.16% |
| Nilashi et al. ( | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | – | Neural network | Accuracy—92.28% |
| Zhu et al. ( | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | – | Principal component analysis + k-means + Logistic Regression | Accuracy—97.40% |
| Han et al. ( | 7913 | HbA1C, triglycerides, uric acid, high density lipoprotein, age, diastolic BP, cholesterol, waist, weight | Chi-square test, information gain-based filter technique, Random Forest | Support Vector Machine + Random Forests | Weighted mean precision and recall—94.2, 93.9%, respectively |
| Roopa and Asha ( | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | PCA | Principal component analysis based Linear Regression Model | Accuracy—82.1% |
| Alharbi and Alghahtani ( | 200 | Pregnancies, fasting plasma glucose, 1 h and 2 h post glucose load in OGTT, diastolic BP, HbA1C, BMI, age | Genetic algorithm | Extreme learning machine neural network | Accuracy—97.5% |
| Wang et al. ( | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | – | Random Forests | Accuracy—87.1% |
| Mahboob Alam et al. ( | 768 | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function | PCA | K-means clustering, Random Forests, Artificial Neural Network for classification, Apriori algorithm for rule generation | Accuracy—75.7% |
Summary of ML methodologies for prognosis of normoglycemia/prior morbidity to incident type 2 diabetes
| Prognosis from normoglycemia/prior morbidity to T2D incidence | ||||||
|---|---|---|---|---|---|---|
| Authors | Prognosis modeling | Dataset size | Features | Feature selection technique | ML classifier | Performance |
| Yokota et al. ( | Pre-diabetes to diabetes | 2105 | Sex, age, BMI, smoking habit, physical activity, family history of diabetes up to third-degree relatives, systolic blood pressure, fasting and 1 and 2 h post-load plasma glucose during 75 g OGTT, HbA1c, high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, triglycerides and alanine aminotransferase | Stepwise backward elimination | Multivariate logistic regression | AUC—0.80 |
| Perveen et al. ( | Metabolic syndrome to diabetes | 4403 | Sex, systolic blood pressure, diastolic blood pressure, high density lipoprotein, triglycerides, BMI, fasting blood sugar | – | Decision Tree and Naïve Bayes | AUC—79% |
| Anderson et al. ( | Normoglycemia to pre-diabetes and diabetes | 24,331 | Demographic, lab features, clinical observations, ICD-9 disease diagnosis codes, prescriptions | – | Ensemble model from reverse engineering and forward simulation platform | AUC—0.76 (normoglycemia to T2D and 0.70 (normoglycemia to pre-diabetes) |
| Cahn et al. ( | Pre-diabetes to diabetes | 121,792 | Demographics, diagnoses READ codes, medical interventions, hospitalizations, medications, laboratory test results | LASSO on Logistic Regression model | Gradient Boosted Trees | AUC—0.865 (internal validation); 0.907, 0.925 (in external validation datasets) |
| Perveen et al. ( | Normoglycemia to T2D | 1981 | Age, gender, systolic and diastolic blood pressure, fasting blood glucose, triglycerides, BMI, high density lipoprotein, HbA1C | – | Gaussian hidden Markov model | AUC—86.9% |
| Choi et al. | Normoglycemia (with cardiovascular risk) to T2D | 8454 | Sex, age, BMI, history of particular diseases, blood test results, pharmaceutical treatments for cardiovascular disease and T2DM | Information gain | Logistic Regression, linear and quadratic discriminant analysis, k-Nearest Neighbor | AUC—0.78 by Logistic Regression |
| Garcia-Carretero et al. | Obese, hypertensive state to T2D | 1647 | Demographic, clinical and laboratory variables including those related to cardiovascular risk | – | k-Nearest Neighbor | Accuracy—0.977, AUC—0.89 |
Summary of ML methodologies for prognosis of type 2 diabetes to related complications
| Prognosis of T2D to related complications | ||||||
|---|---|---|---|---|---|---|
| Paper | Prognosis modeling | Dataset size | Features | Feature selection technique | ML classifier | Performance |
| Dalakleidi et al. ( | T2D to fatal/non-fatal cardiovascular disease incidence | 768 (Pima dataset) 560 (Hippokrateion dataset) | Number of pregnancies, plasma glucose after 2-h OGTT, diastolic BP, triceps skin fold thickness, serum insulin, BMI, age, diabetes pedigree function (Pima dataset) Age, diabetes duration, BMI, systolic and diastolic blood pressure, HbA1C, blood glucose, total cholesterol, triglycerides, high-density and low-density lipoprotein (Hippokrateion dataset) | – | Ensembles of artificial neural networks | Accuracy—92.86%, AUC—0.739 |
| Dagliati et al. ( | Diabetic retinopathy, neuropathy and nephropathy | 943 | Age, gender, time to diagnosis, BMI, HbA1C, lipid profile, smoking habit, antihypertensive therapy | Stepwise feature selection based on Akaike information criterion | Logistic Regression | Accuracy—0.77, 0.69, 0.74, 0.80, 0.79 (retinopathy, nephropathy, neuropathy, respectively) AUC—0.80, 0.734, 0.799 (retinopathy, nephropathy, neuropathy, respectively) |
| Zarkogianni et al. ( | T2D to fatal/non-fatal cardiovascular disease incidence | 560 | Age, BMI, HbA1C, diabetes duration, pulse pressure, fasting blood glucose, total cholesterol, high density lipoprotein cholesterol, triglycerides, | – | Ensemble models based on hybrid wavelet neural network (HWNN) and self organizing maps (SOM) | AUC—71.48 by hybrid ensemble |
| Kowsher et al. ( | T2D to complications and treatment prediction | 9483 | Fasting and 2 h post load blood glucose, BMI, duration of diabetes, age, sex, blood pressure, high cholesterols, heart diseases, kidney diseases, medications | – | Deep neural network (optimal classifier) along with a number of ML algorithms | Accuracy—95.14% |
| Aminian et al. ( | T2D related coronary artery events, heart failure, nephropathy and all-cause mortality with or without metabolic surgery | 2287 | Sex, age, BMI, BMI category, race, smoking status, medical history, HbA1C, systolic and diastolic BP, estimated glomerular filtration rate (eGFR), triglycerides, medication history | – | Multivariable time-to-event regression and Random Forest | AUC—0.81 (all-cause mortality), 0.67 (coronary artery events), 0.75 (heart failure), 0.76 (nephropathy) by Random Forest model |
| Allen et al. ( | T2D to 5-year risk of multi-stage chronic kidney disease (CKD) | 62,994 | Age, sex, BMI, systolic and diastolic blood pressure, blood urea nitrogen, creatinine, eGFR, high- and low-density lipoprotein, white cell count, medical history (acute kidney injury, chronic heart failure, reported smoking, alcoholism) | – | Gradient Boosted Trees and Random Forest | AUC—0.75 for any stage of CKD (internal validation), > 0.82 for advanced stages (external validation) |
| Sudharsan et al. ( | T2D to hypoglycemia | 56,000 blood glucose data points | Self-monitored blood glucose readings, timestamp of reading, medication administration and dosage | – | Random Forest, support vector machine, k-Nearest Neighbor, Naïve Bayes | Sensitivity—92%, Specificity—90% with blood glucose readings and medication |
Major gap areas and underlying challenges in existing ML clinical decision support for type 2 diabetes prevention and management
| Gap areas | Underlying challenges |
|---|---|
| 1. Medical gaps (diabetes knowledge base, guidelines and practice) | Etiological and pathophysiological causes of diabetes unknown |
| Lack of knowledge about pre-diabetes risk factors | |
| Selection of intervention candidates | |
| Detection of early onset complications in pre-diabetes | |
| 2. Technological gaps (machine learning predictive modeling) | Data availability and pre-processing issues |
| Patient-wise risk factor ranking | |
| Accuracy v/s interpretability tradeoff | |
| External validation of the model | |
| Practical usability and impact of the model |