| Literature DB >> 30256722 |
Irene Dankwa-Mullan1, Marc Rivo2, Marisol Sepulveda3, Yoonyoung Park4, Jane Snowdon5, Kyu Rhee6.
Abstract
An estimated 425 million people globally have diabetes, accounting for 12% of the world's health expenditures, and yet 1 in 2 persons remain undiagnosed and untreated. Applications of artificial intelligence (AI) and cognitive computing offer promise in diabetes care. The purpose of this article is to better understand what AI advances may be relevant today to persons with diabetes (PWDs), their clinicians, family, and caregivers. The authors conducted a predefined, online PubMed search of publicly available sources of information from 2009 onward using the search terms "diabetes" and "artificial intelligence." The study included clinically-relevant, high-impact articles, and excluded articles whose purpose was technical in nature. A total of 450 published diabetes and AI articles met the inclusion criteria. The studies represent a diverse and complex set of innovative approaches that aim to transform diabetes care in 4 main areas: automated retinal screening, clinical decision support, predictive population risk stratification, and patient self-management tools. Many of these new AI-powered retinal imaging systems, predictive modeling programs, glucose sensors, insulin pumps, smartphone applications, and other decision-support aids are on the market today with more on the way. AI applications have the potential to transform diabetes care and help millions of PWDs to achieve better blood glucose control, reduce hypoglycemic episodes, and reduce diabetes comorbidities and complications. AI applications offer greater accuracy, efficiency, ease of use, and satisfaction for PWDs, their clinicians, family, and caregivers.Entities:
Keywords: artificial intelligence; artificial pancreas; cognitive computing; diabetes care; glucose monitoring; retinal imaging
Mesh:
Year: 2018 PMID: 30256722 PMCID: PMC6555175 DOI: 10.1089/pop.2018.0129
Source DB: PubMed Journal: Popul Health Manag ISSN: 1942-7891 Impact factor: 2.459
Categorization of Artificial Intelligence and Diabetes Care
| Automated Retinal Screening | 96 | Detection of diabetic retinopathy, maculopathy, exudates, and other abnormalities from normal findings |
| Clinical Decision Support | 126 | Detection and monitoring of diabetes and comorbidities such as neuropathy, nephropathy and wounds |
| Predictive Population Risk Stratification | 135 | Identification of diabetes subpopulations at higher risk for complications, hospitalization, and readmissions |
| Patient Self-Management Tools | 94 | AI-improved glucose sensors, artificial pancreas, activity and dietary tracking devices |
| TOTAL | 450 |
AI, artificial intelligence.
Common Artificial Intelligence Approaches Used in Diabetes Care
| Multilayer perceptron | Composed of neurons in input layer, output layer, and multiple hidden layers. Neurons in each layer are connected to all neurons in the next layer, making each layer fully connected to the next. | Can model complex nonlinear relationship | Greater number of parameters have to be estimated without convolution | Prediction models, patient self-management tools |
| Convolutional neural network (CNN) | Composed of multiple layers of neurons with the convolution layer having neurons that look at small patches of the input image at a time, like a filter, and are convolved across the whole input image and share parameters across the image. | Can model complex nonlinear relationship | Require a large amount of data to train | Retinal screening |
| Random forest | Creates an ensemble of decision trees | Easy to fit, generally produces good performance | Can be slow in prediction | Retinal screening, decision support, prediction models, patient self-management tools |
| Fuzzy logic/fuzzy system | Provides a probability value between 0 and 1 rather than deterministic decision (0 or 1) for membership in a certain class | Resembles human reasoning | Requires an expert curation of rules | Retinal screening, decision support, sensors and artifical pancreas |
| Support vector machine (SVM) | Classification method for binary outcomes (not often used for multiclass problems, but techniques for multiclass SVM exist) | Performs well in nonlinear decision boundaries | Does not scale well to large data | Retinal screening, decision support, prediction models, patient self-management tools |
| Logistic regression | Classification method for binary outcome | Easy to fit, efficient, and scalable | Only binary classification | Prediction models |
| Natural language processing | Computational tools and methods to process, analyze, and perform inference of human languages | Critical in building intelligent machine and human–computer interactions | Usually requires a large amount of human-annotated records to train | Prediction models |
| K-nearest neighbors algorithm | Categorizes input data into several classes using its k nearest neighbors | Does not make assumptions about underlying distribution | Computationally intensive | Retinal screening, decision support, prediction models, patient self-management tools |
Summary of Selected Key Diabetes Artificial Intellegence Studies and Description of Outcomes
| Gulshan V. 2016[ | Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs | Deep CNN | Data set: 128,175 retinal images | Test data: EyePACS-1 data set (n = 9963 images from 4997 patients) | Sensitivity: 97.5% for EyePACS-1, 96.1% for Messidor-2 | Deep machine learning algorithm had high sensitivity and specificity for detecting referable DR. |
| Rahim S. 2014[ | Detection of Diabetic Retinopathy and Maculopathy Using Fuzzy Image Processing | Fuzzy image processing, ML classifiers (1-nearest neighbour, NB, SVM) | Dataset: public data sets (DIARETDB0, DIARETDB1, MESSIDOR, DRIVE, STARE, REVIEW, ROC) | Test data: remaining 10% data | For k-NN, polynomial SVM, RBF SVM, and NB, respectively | Fuzzy image processing together with the retinal structure extraction in DR screening can help produce a more reliable and efficient screening system |
| Lam C. 2018[ | Retinal Lesion Detection With Deep Learning Using Image Patches | CNN (GoogLeNet) | Data set: manually created image patches from public image data set (Kaggle retinopathy data subset, n = 243) | Test data: public image data set (eOphta) (n = 463) | In validation using the patch images: | Regionally trained CNNs can detect and distinguish between subtle pathologic lesions with only a few hundred training examples per lesion. |
| Keel S. 2018[ | Feasibility and Patient Acceptability of a Novel Artificial Intelligence-based Screening Model for Diabetic Retinopathy at Endocrinology Outpatient Services | Deep CNN (Inception v3) | Data set: public data set (LabelMe, n = 66,790) | Test data: data from 96 participants who agreed to receive both retinal screening approaches and complete a questionnaire | Sensitivity: 92.3% | AI-based DR screening appears to be feasible, accurate, and well accepted by patients attending endocrinology outpatient settings. |
| Han L. 2015[ | Rule Extraction from Support Vector Machines Using Ensemble Learning Approach: An Application for Diagnosis of Diabetes | Ensemble learning using SVM and RF rule extraction | Data set: China Health and Nutrition Survey data (n = 7913, 646 diabetic) | Test data: remaining 10% of data | For positive cases: | The proposed hybrid system can provide a tool for the diagnosis of diabetes from population-based nutritional surveys, and it supports a second opinion for lay users |
| Shankaracharya. 2012[ | Computational Intelligence-based Diagnosis Tool for the Detection of Prediabetes and Type 2 Diabetes in India | Mixture of expert system based on MLP | Data set: 1415 subjects (947 diabetic) | Test data: 311/1415 | Best result achieved | The proposed tool for identifying individuals with prediabetes, diabetes, and nondiabetes is highly accurate and may be used for large-scale diabetic screenings. |
| Wei WQ. 2010[ | A High Throughput Semantic Concept Frequency Based Approach for Patient Identification: A Case Study Using Type 2 Diabetes Mellitus Clinical Notes | NLP, SVM, and semantic knowledge | Data set: 57,707 electronic notes from 1600 DM patients and 1600 control patients in Mayo Clinic | No separate test data were specified | F-score for cases: 0.956 | The proposed approach is accurate and responsive to the urgent need to develop a general automatic approach for diabetic patient case-finding and characterization. |
| Corey KE. 2016[ | Development and Validation of an Algorithm to Identify Nonalcoholic Fatty Liver Disease (NAFLD) in the Electronic Medical Record | LR with adaptive LASSO | Data set: electronic medical records from 620 patient randomly selected from the high-risk patients in Partners Healthcare | Test data: randomly selected 611 high-risk patients identified by classification algorithm | Specificity; 91% | The NAFLD classification algorithm is superior to ICD-9 billing data alone. This approach is simple to develop, deploy, and can be applied across different institutions to create EMR-based cohorts of individuals with NAFLD. |
| Neves J. 2015[ | A Soft Computing Approach to Kidney Diseases Evaluation | Logic Program-ming, ANN | Data set: data from 558 total patients (175 diagnosed with CKD) | Test data: remaining 1/3 of data | ANN performance in test data set | The proposed model showed good performance in predicting the likelihood of a CKD diagnosis |
| Rau HH. 2016[ | Development of a Web-based Liver Cancer Prediction Model for Type II Diabetes Patients by Using an Artificial Neural Network | ANN, LR | Data set: data from 2060 diabetic patients in the National Health Insurance Research Database (NHIRD) of Taiwan | Test data: 618/2060 | ANN performance was superior to that of LR for predicting diabetics who will be diagnosed with liver cancer in the next 6 years. | Data mining systems enable clinicians to predict those diabetics at greater risk for the development of liver cancer. |
| Vyas R. 2016[ | Building and Analysis of Protein-Protein Interactions Related to Diabetes Mellitus Using Support Vector Machine, Biomedical Text Mining and Network Analysis | SVM | Training data: positive and negative proteins from PDB and UniProt databases (n = 2653) | Test data: 129 proteins extracted via text mining from literature | Accuracy: 78.20% | This integrated approach has a potential to identify disease-related proteins, functional annotation, and other proteomics studies. |
| López B. 2018[ | Single Nucleotide Polymorphism (SNP) Relevance Learning with Random Forests for Type 2 Diabetes Risk Prediction | Random forest, k-NN | Data set: data from 677 subjects (248 diabetic), each containing 96 SNPs regarding type 2 diabetes | Test data: 10-fold cross-validation used. No separate test data were specified | For risk prediction | RF is a useful method for learning predictive models to help physicians to identify the relevant SNPs associated with and predictive of type 2 diabetes. |
| Lo-Ciganic WH. 2015[ | Using Machine Learning to Examine Medication Adherence Thresholds and Risk of Hospitalization | Random survival forests, survival trees models | Data set: 33,130 non-dual-eligible Medicaid enrollees with type 2 diabetes | Test data: remaining 10% data | The adherence thresholds most discriminating for risk of all-cause hospitalization varied from 46% to 94% - the widely used 80% adherence threshold is not optimal for predicting risk of hospitalization | Machine learning approaches hold promise as an intuitive and powerful approach for customizing interventions in medication adherence in diabetics and optimizing health outcomes. |
| Shu T. 2017[ | An Extensive Analysis of Various Texture Feature Extractors to Detect Diabetes Mellitus Using Facial Specific Regions | k-NN, SVM with 8 image extractor methods | Data set: 284 diabetes mellitus and 231 healthy samples | Test data: 10-fold cross-validation used. No separate test data were specified | The best texture feature extractor, Image Gray-scale Histogram (bin n = 256), combined with SVM | Compared with traditional diagnostic methods that rely on blood samples, the Image Gray-scale Histogram is a highly accurate, non-invasive way to diagnose diabetes using facial and tongue features. |
| Katigari KM. 2017[ | Fuzzy Expert System for Diagnosing Diabetic Neuropathy | Fuzzy expert system | Data set: diagnostic parameters and their importance developed by specialists used to develop fuzzy expert system | Test data: 213 medical records of patients diagnosed with diabetic neuropathy | For diagnosis and severity of diabetic neuropathy | The fuzzy expert system can help diagnose and determine the severity of diabetic neuropathy. |
| Wang L. 2017[ | Area Determination of Diabetic Foot Ulcer Images Using a Cascaded Two-Stage SVM-Based Classification | Two-stage SVM with simple linear iterative clustering and conditional random fields | Data set: 100 foot ulcer images from 15 patients | Test data: cross-validation used. No separate test data were specified | Sensitivity: 73.3% | Computer-based systems provide high performance rates for measuring diabetic wounds and monitoring wound healing status, and are sufficiently efficient for smartphone-based image analysis. |
| Mauseth R. 2015[ | Testing of an Artificial Pancreas System With Pizza and Exercise Leads to Improvements in the System's Fuzzy Logic Controller | Fuzzy Logic Controller systems (FLC) | N/A | Total 17 meal, 13 exercise studies in 10 subjects with type 1 diabetes (T1D) | FLC v2.1 showed improvements in mean blood glucose after pizza consumption, after exercise testing, in reducing hyperglycemia, and percentage time spent in euglycemic range | Stress testing the AP system followed by adjustments to the dosing matrix significantly improved FLC performance when retested for mean blood glucose, high blood glucose, and normal blood glucose |
| Ling SH. 2012[ | Natural Occurrence of Nocturnal Hypoglycemia Detection Using Hybrid Particle Swarm Optimized Fuzzy Reasoning Model | Fuzzy reasoning model with hybrid particle swarm optimization with wavelet mutation | Data set: 16 type 1 diabetic patients | Test data: remaining 269 data points from 8/16 patients | Advanced noctural hypoglycemic episode detection | The proposed system offers a noninvasive means to detect hypoglycemic episodes in type 1 diabetic patients. |
| Herrero P. 2015[ | Advanced Insulin Bolus Advisor Based on Run-To-Run Control and Case-Based Reasoning | Combination of R2R and CBR | N/A | In silico testing using commercial type 1 diabetes simulator generated 1-month data for 10 adults and 10 adolescents scenarios | Using CBR(R2R), mean blood glucose improved in both adult and adolescent populations and hypoglycemia was completely eliminated (R2R alone was not able to do it in the adolescent population) | The proposed smartphone system keeps the simplicity of a standard bolus calculator while enhancing its performance by providing more adaptability and flexibility. |
| DeJournett L. 2016[ | In Silico Testing of an Artificial-Intelligence-Based Artificial Pancreas Designed for Use in the Intensive Care Unit Setting | Knowledge-based system | N/A | In silico analysis: 126 000 unique 5-day simulations resulting in 107 million glucose values | On average, time in control range was 94.2%, time in range 70–140 mg/dl was 97.8%, time in hyperglycemic range was 2.1%, time in hypoglycemic range was 0.09% | An AI-based closed-loop glucose controller may be able to improve on the results achieved by currently existing ICU-based PID/MPC controllers |
| Zhang W. 2015. [ | “Snap-n-Eat”: Food Recognition and Nutrition Estimation on a Smartphone | SVM | Data set: 2000 food images comprising 15 predefined categories | Test data: 5-fold cross validation | Accuracy: 85% | The proposed smartphone mobile system can recognize food items present on a plate and estimates their calorific and nutrition content, automatically helping diabetic patients make more informed food choice decisions. |
| Cvetković B. 2016[ | Activity Recognition for Diabetic Patients Using a Smartphone | Ensemble of models (SVM, J48, random forest, Jrip, AdaBoost and Bagging algorithms), symbolic rules | Data set: average 11 hours of phone and 7.5 hours of ECG recordings per day for 2 weeks from 9 healthy volunteers | Test data: second week of recordings | Best result achieved by Multi-Classifier Adaptive Training (MCAT) method | Smartphone sensors using machine learning and symbolic reasoning can recognize and quantify high-level lifestyle activities of diabetic patients and help them make more informed activity choices. |
| Wang L. 2015[ | Smartphone-based Wound Assessment System for Patients with Diabetes | Image boundary detection: mean-shift segmentation algorithm | N/A | 30 simulated wound images, 34 actual patient wound images | Visual evaluation for simulated images | The proposed smartphone camera system enables diabetic patients and their caregivers to take a more active role in daily wound care. |
| Rigla M. 2018[ | Gestational Diabetes Management (GDM) Using Smart Mobile Telemedicine | Mobile telemedicine system | NA | 20 patients diagnosed with GDM | Metabolic and perinatal outcomes were similar except for BP, which was lower in patients using the telemedicine system | Artificial-intelligence-augmented telemedicine has been proposed as a helpful tool to facilitate an efficient widespread medical assistance to GDM. |
AI, artificial intelligence; ANN, artificial neural network; AP, artificial pancreas; AUC, area under the curve; BP, blood pressure; CBR, case-based reasoning; CKD, chronic kidney disease; CI, confidence interval; CNN, convolutional neural network; DM, diabetes mellitus; DR, diabetic retinopathy; ECG, electrocardiogram; EMR, electronic medical record; FLC, Fuzzy Logic Controller; GDM, gestational diabetes management; ICD-9, International Classification of Diseases, Ninth Revision; k-NN, k nearest neighbors algorithm; LASSO, least absolute shrinkage and selection operator; LR, logistic regression; MLP, multilayer perceptron; MPC, model predictive control; N/A, not applicable; NA; NAFLD, nonalcoholic fatty liver disease; NB, naïve bayes; NLP, natural language processing; NPV, negative predictive value; PDB, protein databank; PID, proportional integral derivative; PPV, positive predictive value; R2R, run-to-run; RBF, radial basis function; RF, random forest; ROC, receiver operating characteristic; SNP, single nucleotide polymorphism; SVM, support vector machine.