Literature DB >> 31949894

Classification and prediction of diabetes disease using machine learning paradigm.

Md Maniruzzaman1,2, Md Jahanur Rahman2, Benojir Ahammed1, Md Menhazul Abedin1.   

Abstract

BACKGROUND AND OBJECTIVES: Diabetes is a chronic disease characterized by high blood sugar. It may cause many complicated disease like stroke, kidney failure, heart attack, etc. About 422 million people were affected by diabetes disease in worldwide in 2014. The figure will be reached 642 million in 2040. The main objective of this study is to develop a machine learning (ML)-based system for predicting diabetic patients.
MATERIALS AND METHODS: Logistic regression (LR) is used to identify the risk factors for diabetes disease based on p value and odds ratio (OR). We have adopted four classifiers like naïve Bayes (NB), decision tree (DT), Adaboost (AB), and random forest (RF) to predict the diabetic patients. Three types of partition protocols (K2, K5, and K10) have also adopted and repeated these protocols into 20 trails. Performances of these classifiers are evaluated using accuracy (ACC) and area under the curve (AUC).
RESULTS: We have used diabetes dataset, conducted in 2009-2012, derived from the National Health and Nutrition Examination Survey. The dataset consists of 6561 respondents with 657 diabetic and 5904 controls. LR model demonstrates that 7 factors out of 14 as age, education, BMI, systolic BP, diastolic BP, direct cholesterol, and total cholesterol are the risk factors for diabetes. The overall ACC of ML-based system is 90.62%. The combination of LR-based feature selection and RF-based classifier gives 94.25% ACC and 0.95 AUC for K10 protocol.
CONCLUSION: The combination of LR and RF-based classifier performs better. This combination will be very helpful for predicting diabetic patients. © Springer Nature Switzerland AG 2020.

Entities:  

Keywords:  Adaboost; Classification; Decision tree; Diabetes; Machine learning; Naïve Bayes; Random forest

Year:  2020        PMID: 31949894      PMCID: PMC6942113          DOI: 10.1007/s13755-019-0095-z

Source DB:  PubMed          Journal:  Health Inf Sci Syst        ISSN: 2047-2501


  32 in total

1.  IntelliHealth: A medical decision support application using a novel weighted multi-layer classifier ensemble framework.

Authors:  Saba Bashir; Usman Qamar; Farhan Hassan Khan
Journal:  J Biomed Inform       Date:  2015-12-15       Impact factor: 6.317

2.  Neural networks for mining the associations between diseases and symptoms in clinical notes.

Authors:  Setu Shah; Xiao Luo; Saravanan Kanakasabai; Ricardo Tuason; Gregory Klopper
Journal:  Health Inf Sci Syst       Date:  2018-11-28

3.  Comparative approaches for classification of diabetes mellitus data: Machine learning paradigm.

Authors:  Md Maniruzzaman; Nishith Kumar; Md Menhazul Abedin; Md Shaykhul Islam; Harman S Suri; Ayman S El-Baz; Jasjit S Suri
Journal:  Comput Methods Programs Biomed       Date:  2017-09-08       Impact factor: 5.428

Review 4.  Diabetes mellitus statistics on prevalence and mortality: facts and fallacies.

Authors:  Paul Zimmet; K George Alberti; Dianna J Magliano; Peter H Bennett
Journal:  Nat Rev Endocrinol       Date:  2016-07-08       Impact factor: 43.330

5.  Evaluation of variable selection methods for random forests and omics data sets.

Authors:  Frauke Degenhardt; Stephan Seifert; Silke Szymczak
Journal:  Brief Bioinform       Date:  2019-03-22       Impact factor: 11.622

6.  Diagnosis and classification of diabetes mellitus.

Authors: 
Journal:  Diabetes Care       Date:  2010-01       Impact factor: 19.112

7.  Ethiopic maternal care data mining: discovering the factors that affect postnatal care visit in Ethiopia.

Authors:  Geletaw Sahle
Journal:  Health Inf Sci Syst       Date:  2016-05-23

8.  PredicT-ML: a tool for automating machine learning model building with big clinical data.

Authors:  Gang Luo
Journal:  Health Inf Sci Syst       Date:  2016-06-08

9.  Predicting Diabetes Mellitus With Machine Learning Techniques.

Authors:  Quan Zou; Kaiyang Qu; Yamei Luo; Dehui Yin; Ying Ju; Hua Tang
Journal:  Front Genet       Date:  2018-11-06       Impact factor: 4.599

10.  Identification of Potential Type II Diabetes in a Chinese Population with a Sensitive Decision Tree Approach.

Authors:  Dongmei Pei; Chengpu Zhang; Yu Quan; Qiyong Guo
Journal:  J Diabetes Res       Date:  2019-01-22       Impact factor: 4.011

View more
  17 in total

1.  Mapping the spatial distribution of the dengue vector Aedes aegypti and predicting its abundance in northeastern Thailand using machine-learning approach.

Authors:  M S Rahman; Chamsai Pientong; Sumaira Zafar; Tipaya Ekalaksananan; Richard E Paul; Ubydul Haque; Joacim Rocklöv; Hans J Overgaard
Journal:  One Health       Date:  2021-12-04

2.  Identification of phosphorylation site using S-padding strategy based convolutional neural network.

Authors:  Yanjiao Zeng; Dongning Liu; Yang Wang
Journal:  Health Inf Sci Syst       Date:  2022-09-17

3.  Predicting the Risk of Incident Type 2 Diabetes Mellitus in Chinese Elderly Using Machine Learning Techniques.

Authors:  Qing Liu; Miao Zhang; Yifeng He; Lei Zhang; Jingui Zou; Yaqiong Yan; Yan Guo
Journal:  J Pers Med       Date:  2022-05-31

4.  A Novel Approach for Feature Selection and Classification of Diabetes Mellitus: Machine Learning Methods.

Authors:  Roshi Saxena; Sanjay Kumar Sharma; Manali Gupta; G C Sampada
Journal:  Comput Intell Neurosci       Date:  2022-04-15

5.  Identification of Potential Type II Diabetes in a Large-Scale Chinese Population Using a Systematic Machine Learning Framework.

Authors:  Mingyue Xue; Yinxia Su; Chen Li; Shuxia Wang; Hua Yao
Journal:  J Diabetes Res       Date:  2020-09-24       Impact factor: 4.011

Review 6.  A Comprehensive Review of Various Diabetic Prediction Models: A Literature Survey.

Authors:  Roshi Saxena; Sanjay Kumar Sharma; Manali Gupta; G C Sampada
Journal:  J Healthc Eng       Date:  2022-04-12       Impact factor: 3.822

7.  Smoker's characteristics, general health and their perception of smoking in the social environment: a study of smokers in Rajshahi City, Bangladesh.

Authors:  Md Kamruzzaman; Ahammad Hossain; Enamul Kabir
Journal:  Z Gesundh Wiss       Date:  2021-01-06

Review 8.  A Novel Diabetes Healthcare Disease Prediction Framework Using Machine Learning Techniques.

Authors:  Raja Krishnamoorthi; Shubham Joshi; Hatim Z Almarzouki; Piyush Kumar Shukla; Ali Rizwan; C Kalpana; Basant Tiwari
Journal:  J Healthc Eng       Date:  2022-01-11       Impact factor: 2.682

9.  Investigate the risk factors of stunting, wasting, and underweight among under-five Bangladeshi children and its prediction based on machine learning approach.

Authors:  S M Jubaidur Rahman; N A M Faisal Ahmed; Md Menhazul Abedin; Benojir Ahammed; Mohammad Ali; Md Jahanur Rahman; Md Maniruzzaman
Journal:  PLoS One       Date:  2021-06-17       Impact factor: 3.240

10.  Development and validation of a new diabetes index for the risk classification of present and new-onset diabetes: multicohort study.

Authors:  Shinje Moon; Ji-Yong Jang; Yumin Kim; Chang-Myung Oh
Journal:  Sci Rep       Date:  2021-08-03       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.