Md Maniruzzaman1, Nishith Kumar2, Md Menhazul Abedin3, Md Shaykhul Islam4, Harman S Suri5, Ayman S El-Baz6, Jasjit S Suri7. 1. Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh. Electronic address: monir.stat91@gmail.com. 2. Department of Statistics, Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Gopalganj, Bangladesh. Electronic address: nk.bru09@gmail.com. 3. Statistics Discipline, Khulna University, Khulna, Bangladesh. Electronic address: menhaz70@gmail.com. 4. Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh. Electronic address: shaykhulrustat@gmail.com. 5. Diabetic Care Division, AtheroPoint LLC, Roseville, CA, USA. Electronic address: hsingh574@gmail.com. 6. Department of Bioengineering, J.B Speed School of Engineering, University of Louisville, Louisville, KY, USA. Electronic address: ayman_elbazz@yahoo.com. 7. Diabetic Care Division, AtheroPoint LLC, Roseville, CA, USA; Department of Electrical Engineering, Idaho State University (Affl.), Idaho, USA. Electronic address: jasjit.suri@atheropoint.com.
Abstract
BACKGROUND AND OBJECTIVE: Diabetes is a silent killer. The main cause of this disease is the presence of excessive amounts of metabolites such as glucose. There were about 387 million diabetic people all over the world in 2014. The financial burden of this disease has been calculated to be about $13,700 per year. According to the World Health Organization (WHO), these figures will more than double by the year 2030. This cost will be reduced dramatically if someone can predict diabetes statistically on the basis of some covariates. Although several classification techniques are available, it is very difficult to classify diabetes. The main objectives of this paper are as follows: (i) Gaussian process classification (GPC), (ii) comparative classifier for diabetes data classification, (iii) data analysis using the cross-validation approach, (iv) interpretation of the data analysis and (v) benchmarking our method against others. METHODS: To classify diabetes, several classification techniques are used such as linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and Naive Bayes (NB). However, most of the medical data show non-normality, non-linearity and inherent correlation structure. So in this paper we adapted Gaussian process (GP)-based classification technique using three kernels namely: linear, polynomial and radial basis kernel. We also investigate the performance of a GP-based classification technique in comparison to existing techniques such as LDA, QDA and NB. Performances are evaluated by using the accuracy (ACC), sensitivity (SE), specificity (SP), positive predictive value (PPV), negative predictive value (NPV) and receiver-operating characteristic (ROC) curves. RESULTS: Pima Indian diabetes dataset is taken as part of the study. This consists of 768 patients, of which 268 patients are diabetic and 500 patients are controls. Our machine learning system shows the performance of GP-based model as: ACC 81.97%, SE 91.79%, SP 63.33%, PPV 84.91% and NPV 62.50% which are larger compared to other methods.
BACKGROUND AND OBJECTIVE:Diabetes is a silent killer. The main cause of this disease is the presence of excessive amounts of metabolites such as glucose. There were about 387 million diabeticpeople all over the world in 2014. The financial burden of this disease has been calculated to be about $13,700 per year. According to the World Health Organization (WHO), these figures will more than double by the year 2030. This cost will be reduced dramatically if someone can predict diabetes statistically on the basis of some covariates. Although several classification techniques are available, it is very difficult to classify diabetes. The main objectives of this paper are as follows: (i) Gaussian process classification (GPC), (ii) comparative classifier for diabetes data classification, (iii) data analysis using the cross-validation approach, (iv) interpretation of the data analysis and (v) benchmarking our method against others. METHODS: To classify diabetes, several classification techniques are used such as linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and Naive Bayes (NB). However, most of the medical data show non-normality, non-linearity and inherent correlation structure. So in this paper we adapted Gaussian process (GP)-based classification technique using three kernels namely: linear, polynomial and radial basis kernel. We also investigate the performance of a GP-based classification technique in comparison to existing techniques such as LDA, QDA and NB. Performances are evaluated by using the accuracy (ACC), sensitivity (SE), specificity (SP), positive predictive value (PPV), negative predictive value (NPV) and receiver-operating characteristic (ROC) curves. RESULTS: Pima Indian diabetes dataset is taken as part of the study. This consists of 768 patients, of which 268 patients are diabetic and 500 patients are controls. Our machine learning system shows the performance of GP-based model as: ACC 81.97%, SE 91.79%, SP 63.33%, PPV 84.91% and NPV 62.50% which are larger compared to other methods.
Authors: Luca Saba; Skandha S Sanagala; Suneet K Gupta; Vijaya K Koppula; Amer M Johri; Aditya M Sharma; Raghu Kolluri; Deepak L Bhatt; Andrew Nicolaides; Jasjit S Suri Journal: Int J Cardiovasc Imaging Date: 2021-01-09 Impact factor: 2.357
Authors: Ankush Jamthikar; Deep Gupta; Narendra N Khanna; Tadashi Araki; Luca Saba; Andrew Nicolaides; Aditya Sharma; Tomaz Omerzu; Harman S Suri; Ajay Gupta; Sophie Mavrogeni; Monika Turk; John R Laird; Athanasios Protogerou; Petros P Sfikakis; George D Kitas; Vijay Viswanathan; Gyan Pareek; Martin Miner; Jasjit S Suri Journal: Curr Atheroscler Rep Date: 2019-05-01 Impact factor: 5.113
Authors: Ankush D Jamthikar; Deep Gupta; Laura E Mantella; Luca Saba; John R Laird; Amer M Johri; Jasjit S Suri Journal: Int J Cardiovasc Imaging Date: 2020-11-12 Impact factor: 2.357
Authors: George Konstantonis; Krishna V Singh; Petros P Sfikakis; Ankush D Jamthikar; George D Kitas; Suneet K Gupta; Luca Saba; Kleio Verrou; Narendra N Khanna; Zoltan Ruzsa; Aditya M Sharma; John R Laird; Amer M Johri; Manudeep Kalra; Athanasios Protogerou; Jasjit S Suri Journal: Rheumatol Int Date: 2022-01-11 Impact factor: 2.631
Authors: Mainak Biswas; Luca Saba; Tomaž Omerzu; Amer M Johri; Narendra N Khanna; Klaudija Viskovic; Sophie Mavrogeni; John R Laird; Gyan Pareek; Martin Miner; Antonella Balestrieri; Petros P Sfikakis; Athanasios Protogerou; Durga Prasanna Misra; Vikas Agarwal; George D Kitas; Raghu Kolluri; Aditya Sharma; Vijay Viswanathan; Zoltan Ruzsa; Andrew Nicolaides; Jasjit S Suri Journal: J Digit Imaging Date: 2021-06-02 Impact factor: 4.903