Literature DB >> 33815733

Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques.

Bilal Khan1, Rashid Naseem2, Muhammad Arif Shah2, Karzan Wakil3, Atif Khan4, M Irfan Uddin5, Marwan Mahmoud6.   

Abstract

Software defect prediction (SDP) in the initial period of the software development life cycle (SDLC) remains a critical and important assignment. SDP is essentially studied during few last decades as it leads to assure the quality of software systems. The quick forecast of defective or imperfect artifacts in software development may serve the development team to use the existing assets competently and more effectively to provide extraordinary software products in the given or narrow time. Previously, several canvassers have industrialized models for defect prediction utilizing machine learning (ML) and statistical techniques. ML methods are considered as an operative and operational approach to pinpoint the defective modules, in which moving parts through mining concealed patterns amid software metrics (attributes). ML techniques are also utilized by several researchers on healthcare datasets. This study utilizes different ML techniques software defect prediction using seven broadly used datasets. The ML techniques include the multilayer perceptron (MLP), support vector machine (SVM), decision tree (J48), radial basis function (RBF), random forest (RF), hidden Markov model (HMM), credal decision tree (CDT), K-nearest neighbor (KNN), average one dependency estimator (A1DE), and Naïve Bayes (NB). The performance of each technique is evaluated using different measures, for instance, relative absolute error (RAE), mean absolute error (MAE), root mean squared error (RMSE), root relative squared error (RRSE), recall, and accuracy. The inclusive outcome shows the best performance of RF with 88.32% average accuracy and 2.96 rank value, second-best performance is achieved by SVM with 87.99% average accuracy and 3.83 rank values. Moreover, CDT also shows 87.88% average accuracy and 3.62 rank values, placed on the third position. The comprehensive outcomes of research can be utilized as a reference point for new research in the SDP domain, and therefore, any assertion concerning the enhancement in prediction over any new technique or model can be benchmarked and proved.
Copyright © 2021 Bilal Khan et al.

Entities:  

Mesh:

Year:  2021        PMID: 33815733      PMCID: PMC7987450          DOI: 10.1155/2021/8899263

Source DB:  PubMed          Journal:  J Healthc Eng        ISSN: 2040-2295            Impact factor:   2.682


  13 in total

1.  Severe Dengue Prognosis Using Human Genome Data and Machine Learning.

Authors:  Caio Davi; Andre Pastor; Thiego Oliveira; Fernando B de Lima Neto; Ulisses Braga-Neto; Abigail W Bigham; Michael Bamshad; Ernesto T A Marques; Bartolomeu Acioli-Santos
Journal:  IEEE Trans Biomed Eng       Date:  2019-02-04       Impact factor: 4.538

2.  Hidden Markov model speed heuristic and iterative HMM search procedure.

Authors:  L Steven Johnson; Sean R Eddy; Elon Portugaly
Journal:  BMC Bioinformatics       Date:  2010-08-18       Impact factor: 3.169

3.  Clinical application of modified bag-of-features coupled with hybrid neural-based classifier in dengue fever classification using gene expression data.

Authors:  Sankhadeep Chatterjee; Nilanjan Dey; Fuqian Shi; Amira S Ashour; Simon James Fong; Soumya Sen
Journal:  Med Biol Eng Comput       Date:  2017-09-11       Impact factor: 2.602

4.  Gene prediction with a hidden Markov model and a new intron submodel.

Authors:  Mario Stanke; Stephan Waack
Journal:  Bioinformatics       Date:  2003-10       Impact factor: 6.937

5.  Analysis of dengue infection based on Raman spectroscopy and support vector machine (SVM).

Authors:  Saranjam Khan; Rahat Ullah; Asifullah Khan; Noorul Wahab; Muhammad Bilal; Mushtaq Ahmed
Journal:  Biomed Opt Express       Date:  2016-05-18       Impact factor: 3.732

6.  Morbidity Rate Prediction of Dengue Hemorrhagic Fever (DHF) Using the Support Vector Machine and the Aedes aegypti Infection Rate in Similar Climates and Geographical Areas.

Authors:  Kraisak Kesorn; Phatsavee Ongruk; Jakkrawarn Chompoosri; Atchara Phumee; Usavadee Thavara; Apiwat Tawatsin; Padet Siriyasatien
Journal:  PLoS One       Date:  2015-05-11       Impact factor: 3.240

7.  Prospective forecasts of annual dengue hemorrhagic fever incidence in Thailand, 2010-2014.

Authors:  Stephen A Lauer; Krzysztof Sakrejda; Evan L Ray; Lindsay T Keegan; Qifang Bi; Paphanij Suangtho; Soawapak Hinjoy; Sopon Iamsirithaworn; Suthanun Suthachana; Yongjua Laosiritaworn; Derek A T Cummings; Justin Lessler; Nicholas G Reich
Journal:  Proc Natl Acad Sci U S A       Date:  2018-02-20       Impact factor: 11.205

8.  Machine learning methods reveal the temporal pattern of dengue incidence using meteorological factors in metropolitan Manila, Philippines.

Authors:  Thaddeus M Carvajal; Katherine M Viacrusis; Lara Fides T Hernandez; Howell T Ho; Divina M Amalin; Kozo Watanabe
Journal:  BMC Infect Dis       Date:  2018-04-17       Impact factor: 3.090

9.  Predicting Infectious Disease Using Deep Learning and Big Data.

Authors:  Sangwon Chae; Sungjun Kwon; Donghyun Lee
Journal:  Int J Environ Res Public Health       Date:  2018-07-27       Impact factor: 3.390

10.  A Systematic Machine Learning Based Approach for the Diagnosis of Non-Alcoholic Fatty Liver Disease Risk and Progression.

Authors:  Sajida Perveen; Muhammad Shahbaz; Karim Keshavjee; Aziz Guergachi
Journal:  Sci Rep       Date:  2018-02-01       Impact factor: 4.379

View more
  1 in total

1.  Research on Chorus Emotion Recognition and Intelligent Medical Application Based on Health Big Data.

Authors:  Yu Li; Yao Chen
Journal:  J Healthc Eng       Date:  2022-03-09       Impact factor: 2.682

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.