Literature DB >> 24472367

Prediction of human breast and colon cancers from imbalanced data using nearest neighbor and support vector machines.

Abdul Majid1, Safdar Ali2, Mubashar Iqbal3, Nabeela Kausar4.   

Abstract

This study proposes a novel prediction approach for human breast and colon cancers using different feature spaces. The proposed scheme consists of two stages: the preprocessor and the predictor. In the preprocessor stage, the mega-trend diffusion (MTD) technique is employed to increase the samples of the minority class, thereby balancing the dataset. In the predictor stage, machine-learning approaches of K-nearest neighbor (KNN) and support vector machines (SVM) are used to develop hybrid MTD-SVM and MTD-KNN prediction models. MTD-SVM model has provided the best values of accuracy, G-mean and Matthew's correlation coefficient of 96.71%, 96.70% and 71.98% for cancer/non-cancer dataset, breast/non-breast cancer dataset and colon/non-colon cancer dataset, respectively. We found that hybrid MTD-SVM is the best with respect to prediction performance and computational cost. MTD-KNN model has achieved moderately better prediction as compared to hybrid MTD-NB (Naïve Bayes) but at the expense of higher computing cost. MTD-KNN model is faster than MTD-RF (random forest) but its prediction is not better than MTD-RF. To the best of our knowledge, the reported results are the best results, so far, for these datasets. The proposed scheme indicates that the developed models can be used as a tool for the prediction of cancer. This scheme may be useful for study of any sequential information such as protein sequence or any nucleic acid sequence.
Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

Entities:  

Keywords:  Breast/colon cancer; K-nearest neighbor; Mega-trend diffusion; Naïve Bayes; Random forest; Support vector machines

Mesh:

Year:  2014        PMID: 24472367     DOI: 10.1016/j.cmpb.2014.01.001

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  8 in total

1.  IAPSO-AIRS: A novel improved machine learning-based system for wart disease treatment.

Authors:  Moloud Abdar; Vivi Nur Wijayaningrum; Sadiq Hussain; Roohallah Alizadehsani; Pawel Plawiak; U Rajendra Acharya; Vladimir Makarenkov
Journal:  J Med Syst       Date:  2019-06-07       Impact factor: 4.460

2.  Performance Analysis of Conventional Machine Learning Algorithms for Diabetic Sensorimotor Polyneuropathy Severity Classification.

Authors:  Fahmida Haque; Mamun Bin Ibne Reaz; Muhammad Enamul Hoque Chowdhury; Geetika Srivastava; Sawal Hamid Md Ali; Ahmad Ashrif A Bakar; Mohammad Arif Sobhan Bhuiyan
Journal:  Diagnostics (Basel)       Date:  2021-04-28

3.  Diabetic peripheral neuropathy class prediction by multicategory support vector machine model: a cross-sectional study.

Authors:  Maryam Kazemi; Abbas Moghimbeigi; Javad Kiani; Hossein Mahjub; Javad Faradmal
Journal:  Epidemiol Health       Date:  2016-03-24

4.  CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests.

Authors:  Li Ma; Suohai Fan
Journal:  BMC Bioinformatics       Date:  2017-03-14       Impact factor: 3.169

5.  Preprocessing Breast Cancer Data to Improve the Data Quality, Diagnosis Procedure, and Medical Care Services.

Authors:  Zeinab Sajjadnia; Raof Khayami; Mohammad Reza Moosavi
Journal:  Cancer Inform       Date:  2020-05-27

6.  Research on imbalance machine learning methods for MR[Formula: see text]WI soft tissue sarcoma data.

Authors:  Xuanxuan Liu; Li Guo; Hexiang Wang; Jia Guo; Shifeng Yang; Lisha Duan
Journal:  BMC Med Imaging       Date:  2022-08-26       Impact factor: 2.795

7.  SVM-Prot 2016: A Web-Server for Machine Learning Prediction of Protein Functional Families from Sequence Irrespective of Similarity.

Authors:  Ying Hong Li; Jing Yu Xu; Lin Tao; Xiao Feng Li; Shuang Li; Xian Zeng; Shang Ying Chen; Peng Zhang; Chu Qin; Cheng Zhang; Zhe Chen; Feng Zhu; Yu Zong Chen
Journal:  PLoS One       Date:  2016-08-15       Impact factor: 3.240

8.  A genetic programming approach to oral cancer prognosis.

Authors:  Mei Sze Tan; Jing Wei Tan; Siow-Wee Chang; Hwa Jen Yap; Sameem Abdul Kareem; Rosnah Binti Zain
Journal:  PeerJ       Date:  2016-09-21       Impact factor: 2.984

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.