Literature DB >> 30478811

A reliable method for colorectal cancer prediction based on feature selection and support vector machine.

Dandan Zhao1,2, Hong Liu3,4, Yuanjie Zheng1,2, Yanlin He1,2, Dianjie Lu1,2, Chen Lyu1,2.   

Abstract

Colorectal cancer (CRC) is a common cancer responsible for approximately 600,000 deaths per year worldwide. Thus, it is very important to find the related factors and detect the cancer accurately. However, timely and accurate prediction of the disease is challenging. In this study, we build an integrated model based on logistic regression (LR) and support vector machine (SVM) to classify the CRC into cancer and normal samples. From various factors, human location, age, gender, BMI, and cancer tumor type, tumor grade, and DNA, of the cancer, we select the most significant factors (p < 0.05) using logistic regression as main features, and with these features, a grid-search SVM model is designed using different kernel types (Linear, radial basis function (RBF), Sigmoid, and Polynomial). The result of the logistic regression indicates that the Firmicutes (AUC 0.918), Bacteroidetes (AUC 0.856), body mass index (BMI) (AUC 0.777), and age (AUC 0.710) and their combined factors (AUC 0.942) are effective for CRC detection. And the best kernel type is RBF, which achieves an accuracy of 90.1% when k = 5, and 91.2% when k = 10. This study provides a new method for colorectal cancer prediction based on independent risky factors. Graphical abstract Flow chart depicting the method adopted in the study. LR (logistic regression) and ROC curve are used to select independent features as input of SVM. SVM kernel selection aims to find the best kernel function for classification by comparing Linear, RBF, Sigmoid, and Polynomial kernel types of SVM, and the result shows the best kernel is RBF. Classification performance of LR + RF, LR + NB, LR + KNN, and LR + ANNs models are compared with LR + SVM. After these steps, the cancer and healthy individuals can be classified, and the best model is selected.

Entities:  

Keywords:  Colorectal cancer; Logistic regression; Microbiome; Support vector machine

Mesh:

Year:  2018        PMID: 30478811     DOI: 10.1007/s11517-018-1930-0

Source DB:  PubMed          Journal:  Med Biol Eng Comput        ISSN: 0140-0118            Impact factor:   2.602


  8 in total

1.  Identifying the tumor location-associated candidate genes in development of new drugs for colorectal cancer using machine-learning-based approach.

Authors:  Tuncay Bayrak; Zafer Çetin; E İlker Saygılı; Hasan Ogul
Journal:  Med Biol Eng Comput       Date:  2022-08-10       Impact factor: 3.079

2.  Self-evoluting framework of deep convolutional neural network for multilocus protein subcellular localization.

Authors:  Hanhan Cong; Hong Liu; Yuehui Chen; Yi Cao
Journal:  Med Biol Eng Comput       Date:  2020-10-20       Impact factor: 2.602

3.  Microbiome Analysis of More Than 2,000 NHS Bowel Cancer Screening Programme Samples Shows the Potential to Improve Screening Accuracy.

Authors:  Caroline Young; Henry M Wood; Alba Fuentes Balaguer; Daniel Bottomley; Niall Gallop; Lyndsay Wilkinson; Sally C Benton; Martin Brealey; Cerin John; Carole Burtonwood; Kelsey N Thompson; Yan Yan; Jennifer H Barrett; Eva J A Morris; Curtis Huttenhower; Philip Quirke
Journal:  Clin Cancer Res       Date:  2021-03-03       Impact factor: 13.801

4.  Colorectal Cancer Prediction Based on Weighted Gene Co-Expression Network Analysis and Variational Auto-Encoder.

Authors:  Dongmei Ai; Yuduo Wang; Xiaoxin Li; Hongfei Pan
Journal:  Biomolecules       Date:  2020-08-20

5.  Colorectal Cancer Detected by Machine Learning Models Using Conventional Laboratory Test Data.

Authors:  Hui Li; Jianmei Lin; Yanhong Xiao; Wenwen Zheng; Lu Zhao; Xiangling Yang; Minsheng Zhong; Huanliang Liu
Journal:  Technol Cancer Res Treat       Date:  2021 Jan-Dec

6.  Development and validation of a risk index to predict kidney graft survival: the kidney transplant risk index.

Authors:  Sameera Senanayake; Sanjeewa Kularatna; Helen Healy; Nicholas Graves; Keshwar Baboolal; Matthew P Sypek; Adrian Barnett
Journal:  BMC Med Res Methodol       Date:  2021-06-21       Impact factor: 4.615

7.  Using machine learning techniques to develop risk prediction models to predict graft failure following kidney transplantation: protocol for a retrospective cohort study.

Authors:  Sameera Senanayake; Adrian Barnett; Nicholas Graves; Helen Healy; Keshwar Baboolal; Sanjeewa Kularatna
Journal:  F1000Res       Date:  2019-10-29

8.  A Model Using Support Vector Machines Recursive Feature Elimination (SVM-RFE) Algorithm to Classify Whether COPD Patients Have Been Continuously Managed According to GOLD Guidelines.

Authors:  Jie Xia; Lina Sun; Suqin Xu; Qiu Xiang; Jianping Zhao; Weining Xiong; Yongjian Xu; Shuyuan Chu
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2020-11-04
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.