Literature DB >> 30537483

Feature selection and tumor classification for microarray data using relaxed Lasso and generalized multi-class support vector machine.

Chuanze Kang1, Yanhao Huo1, Lihui Xin1, Baoguang Tian1, Bin Yu2.   

Abstract

At present, the study of gene expression data provides a reference for tumor diagnosis at the molecular level. It is a challenging task to select the feature genes related to the classification from the high-dimensional and small-sample gene expression data and successfully separate the different subtypes of tumor or between the normal and patient. In this paper, we present a new method for tumor classification-relaxed Lasso (least absolute shrinkage and selection operator) and generalized multi-class support vector machine (rL-GenSVM). The tumor datasets are firstly z-score normalized. Secondly, using relaxed Lasso to select feature gene sets on training set, and finally, generalized multi-class support vector machine (GenSVM) serves as a classifier. We select four two-class datasets and four multi-class datasets for experiments. And four classifiers are used to predict and compare the classification accuracy on test set. To compare with other proposed methods, we obtain satisfactory classification accuracy by 10-fold cross-validation on all samples of each dataset. The experimental results show that the method proposed in this paper selects fewer feature genes and achieves higher classification accuracy. rL-GenSVM uses regularization parameters to avoid overfitting and can be widely applied to high-dimensional and small-sample tumor data classification. The source code and all datasets are available at https://github.com/QUST-AIBBDRC/rL-GenSVM/.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Feature genes; Gene expression data; Generalized multi-class support vector machine; Relaxed Lasso; Tumor classification

Mesh:

Year:  2018        PMID: 30537483     DOI: 10.1016/j.jtbi.2018.12.010

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  12 in total

1.  Use of radiomics based on 18F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach.

Authors:  Yi Zhou; Xue-Lei Ma; Ting Zhang; Jian Wang; Tao Zhang; Rong Tian
Journal:  Eur J Nucl Med Mol Imaging       Date:  2021-02-05       Impact factor: 9.236

2.  A sex-stratified analysis of neuroimmune gene expression signatures in Alzheimer's disease brains.

Authors:  Cristina Sanfilippo; Paola Castrogiovanni; Manlio Vinciguerra; Rosa Imbesi; Martina Ulivieri; Francesco Fazio; Kaj Blennow; Henrik Zetterberg; Michelino Di Rosa
Journal:  Geroscience       Date:  2022-09-22       Impact factor: 7.581

3.  pSuc-FFSEA: Predicting Lysine Succinylation Sites in Proteins Based on Feature Fusion and Stacking Ensemble Algorithm.

Authors:  Jianhua Jia; Genqiang Wu; Wangren Qiu
Journal:  Front Cell Dev Biol       Date:  2022-05-24

4.  GNG13 Is a Potential Marker of the State of Health of Alzheimer's Disease Patients' Cerebellum.

Authors:  Cristina Sanfilippo; Giuseppe Musumeci; Maria Kazakova; Venera Mazzone; Paola Castrogiovanni; Rosa Imbesi; Michelino Di Rosa
Journal:  J Mol Neurosci       Date:  2020-10-15       Impact factor: 3.444

5.  RNA-Seq-Based Breast Cancer Subtypes Classification Using Machine Learning Approaches.

Authors:  Zhezhou Yu; Zhuo Wang; Xiangchun Yu; Zhe Zhang
Journal:  Comput Intell Neurosci       Date:  2020-10-29

6.  Support Vector Machine for Lung Adenocarcinoma Staging Through Variant Pathways.

Authors:  Feng Di; Chunxiao He; Guimei Pu; Chunyi Zhang
Journal:  G3 (Bethesda)       Date:  2020-07-07       Impact factor: 3.154

7.  Personalized Body Constitution Inquiry Based on Machine Learning.

Authors:  Baochao Fan; Yanghui Li; Guihua Wen; Yan Ren; Yantong Lu; Ziying Wang; Yuan Zhang; Changjun Wang
Journal:  J Healthc Eng       Date:  2020-11-12       Impact factor: 2.682

8.  Brain CHID1 Expression Correlates with NRGN and CALB1 in Healthy Subjects and AD Patients.

Authors:  Paola Castrogiovanni; Cristina Sanfilippo; Rosa Imbesi; Grazia Maugeri; Debora Lo Furno; Daniele Tibullo; Alessandro Castorina; Giuseppe Musumeci; Michelino Di Rosa
Journal:  Cells       Date:  2021-04-13       Impact factor: 6.600

9.  MI_DenseNetCAM: A Novel Pan-Cancer Classification and Prediction Method Based on Mutual Information and Deep Learning Model.

Authors:  Jianlin Wang; Xuebing Dai; Huimin Luo; Chaokun Yan; Ge Zhang; Junwei Luo
Journal:  Front Genet       Date:  2021-06-03       Impact factor: 4.599

10.  Feature selection with the Fisher score followed by the Maximal Clique Centrality algorithm can accurately identify the hub genes of hepatocellular carcinoma.

Authors:  Chengzhang Li; Jiucheng Xu
Journal:  Sci Rep       Date:  2019-11-21       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.