Literature DB >> 20479504

Cancer classification from gene expression data by NPPC ensemble.

Santanu Ghorai1, Anirban Mukherjee, Sanghamitra Sengupta, Pranab K Dutta.   

Abstract

The most important application of microarray in gene expression analysis is to classify the unknown tissue samples according to their gene expression levels with the help of known sample expression levels. In this paper, we present a nonparallel plane proximal classifier (NPPC) ensemble that ensures high classification accuracy of test samples in a computer-aided diagnosis (CAD) framework than that of a single NPPC model. For each data set only, a few genes are selected by using a mutual information criterion. Then a genetic algorithm-based simultaneous feature and model selection scheme is used to train a number of NPPC expert models in multiple subspaces by maximizing cross-validation accuracy. The members of the ensemble are selected by the performance of the trained models on a validation set. Besides the usual majority voting method, we have introduced minimum average proximity-based decision combiner for NPPC ensemble. The effectiveness of the NPPC ensemble and the proposed new approach of combining decisions for cancer diagnosis are studied and compared with support vector machine (SVM) classifier in a similar framework. Experimental results on cancer data sets show that the NPPC ensemble offers comparable testing accuracy to that of SVM ensemble with reduced training time on average.

Entities:  

Mesh:

Year:  2011        PMID: 20479504     DOI: 10.1109/TCBB.2010.36

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  3 in total

1.  Recognition of multiple imbalanced cancer types based on DNA microarray data using ensemble classifiers.

Authors:  Hualong Yu; Shufang Hong; Xibei Yang; Jun Ni; Yuanyuan Dan; Bin Qin
Journal:  Biomed Res Int       Date:  2013-08-26       Impact factor: 3.411

2.  Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles.

Authors:  Liying Yang; Zhimin Liu; Xiguo Yuan; Jianhua Wei; Junying Zhang
Journal:  Biomed Res Int       Date:  2016-11-24       Impact factor: 3.411

3.  An ensemble machine learning model based on multiple filtering and supervised attribute clustering algorithm for classifying cancer samples.

Authors:  Shilpi Bose; Chandra Das; Abhik Banerjee; Kuntal Ghosh; Matangini Chattopadhyay; Samiran Chattopadhyay; Aishwarya Barik
Journal:  PeerJ Comput Sci       Date:  2021-09-16
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.