Literature DB >> 22868679

Biomarker identification and cancer classification based on microarray data using Laplace naive Bayes model with mean shrinkage.

Meng-Yun Wu1, Dao-Qing Dai, Yu Shi, Hong Yan, Xiao-Fei Zhang.   

Abstract

Biomarker identification and cancer classification are two closely related problems. In gene expression data sets, the correlation between genes can be high when they share the same biological pathway. Moreover, the gene expression data sets may contain outliers due to either chemical or electrical reasons. A good gene selection method should take group effects into account and be robust to outliers. In this paper, we propose a Laplace naive Bayes model with mean shrinkage (LNB-MS). The Laplace distribution instead of the normal distribution is used as the conditional distribution of the samples for the reasons that it is less sensitive to outliers and has been applied in many fields. The key technique is the L1 penalty imposed on the mean of each class to achieve automatic feature selection. The objective function of the proposed model is a piecewise linear function with respect to the mean of each class, of which the optimal value can be evaluated at the breakpoints simply. An efficient algorithm is designed to estimate the parameters in the model. A new strategy that uses the number of selected features to control the regularization parameter is introduced. Experimental results on simulated data sets and 17 publicly available cancer data sets attest to the accuracy, sparsity, efficiency, and robustness of the proposed algorithm. Many biomarkers identified with our method have been verified in biochemical or biomedical research. The analysis of biological and functional correlation of the genes based on Gene Ontology (GO) terms shows that the proposed method guarantees the selection of highly correlated genes simultaneously

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22868679     DOI: 10.1109/TCBB.2012.105

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  7 in total

1.  Folded concave penalized learning in identifying multimodal MRI marker for Parkinson's disease.

Authors:  Hongcheng Liu; Guangwei Du; Lijun Zhang; Mechelle M Lewis; Xue Wang; Tao Yao; Runze Li; Xuemei Huang
Journal:  J Neurosci Methods       Date:  2016-04-19       Impact factor: 2.390

2.  Integration of 24 Feature Types to Accurately Detect and Predict Seizures Using Scalp EEG Signals.

Authors:  Yinda Zhang; Shuhan Yang; Yang Liu; Yexian Zhang; Bingfeng Han; Fengfeng Zhou
Journal:  Sensors (Basel)       Date:  2018-04-28       Impact factor: 3.576

3.  Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.

Authors:  Meng-Yun Wu; Dao-Qing Dai; Xiao-Fei Zhang; Yuan Zhu
Journal:  PLoS One       Date:  2013-06-17       Impact factor: 3.240

4.  Hierarchical gene selection and genetic fuzzy system for cancer microarray data classification.

Authors:  Thanh Nguyen; Abbas Khosravi; Douglas Creighton; Saeid Nahavandi
Journal:  PLoS One       Date:  2015-03-30       Impact factor: 3.240

5.  An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets.

Authors:  Arezo Torang; Paraag Gupta; David J Klinke
Journal:  BMC Bioinformatics       Date:  2019-08-22       Impact factor: 3.169

6.  StressGenePred: a twin prediction model architecture for classifying the stress types of samples and discovering stress-related genes in arabidopsis.

Authors:  Dongwon Kang; Hongryul Ahn; Sangseon Lee; Chai-Jin Lee; Jihye Hur; Woosuk Jung; Sun Kim
Journal:  BMC Genomics       Date:  2019-12-20       Impact factor: 3.969

7.  Identification of genes associated with renal cell carcinoma using gene expression profiling analysis.

Authors:  Ting Yao; Qinfu Wang; Wenyong Zhang; Aihong Bian; Jinping Zhang
Journal:  Oncol Lett       Date:  2016-05-16       Impact factor: 2.967

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.