Literature DB >> 16249260

Gene selection using support vector machines with non-convex penalty.

Hao Helen Zhang1, Jeongyoun Ahn, Xiaodong Lin, Cheolwoo Park.   

Abstract

MOTIVATION: With the development of DNA microarray technology, scientists can now measure the expression levels of thousands of genes simultaneously in one single experiment. One current difficulty in interpreting microarray data comes from their innate nature of 'high-dimensional low sample size'. Therefore, robust and accurate gene selection methods are required to identify differentially expressed group of genes across different samples, e.g. between cancerous and normal cells. Successful gene selection will help to classify different cancer types, lead to a better understanding of genetic signatures in cancers and improve treatment strategies. Although gene selection and cancer classification are two closely related problems, most existing approaches handle them separately by selecting genes prior to classification. We provide a unified procedure for simultaneous gene selection and cancer classification, achieving high accuracy in both aspects.
RESULTS: In this paper we develop a novel type of regularization in support vector machines (SVMs) to identify important genes for cancer classification. A special nonconvex penalty, called the smoothly clipped absolute deviation penalty, is imposed on the hinge loss function in the SVM. By systematically thresholding small estimates to zeros, the new procedure eliminates redundant genes automatically and yields a compact and accurate classifier. A successive quadratic algorithm is proposed to convert the non-differentiable and non-convex optimization problem into easily solved linear equation systems. The method is applied to two real datasets and has produced very promising results. AVAILABILITY: MATLAB codes are available upon request from the authors.

Entities:  

Mesh:

Year:  2005        PMID: 16249260     DOI: 10.1093/bioinformatics/bti736

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  45 in total

1.  HIV-associated sensory polyneuropathy and neuronal injury are associated with miRNA-455-3p induction.

Authors:  Eugene L Asahchop; William G Branton; Anand Krishnan; Patricia A Chen; Dong Yang; Linglong Kong; Douglas W Zochodne; Bruce J Brew; M John Gill; Christopher Power
Journal:  JCI Insight       Date:  2018-12-06

2.  Identification of differential gene pathways with principal component analysis.

Authors:  Shuangge Ma; Michael R Kosorok
Journal:  Bioinformatics       Date:  2009-02-17       Impact factor: 6.937

Review 3.  Penalized feature selection and classification in bioinformatics.

Authors:  Shuangge Ma; Jian Huang
Journal:  Brief Bioinform       Date:  2008-06-18       Impact factor: 11.622

4.  Two-Dimensional Solution Surface for Weighted Support Vector Machines.

Authors:  Seung Jun Shin; Yichao Wu; Hao Helen Zhang
Journal:  J Comput Graph Stat       Date:  2014-04-03       Impact factor: 2.302

5.  Interaction-based feature selection and classification for high-dimensional biological data.

Authors:  Haitian Wang; Shaw-Hwa Lo; Tian Zheng; Inchi Hu
Journal:  Bioinformatics       Date:  2012-09-03       Impact factor: 6.937

6.  Prediction-based structured variable selection through the receiver operating characteristic curves.

Authors:  Yuanjia Wang; Huaihou Chen; Runze Li; Naihua Duan; Roberto Lewis-Fernández
Journal:  Biometrics       Date:  2010-12-22       Impact factor: 2.571

7.  A Perturbation Method for Inference on Regularized Regression Estimates.

Authors:  Jessica Minnier; Lu Tian; Tianxi Cai
Journal:  J Am Stat Assoc       Date:  2012-01-24       Impact factor: 5.033

8.  Variable Selection for Support Vector Machines in Moderately High Dimensions.

Authors:  Xiang Zhang; Yichao Wu; Lan Wang; Runze Li
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2015-01-05       Impact factor: 4.488

9.  High-Dimensional Structured Feature Screening Using Binary Markov Random Fields.

Authors:  Jie Liu; Peggy Peissig; Chunming Zhang; Elizabeth Burnside; Catherine McCarty; David Page
Journal:  JMLR Workshop Conf Proc       Date:  2012

10.  Prediction of candidate primary immunodeficiency disease genes using a support vector machine learning approach.

Authors:  Shivakumar Keerthikumar; Sahely Bhadra; Kumaran Kandasamy; Rajesh Raju; Y L Ramachandra; Chiranjib Bhattacharyya; Kohsuke Imai; Osamu Ohara; Sujatha Mohan; Akhilesh Pandey
Journal:  DNA Res       Date:  2009-10-03       Impact factor: 4.458

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.