Penalized feature selection and classification in bioinformatics.

Shuangge Ma, Jian Huang.

Abstract

In bioinformatics studies, supervised classification with high-dimensional input variables is frequently encountered. Examples routinely arise in genomic, epigenetic and proteomic studies. Feature selection can be employed along with classifier construction to avoid over-fitting, to generate more reliable classifiers and to provide more insight into the underlying causal relationships. In this article, we provide a review of several recently developed penalized feature selection and classification techniques, which belong to the family of embedded feature selection methods, for bioinformatics studies with high-dimensional input. Classification objective functions, penalty functions and computational algorithms are discussed. Our goal is to make interested researchers aware of these feature selection and classification methods that are applicable to high-dimensional bioinformatics data.
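
As a rough illustration of what an embedded, penalized method can look like, the sketch below fits a logistic classifier with an L1 (lasso-type) penalty, so that classifier construction and feature selection happen in one optimization. It is not taken from the article: the choice of the L1 penalty, the proximal-gradient solver, the helper names (`sigmoid`, `soft_threshold`, `fit_l1_logistic`), the tuning values and the synthetic data are all assumptions made for this example.

```python
import math
import random


def sigmoid(u):
    """Numerically stable logistic function."""
    if u >= 0:
        return 1.0 / (1.0 + math.exp(-u))
    e = math.exp(u)
    return e / (1.0 + e)


def soft_threshold(z, t):
    """Proximal operator of t * |z|: shrink z toward zero by t."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def fit_l1_logistic(X, y, lam, step=0.5, n_iter=200):
    """Minimize mean logistic loss + lam * sum(|beta_j|) by proximal gradient.

    The intercept b0 is left unpenalized (an assumption of this sketch).
    """
    n, p = len(X), len(X[0])
    b0, beta = 0.0, [0.0] * p
    for _ in range(n_iter):
        # Residuals (predicted probability minus label) drive the gradient.
        resid = [
            sigmoid(b0 + sum(x * b for x, b in zip(row, beta))) - yi
            for row, yi in zip(X, y)
        ]
        grad0 = sum(resid) / n
        grad = [sum(r * row[j] for r, row in zip(resid, X)) / n for j in range(p)]
        # Gradient step on the smooth loss, then soft-thresholding for the penalty.
        b0 -= step * grad0
        beta = [soft_threshold(b - step * g, step * lam) for b, g in zip(beta, grad)]
    return b0, beta


# Synthetic data (an assumption for illustration): more features than samples,
# and only features 0 and 1 determine the class label.
random.seed(0)
n, p = 80, 120
X = [[random.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
y = [1.0 if row[0] - row[1] > 0 else 0.0 for row in X]

b0, beta = fit_l1_logistic(X, y, lam=0.1)
selected = [j for j, b in enumerate(beta) if b != 0.0]
print("selected features:", selected)
```

The features with nonzero coefficients are the selected ones. In practice the penalty weight `lam` would typically be chosen by cross-validation rather than fixed by hand as it is here.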

Year: 2008; PMID: 18562478; PMCID: PMC2733190; DOI: 10.1093/bib/bbn027

Source DB: PubMed; Journal: Brief Bioinform; ISSN: 1467-5463; Impact factor: 11.622


