Linear regression and two-class classification with gene expression data.

Xiaohong Huang, Wei Pan.

Abstract

MOTIVATION: Using gene expression data to classify (or predict) tumor types has received much research attention recently. Due to some special features of gene expression data, several new methods have been proposed, including the weighted voting scheme of Golub et al., the compound covariate method of Hedenfalk et al. (originally proposed by Tukey), and the shrunken centroids method of Tibshirani et al. These methods look different and are more or less ad hoc.
RESULTS: We point out a close connection of the three methods with a linear regression model. Casting the classification problem in the general framework of linear regression naturally leads to new alternatives, such as partial least squares (PLS) methods and penalized PLS (PPLS) methods. Using two real data sets, we show the competitive performance of our new methods when compared with the other three methods.
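One simple way to see how a regression view can line up with centroid-style rules is sketched below. This is an illustration under stated assumptions, not the authors' derivation or code: with classes coded 0/1 and both the expression matrix and the labels centered, the unnormalized first PLS weight vector is the gene-wise difference in class means scaled by n0*n1/n. The function names, the synthetic data, and the midpoint cutoff are all hypothetical choices made for this example.

```python
import numpy as np

def first_pls_weights(X, y):
    """Unnormalized first PLS weight vector: X_c^T y_c, one entry per gene."""
    Xc = X - X.mean(axis=0)   # center each gene (column)
    yc = y - y.mean()         # center the 0/1 class labels
    return Xc.T @ yc

def class_mean_difference(X, y):
    """Gene-wise difference between the class-1 and class-0 mean expression."""
    return X[y == 1].mean(axis=0) - X[y == 0].mean(axis=0)

def predict_one_component(X_train, y_train, X_new):
    """Classify with a single PLS-style score (hypothetical midpoint cutoff)."""
    center = X_train.mean(axis=0)
    w = first_pls_weights(X_train, y_train)
    t_train = (X_train - center) @ w
    # Cut halfway between the two class means of the training scores.
    cutoff = 0.5 * (t_train[y_train == 0].mean() + t_train[y_train == 1].mean())
    return ((X_new - center) @ w > cutoff).astype(int)

# Synthetic data (an assumption for illustration, not from the paper):
# 20 samples, 50 genes, the first 5 genes shifted upward in class 1.
rng = np.random.default_rng(0)
n_samples, n_genes = 20, 50
y = np.array([0] * 12 + [1] * 8)
X = rng.normal(size=(n_samples, n_genes))
X[y == 1, :5] += 2.0

w = first_pls_weights(X, y)
d = class_mean_difference(X, y)
n0, n1 = np.sum(y == 0), np.sum(y == 1)
scale = n0 * n1 / n_samples

# The two weight vectors agree up to the constant n0*n1/n.
print(np.allclose(w, scale * d))

pred = predict_one_component(X, y, X)
```

The sketch covers only a single unpenalized component. Dividing each gene's mean difference by a standard-error estimate would move the weights toward t-statistic weighting, and shrinking small weights to zero would move them toward gene selection, but those steps are not implemented here.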

Year: 2003; PMID: 14594712; DOI: 10.1093/bioinformatics/btg283

Source DB: PubMed; Journal: Bioinformatics; ISSN: 1367-4803; Impact factor: 6.937


  18 in total

1.  High Dimensional Classification Using Features Annealed Independence Rules.

Authors:  Jianqing Fan; Yingying Fan
Journal:  Ann Stat       Date:  2008       Impact factor: 4.028

2.  A ROAD to Classification in High Dimensional Space.

Authors:  Jianqing Fan; Yang Feng; Xin Tong
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-04-12       Impact factor: 4.488

3.  A Selective Overview of Variable Selection in High Dimensional Feature Space.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  Stat Sin       Date:  2010-01       Impact factor: 1.261

4.  Radiomics score: a potential prognostic imaging feature for postoperative survival of solitary HCC patients.

Authors:  Bo-Hao Zheng; Long-Zi Liu; Zhi-Zhi Zhang; Jie-Yi Shi; Liang-Qing Dong; Ling-Yu Tian; Zhen-Bin Ding; Yuan Ji; Sheng-Xiang Rao; Jian Zhou; Jia Fan; Xiao-Ying Wang; Qiang Gao
Journal:  BMC Cancer       Date:  2018-11-21       Impact factor: 4.430

Review 5.  Statistical methods for integrating multiple types of high-throughput data.

Authors:  Yang Xie; Chul Ahn
Journal:  Methods Mol Biol       Date:  2010

6.  Feature Augmentation via Nonparametrics and Selection (FANS) in High-Dimensional Classification.

Authors:  Jianqing Fan; Yang Feng; Jiancheng Jiang; Xin Tong
Journal:  J Am Stat Assoc       Date:  2016-05-05       Impact factor: 5.033

7.  The simple classification of multiple cancer types using a small number of significant genes.

Authors:  Tae Young Yang
Journal:  Mol Diagn Ther       Date:  2007       Impact factor: 4.074

8.  An integrated method for cancer classification and rule extraction from microarray data.

Authors:  Liang-Tsung Huang
Journal:  J Biomed Sci       Date:  2009-02-24       Impact factor: 8.410

9.  SlimPLS: a method for feature selection in gene expression-based disease classification.

Authors:  Michael Gutkin; Ron Shamir; Gideon Dror
Journal:  PLoS One       Date:  2009-07-29       Impact factor: 3.240

10.  geneCBR: a translational tool for multiple-microarray analysis and integrative information retrieval for aiding diagnosis in cancer research.

Authors:  Daniel Glez-Peña; Fernando Díaz; Jesús M Hernández; Juan M Corchado; Florentino Fdez-Riverola
Journal:  BMC Bioinformatics       Date:  2009-06-18       Impact factor: 3.169
