Literature DB >> 17384429

Improved centroids estimation for the nearest shrunken centroid classifier.

Sijian Wang1, Ji Zhu.   

Abstract

MOTIVATION: The nearest shrunken centroid (NSC) method has been successfully applied in many DNA-microarray classification problems. The NSC uses 'shrunken' centroids as prototypes for each class and identifies subsets of genes that best characterize each class. Classification is then made to the nearest (shrunken) centroid. The NSC is very easy to implement and very easy to interpret, however, it has drawbacks.
RESULTS: We show that the NSC method can be interpreted in the framework of LASSO regression. Based on that, we consider two new methods, adaptive L(infinity)-norm penalized NSC (ALP-NSC) and adaptive hierarchically penalized NSC (AHP-NSC), with two different penalty functions for microarray classification, which improve over the NSC. Unlike the L(1)-norm penalty used in LASSO, the penalty terms that we consider make use of the fact that parameters belonging to one gene should be treated as a natural group. Numerical results indicate that the two new methods tend to remove irrelevant genes more effectively and provide better classification results than the L(1)-norm approach. AVAILABILITY: R code for the ALP-NSC and the AHP-NSC algorithms are available from authors upon request.

Entities:  

Mesh:

Year:  2007        PMID: 17384429     DOI: 10.1093/bioinformatics/btm046

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

Review 1.  A Cancer Biologist's Primer on Machine Learning Applications in High-Dimensional Cytometry.

Authors:  Timothy J Keyes; Pablo Domizi; Yu-Chen Lo; Garry P Nolan; Kara L Davis
Journal:  Cytometry A       Date:  2020-06-30       Impact factor: 4.355

2.  A ROAD to Classification in High Dimensional Space.

Authors:  Jianqing Fan; Yang Feng; Xin Tong
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-04-12       Impact factor: 4.488

3.  Optimal Feature Selection in High-Dimensional Discriminant Analysis.

Authors:  Mladen Kolar; Han Liu
Journal:  IEEE Trans Inf Theory       Date:  2015-02       Impact factor: 2.501

4.  Bias-corrected diagonal discriminant rules for high-dimensional classification.

Authors:  Song Huang; Tiejun Tong; Hongyu Zhao
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

5.  Covariance-enhanced discriminant analysis.

Authors:  Peirong Xu; J I Zhu; Lixing Zhu; Y I Li
Journal:  Biometrika       Date:  2014-12-03       Impact factor: 2.445

6.  Block-diagonal discriminant analysis and its bias-corrected rules.

Authors:  Herbert Pang; Tiejun Tong; Michael Ng
Journal:  Stat Appl Genet Mol Biol       Date:  2013-06

7.  Identification of significant features in DNA microarray data.

Authors:  Eric Bair
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2013-07

8.  High-dimensional integrative copula discriminant analysis for multiomics data.

Authors:  Yong He; Hao Chen; Hao Sun; Jiadong Ji; Yufeng Shi; Xinsheng Zhang; Lei Liu
Journal:  Stat Med       Date:  2020-10-15       Impact factor: 2.373

9.  Improved shrunken centroid classifiers for high-dimensional class-imbalanced data.

Authors:  Rok Blagus; Lara Lusa
Journal:  BMC Bioinformatics       Date:  2013-02-23       Impact factor: 3.169

10.  Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method.

Authors:  Peng Guan; Desheng Huang; Miao He; Baosen Zhou
Journal:  J Exp Clin Cancer Res       Date:  2009-07-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.