Graph-based sparse linear discriminant analysis for high-dimensional classification.

Jianyu Liu, Guan Yu, Yufeng Liu.

Abstract

Linear discriminant analysis (LDA) is a well-known classification technique that has enjoyed great success in practical applications. Despite its effectiveness for traditional low-dimensional problems, extensions of LDA are necessary in order to classify high-dimensional data. Many variants of LDA have been proposed in the literature. However, most of these methods do not fully incorporate the structural information among predictors when such information is available. In this paper, we introduce a new high-dimensional LDA technique, namely graph-based sparse LDA (GSLDA), that utilizes the graph structure among the features. In particular, we use the regularized regression formulation for penalized LDA techniques, and propose to impose a structure-based sparse penalty on the discriminant vector β. The graph structure can be either given or estimated from the training data. Moreover, we explore the relationship between the within-class feature structure and the overall feature structure. Based on this relationship, we further propose a variant of GSLDA that makes effective use of unlabeled data, which can be abundant in the semi-supervised learning setting. With the new regularization, we can obtain a sparse estimate of β and more accurate and interpretable classifiers than many existing methods. Both the selection consistency of the estimate of β and the convergence rate of the classifier are established, and the error rate of the resulting classifier converges asymptotically to the Bayes error rate. Finally, we demonstrate the competitive performance of the proposed GSLDA on both simulated and real data studies.
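
The regression formulation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact form of the GSLDA penalty, so the code below is not the authors' method: it substitutes a simpler stand-in, a lasso term plus a graph-Laplacian smoothness term, applied to the least-squares formulation of two-class LDA. The function name `graph_sparse_lda_sketch`, the tuning parameters `lam1` and `lam2`, and the proximal-gradient solver are all assumptions made for illustration.

```python
import numpy as np


def soft_threshold(v, t):
    """Elementwise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def graph_sparse_lda_sketch(X, labels, adjacency, lam1=0.05, lam2=0.1, n_iter=500):
    """Estimate a sparse two-class discriminant direction (illustrative only).

    Minimizes, by proximal gradient descent,
        (1 / 2n) * ||y - Xc b||^2 + (lam2 / 2) * b' L b + lam1 * ||b||_1,
    where L is the Laplacian of the feature graph. This penalty is an
    assumed stand-in, not the penalty used by GSLDA.
    """
    n, p = X.shape
    n1 = int(np.sum(labels == 1))
    n0 = n - n1
    # Class coding under which the unpenalized least-squares slope is
    # proportional to the LDA direction; the coded response sums to zero.
    y = np.where(labels == 1, n / n1, -n / n0)
    Xc = X - X.mean(axis=0)
    # Graph Laplacian of the (symmetric, nonnegative) feature adjacency matrix.
    L = np.diag(adjacency.sum(axis=1)) - adjacency
    # Hessian of the smooth part; its largest eigenvalue sets the step size.
    H = Xc.T @ Xc / n + lam2 * L
    step = 1.0 / np.linalg.eigvalsh(H)[-1]
    b = np.zeros(p)
    for _ in range(n_iter):
        grad = H @ b - Xc.T @ y / n
        b = soft_threshold(b - step * grad, step * lam1)
    return b
```

In this sketch the Laplacian term pulls the coefficients of connected features toward each other, while the lasso term sets weak coefficients exactly to zero. The adjacency matrix can be supplied from prior knowledge or estimated from the training data, as the abstract notes.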

Keywords:  Feature structure; Gaussian graphical models; Regularization; Undirected graph

Year:  2018        PMID: 31983784      PMCID: PMC6980367          DOI: 10.1016/j.jmva.2018.12.007

Source DB:  PubMed          Journal:  J Multivar Anal        ISSN: 0047-259X            Impact factor:   1.473


