
Simultaneous grouping pursuit and feature selection over an undirected graph.

Yunzhang Zhu, Xiaotong Shen, Wei Pan.   

Abstract

In high-dimensional regression, grouping pursuit and feature selection have their own merits while complementing each other in battling the curse of dimensionality. To seek a parsimonious model, we perform simultaneous grouping pursuit and feature selection over an arbitrary undirected graph with each node corresponding to one predictor. Regression coefficients whose absolute values are the same or close can be grouped when their corresponding nodes are reachable from each other over the graph. This is motivated by gene network analysis, where genes tend to work in groups according to their biological functionalities. Through a nonconvex penalty, we develop a computational strategy and analyze the proposed method. Theoretical analysis indicates that the proposed method reconstructs the oracle estimator, that is, the unbiased least squares estimator given the true grouping, leading to consistent reconstruction of grouping structures and informative features, as well as to optimal parameter estimation. Simulation studies suggest that the method combines the benefit of grouping pursuit with that of feature selection, and compares favorably with its competitors in selection accuracy and predictive performance. An application to eQTL data is used to illustrate the methodology, where a network is incorporated into analysis through an undirected graph.
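To make the oracle estimator mentioned in the abstract concrete, the following is a minimal sketch of a least squares fit when the true grouping is known. It is an illustration only, not the authors' implementation: the function name `oracle_estimator`, the `groups` and `signs` arguments, and the simulated data are all hypothetical, and the sketch assumes the sign of each coefficient is known, which the abstract does not state.

```python
import numpy as np


def oracle_estimator(X, y, groups, signs):
    """Least squares fit given a known grouping (illustrative sketch).

    groups: list of lists of column indices; features in one group are
            assumed to share a common absolute coefficient value.
            Features in no group are treated as noninformative (zero).
    signs:  +1/-1 per feature (assumed known here, a simplification).
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    signs = np.asarray(signs, dtype=float)

    # Collapse each group into a single column: the sign-adjusted sum
    # of its predictors, so one parameter is estimated per group.
    Z = np.column_stack([X[:, g] @ signs[g] for g in groups])

    # Ordinary least squares on the collapsed design.
    gamma, *_ = np.linalg.lstsq(Z, y, rcond=None)

    # Expand the group-level estimates back to one coefficient per feature.
    beta = np.zeros(X.shape[1])
    for g, value in zip(groups, gamma):
        beta[g] = signs[g] * value
    return beta


# Hypothetical noise-free example: two groups and one noninformative feature.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
true_beta = np.array([2.0, -2.0, 0.5, 0.5, 0.0])
y = X @ true_beta

beta_hat = oracle_estimator(
    X, y, groups=[[0, 1], [2, 3]], signs=[1, -1, 1, 1, 1]
)
print(np.round(beta_hat, 3))
```

Collapsing each group into one column reduces the number of free parameters from the number of predictors to the number of groups, which is the sense in which grouping and feature selection together yield a parsimonious model.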

Keywords:  Network analysis; nonconvex minimization; prediction; structured data

Year:  2013        PMID: 24098061      PMCID: PMC3787732          DOI: 10.1080/01621459.2013.770704

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  14 in total

1.  Likelihood-based selection and sharp parameter estimation.

Authors:  Xiaotong Shen; Wei Pan; Yunzhang Zhu
Journal:  J Am Stat Assoc       Date:  2012-06-11       Impact factor: 5.033

2.  Integrating pathway analysis and genetics of gene expression for genome-wide association studies.

Authors:  Hua Zhong; Xia Yang; Lee M Kaplan; Cliona Molony; Eric E Schadt
Journal:  Am J Hum Genet       Date:  2010-03-25       Impact factor: 11.025

3.  Bayesian detection of expression quantitative trait loci hot spots.

Authors:  Leonardo Bottolo; Enrico Petretto; Stefan Blankenberg; François Cambien; Stuart A Cook; Laurence Tiret; Sylvia Richardson
Journal:  Genetics       Date:  2011-09-16       Impact factor: 4.562

4.  Molecular markers of early Parkinson's disease based on gene expression in blood.

Authors:  Clemens R Scherzer; Aron C Eklund; Lee J Morse; Zhixiang Liao; Joseph J Locascio; Daniel Fefer; Michael A Schwarzschild; Michael G Schlossmacher; Michael A Hauser; Jeffery M Vance; Lewis R Sudarsky; David G Standaert; John H Growdon; Roderick V Jensen; Steven R Gullans
Journal:  Proc Natl Acad Sci U S A       Date:  2007-01-10       Impact factor: 11.205

5.  Grouping pursuit through a regularization solution surface.

Authors:  Xiaotong Shen; Hsin-Cheng Huang
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

6.  Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS.

Authors:  Dan L Nicolae; Eric Gamazon; Wei Zhang; Shiwei Duan; M Eileen Dolan; Nancy J Cox
Journal:  PLoS Genet       Date:  2010-04-01       Impact factor: 5.917

7.  Variable selection and regression analysis for graph-structured covariates with an application to genomics.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Ann Appl Stat       Date:  2010-09-01       Impact factor: 2.083

8.  Incorporating predictor network in penalized regression with application to microarray data.

Authors:  Wei Pan; Benhuai Xie; Xiaotong Shen
Journal:  Biometrics       Date:  2009-07-23       Impact factor: 2.571

9.  Statistical estimation of correlated genome associations to a quantitative trait network.

Authors:  Seyoung Kim; Eric P Xing
Journal:  PLoS Genet       Date:  2009-08-14       Impact factor: 5.917

10.  A Bayesian partition method for detecting pleiotropic and epistatic eQTL modules.

Authors:  Wei Zhang; Jun Zhu; Eric E Schadt; Jun S Liu
Journal:  PLoS Comput Biol       Date:  2010-01-15       Impact factor: 4.475

  9 in total

1.  Feature Grouping and Selection Over an Undirected Graph.

Authors:  Sen Yang; Lei Yuan; Ying-Cheng Lai; Xiaotong Shen; Peter Wonka; Jieping Ye
Journal:  KDD       Date:  2012

2.  Graph-based sparse linear discriminant analysis for high-dimensional classification.

Authors:  Jianyu Liu; Guan Yu; Yufeng Liu
Journal:  J Multivar Anal       Date:  2018-12-17       Impact factor: 1.473

3.  Structural pursuit over multiple undirected graphs.

Authors:  Yunzhang Zhu; Xiaotong Shen; Wei Pan
Journal:  J Am Stat Assoc       Date:  2014-10       Impact factor: 5.033

4.  Sparse Regression Incorporating Graphical Structure among Predictors.

Authors:  Guan Yu; Yufeng Liu
Journal:  J Am Stat Assoc       Date:  2016-08-18       Impact factor: 5.033

5.  Sparse Methods for Biomedical Data.

Authors:  Jieping Ye; Jun Liu
Journal:  SIGKDD Explor       Date:  2012-06-01

6.  Homogeneity Pursuit.

Authors:  Tracy Ke; Jianqing Fan; Yichao Wu
Journal:  J Am Stat Assoc       Date:  2015       Impact factor: 5.033

7.  Penalized regression approaches to testing for quantitative trait-rare variant association.

Authors:  Sunkyung Kim; Wei Pan; Xiaotong Shen
Journal:  Front Genet       Date:  2014-05-13       Impact factor: 4.599

8.  Incorporating Pathway Information into Feature Selection towards Better Performed Gene Signatures (Review).

Authors:  Suyan Tian; Chi Wang; Bing Wang
Journal:  Biomed Res Int       Date:  2019-04-03       Impact factor: 3.411

9.  Provable Convex Co-clustering of Tensors.

Authors:  Eric C Chi; Brian R Gaines; Will Wei Sun; Hua Zhou; Jian Yang
Journal:  J Mach Learn Res       Date:  2020       Impact factor: 5.177
