
A significance test for graph-constrained estimation.

Sen Zhao, Ali Shojaie.

Abstract

Graph-constrained estimation methods encourage similarities among neighboring covariates presented as nodes of a graph, and can result in more accurate estimates, especially in high-dimensional settings. Variable selection approaches can then be utilized to select a subset of variables that are associated with the response. However, existing procedures do not provide measures of uncertainty of estimates. Further, the vast majority of existing approaches assume that the available graph accurately captures the association among covariates; violations of this assumption could severely hurt the reliability of the resulting estimates. In this article, we present a new inference framework, called the Grace test, which produces coefficient estimates and corresponding p-values by incorporating the external graph information. We show, both theoretically and via numerical studies, that the proposed method asymptotically controls the type-I error rate regardless of the choice of the graph. We also show that when the underlying graph is informative, the Grace test is asymptotically more powerful than similar tests that ignore the external information. We study the power properties of the proposed test when the graph is not fully informative and develop a more powerful Grace-ridge test for such settings. Our numerical studies show that as long as the graph is reasonably informative, the proposed inference procedures deliver improved statistical power over existing methods that ignore external information.
© 2015, The International Biometric Society.
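
Illustrative sketch (not from the article): the abstract does not state the estimator or the test statistic, so the code below only illustrates the general idea of graph-constrained estimation, using a generic Laplacian-penalized ridge fit. It is an assumption-laden toy, not the Grace test, and it produces no p-values. The function names `graph_laplacian` and `laplacian_penalized_fit`, the toy graph, and the tuning values `lam_graph` and `lam_ridge` are all hypothetical choices made for this example.

```python
import numpy as np


def graph_laplacian(adjacency):
    """Unnormalized Laplacian L = D - A of an undirected graph."""
    adjacency = np.asarray(adjacency, dtype=float)
    return np.diag(adjacency.sum(axis=1)) - adjacency


def laplacian_penalized_fit(X, y, laplacian, lam_graph=1.0, lam_ridge=0.1):
    """Minimize ||y - X b||^2 + lam_graph * b'Lb + lam_ridge * ||b||^2.

    The term b'Lb equals the sum over graph edges (j, k) of (b_j - b_k)^2,
    so it pulls coefficients of connected covariates toward each other.
    This is a generic sketch, not the estimator defined in the article.
    """
    p = X.shape[1]
    gram = X.T @ X + lam_graph * laplacian + lam_ridge * np.eye(p)
    return np.linalg.solve(gram, X.T @ y)


# Toy data (assumed for illustration): covariates 0, 1, 2 form a triangle
# in the graph and share the same true coefficient; 3, 4, 5 are isolated.
rng = np.random.default_rng(0)
n, p = 30, 6
A = np.zeros((p, p))
for j, k in [(0, 1), (1, 2), (0, 2)]:
    A[j, k] = A[k, j] = 1.0
L = graph_laplacian(A)

beta_true = np.array([1.0, 1.0, 1.0, 0.0, 0.0, 0.0])
X = rng.standard_normal((n, p))
y = X @ beta_true + rng.standard_normal(n)

# Fit with and without the graph penalty to compare coefficient smoothness.
beta_graph = laplacian_penalized_fit(X, y, L, lam_graph=5.0, lam_ridge=0.1)
beta_plain = laplacian_penalized_fit(X, y, L, lam_graph=0.0, lam_ridge=0.1)

print("graph penalty of graph-constrained fit:", beta_graph @ L @ beta_graph)
print("graph penalty of plain ridge fit:      ", beta_plain @ L @ beta_plain)
```

With the graph penalty switched on, the quantity b'Lb of the fitted coefficients can be no larger than for the plain ridge fit, which is the "similarity among neighboring covariates" that the abstract describes. Whether that smoothing helps depends on how informative the graph is, which is the question the article's inference framework addresses.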


Keywords:  Biological networks; Graph-constrained estimation; High-dimensional data; Significance test; Variable selection


Year:  2015        PMID: 26393533      PMCID: PMC4828333          DOI: 10.1111/biom.12418

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


