Literature DB >> 22904608

A Sparse Structured Shrinkage Estimator for Nonparametric Varying-Coefficient Model with an Application in Genomics.

Z John Daye1, Jichun Xie, Hongzhe Li.   

Abstract

Many problems in genomics are related to variable selection where high-dimensional genomic data are treated as covariates. Such genomic covariates often have certain structures and can be represented as vertices of an undirected graph. Biological processes also vary as functions depending upon some biological state, such as time. High-dimensional variable selection where covariates are graph-structured and underlying model is nonparametric presents an important but largely unaddressed statistical challenge. Motivated by the problem of regression-based motif discovery, we consider the problem of variable selection for high-dimensional nonparametric varying-coefficient models and introduce a sparse structured shrinkage (SSS) estimator based on basis function expansions and a novel smoothed penalty function. We present an efficient algorithm for computing the SSS estimator. Results on model selection consistency and estimation bounds are derived. Moreover, finite-sample performances are studied via simulations, and the effects of high-dimensionality and structural information of the covariates are especially highlighted. We apply our method to motif finding problem using a yeast cell-cycle gene expression dataset and word counts in genes' promoter sequences. Our results demonstrate that the proposed method can result in better variable selection and prediction for high-dimensional regression when the underlying model is nonparametric and covariates are structured. Supplemental materials for the article are available online.

Entities:  

Year:  2012        PMID: 22904608      PMCID: PMC3419598          DOI: 10.1198/jcgs.2011.10102

Source DB:  PubMed          Journal:  J Comput Graph Stat        ISSN: 1061-8600            Impact factor:   2.302


  16 in total

1.  Regulatory element detection using correlation with expression.

Authors:  H J Bussemaker; H Li; E D Siggia
Journal:  Nat Genet       Date:  2001-02       Impact factor: 38.330

2.  Structural analysis of conserved base pairs in protein-DNA complexes.

Authors:  Leonid A Mirny; Mikhail S Gelfand
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

3.  Network-constrained regularization and variable selection for analysis of genomic data.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Bioinformatics       Date:  2008-03-01       Impact factor: 6.937

4.  Group SCAD regression analysis for microarray time course gene expression data.

Authors:  Lifeng Wang; Guang Chen; Hongzhe Li
Journal:  Bioinformatics       Date:  2007-04-26       Impact factor: 6.937

5.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

6.  Covariance-regularized regression and classification for high-dimensional problems.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2009-02-20       Impact factor: 4.488

7.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

8.  VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Ann Appl Stat       Date:  2010-09-01       Impact factor: 2.083

9.  Position specific variation in the rate of evolution in transcription factor binding sites.

Authors:  Alan M Moses; Derek Y Chiang; Manolis Kellis; Eric S Lander; Michael B Eisen
Journal:  BMC Evol Biol       Date:  2003-08-28       Impact factor: 3.260

10.  Detecting DNA regulatory motifs by incorporating positional trends in information content.

Authors:  Katherina J Kechris; Erik van Zwet; Peter J Bickel; Michael B Eisen
Journal:  Genome Biol       Date:  2004-06-24       Impact factor: 13.583

View more
  2 in total

1.  White matter microstructural abnormalities and default network degeneration are associated with early memory deficit in Alzheimer's disease continuum.

Authors:  Fang Ji; Ofer Pasternak; Kwun Kei Ng; Joanna Su Xian Chong; Siwei Liu; Liwen Zhang; Hee Youn Shim; Yng Miin Loke; Boon Yeow Tan; Narayanaswamy Venketasubramanian; Christopher Li-Hsian Chen; Juan Helen Zhou
Journal:  Sci Rep       Date:  2019-03-18       Impact factor: 4.379

2.  Stage-dependent differential influence of metabolic and structural networks on memory across Alzheimer's disease continuum.

Authors:  Kok Pin Ng; Xing Qian; Kwun Kei Ng; Fang Ji; Pedro Rosa-Neto; Serge Gauthier; Nagaendran Kandiah; Juan Helen Zhou
Journal:  Elife       Date:  2022-09-02       Impact factor: 8.713

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.