Homogeneity Pursuit.

Tracy Ke, Jianqing Fan, Yichao Wu.

Abstract

This paper explores the homogeneity of coefficients in high-dimensional regression, which extends the sparsity concept and is more general and suitable for many applications. Homogeneity arises when regression coefficients corresponding to neighboring geographical regions or a similar cluster of covariates are expected to be approximately the same. Sparsity corresponds to a special case of homogeneity with a large cluster of known atom zero. In this article, we propose a new method called clustering algorithm in regression via data-driven segmentation (CARDS) to explore homogeneity. New mathematics are provided on the gain that can be achieved by exploring homogeneity. Statistical properties of two versions of CARDS are analyzed. In particular, the asymptotic normality of our proposed CARDS estimator is established, which reveals better estimation accuracy for homogeneous parameters than that without homogeneity exploration. When our methods are combined with sparsity exploration, further efficiency can be achieved beyond the exploration of sparsity alone. This provides additional insights into the power of exploring low-dimensional structures in high-dimensional regression: homogeneity and sparsity. Our results also shed light on the properties of the fused Lasso. The newly developed method is further illustrated by simulation studies and applications to real data. Supplementary materials for this article are available online.
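
As a rough illustration of the gain the abstract describes, the sketch below compares ordinary least squares with a least-squares fit that forces coefficients in the same group to be equal. It is not the CARDS algorithm: CARDS estimates the grouping from the data, whereas this sketch assumes the grouping is known in advance (an oracle). The function names, group sizes, coefficient values, and noise level are all hypothetical choices made for the example.

```python
import numpy as np


def simulate(n=200, group_sizes=(10, 10, 10), group_values=(0.0, 1.0, -2.0),
             noise=1.0, seed=0):
    """Simulate a linear model whose coefficients take one value per group."""
    rng = np.random.default_rng(seed)
    # labels[j] is the (assumed known) group index of coefficient j.
    labels = np.repeat(np.arange(len(group_sizes)), group_sizes)
    beta = np.asarray(group_values, dtype=float)[labels]
    X = rng.standard_normal((n, labels.size))
    y = X @ beta + noise * rng.standard_normal(n)
    return X, y, beta, labels


def ols(X, y):
    """Ordinary least squares without any homogeneity constraint."""
    return np.linalg.lstsq(X, y, rcond=None)[0]


def oracle_grouped_ols(X, y, labels):
    """Least squares with coefficients forced to be equal within each group.

    Summing the covariates of a group gives one column per group; regressing
    on those columns is equivalent to the equality-constrained fit.
    """
    n_groups = labels.max() + 1
    Z = np.column_stack([X[:, labels == k].sum(axis=1) for k in range(n_groups)])
    gamma = ols(Z, y)       # one coefficient per group
    return gamma[labels]    # expand back to one coefficient per covariate


X, y, beta, labels = simulate()
b_ols = ols(X, y)
b_grouped = oracle_grouped_ols(X, y, labels)

# Squared error of the fitted mean X b relative to the true mean X beta.
fit_err_ols = np.sum((X @ (b_ols - beta)) ** 2)
fit_err_grouped = np.sum((X @ (b_grouped - beta)) ** 2)

# Squared error of the coefficient estimates themselves.
coef_err_ols = np.sum((b_ols - beta) ** 2)
coef_err_grouped = np.sum((b_grouped - beta) ** 2)

print("fitted-mean error, OLS vs grouped:", fit_err_ols, fit_err_grouped)
print("coefficient error, OLS vs grouped:", coef_err_ols, coef_err_grouped)
```

When the assumed grouping is correct, the grouped fit estimates 3 free parameters instead of 30, so its fitted-mean error cannot exceed that of unconstrained OLS. Sparsity fits the same picture as the case where one large group is known to have the value zero.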

Keywords:  clustering; homogeneity; sparsity

Year:  2015        PMID: 26085701      PMCID: PMC4465377          DOI: 10.1080/01621459.2014.892882

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  15 in total

1.  Non-Concave Penalized Likelihood with NP-Dimensionality.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  IEEE Trans Inf Theory       Date:  2011-08       Impact factor: 2.501

2.  Averaged gene expressions for regression.

Authors:  Mee Young Park; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2006-05-11       Impact factor: 5.899

3.  Feature Grouping and Selection Over an Undirected Graph.

Authors:  Sen Yang; Lei Yuan; Ying-Cheng Lai; Xiaotong Shen; Peter Wonka; Jieping Ye
Journal:  KDD       Date:  2012

4.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space".

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

5.  Grouping pursuit through a regularization solution surface.

Authors:  Xiaotong Shen; Hsin-Cheng Huang
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

6.  Sparse High Dimensional Models in Economics.

Authors:  Jianqing Fan; Jinchi Lv; Lei Qi
Journal:  Annu Rev Econom       Date:  2011-09

7.  An in-silico method for prediction of polyadenylation signals in human sequences.

Authors:  Huiqing Liu; Hao Han; Jinyan Li; Limsoon Wong
Journal:  Genome Inform       Date:  2003

8.  Simultaneous grouping pursuit and feature selection over an undirected graph.

Authors:  Yunzhang Zhu; Xiaotong Shen; Wei Pan
Journal:  J Am Stat Assoc       Date:  2013-01-01       Impact factor: 5.033

9.  VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Ann Appl Stat       Date:  2010-09-01       Impact factor: 2.083

10.  Statistical estimation of correlated genome associations to a quantitative trait network.

Authors:  Seyoung Kim; Eric P Xing
Journal:  PLoS Genet       Date:  2009-08-14       Impact factor: 5.917

  4 in total

1.  Fused lasso with the adaptation of parameter ordering in combining multiple studies with repeated measurements.

Authors:  Fei Wang; Lu Wang; Peter X-K Song
Journal:  Biometrics       Date:  2016-02-22       Impact factor: 2.571

2.  Statistical modelling of citation exchange between statistics journals.

Authors:  Cristiano Varin; Manuela Cattelan; David Firth
Journal:  J R Stat Soc Ser A Stat Soc       Date:  2015-11-03       Impact factor: 2.483

3.  Mixed-Effect Time-Varying Network Model and Application in Brain Connectivity Analysis.

Authors:  Jingfei Zhang; Will Wei Sun; Lexin Li
Journal:  J Am Stat Assoc       Date:  2019-11-05       Impact factor: 5.033

4.  Fused Lasso Approach in Regression Coefficients Clustering - Learning Parameter Heterogeneity in Data Integration.

Authors:  Lu Tang; Peter X K Song
Journal:  J Mach Learn Res       Date:  2016       Impact factor: 3.654
