
Grouping pursuit through a regularization solution surface.

Xiaotong Shen, Hsin-Cheng Huang.   

Abstract

Extracting grouping structure, or identifying homogeneous subgroups of predictors in regression, is crucial for high-dimensional data analysis. A low-dimensional structure, in particular grouping, when captured in a regression model, enhances predictive performance and facilitates the model's interpretability. Grouping pursuit extracts homogeneous subgroups of predictors most responsible for outcomes of a response. This is the case in gene network analysis, where grouping reveals gene functionalities with regard to progression of a disease. To address challenges in grouping pursuit, we introduce a novel homotopy method for computing an entire solution surface through regularization involving a piecewise linear penalty. This nonconvex and overcomplete penalty permits adaptive grouping and nearly unbiased estimation, and is treated with a novel concept of grouped subdifferentials and difference convex programming for efficient computation. Finally, the proposed method not only achieves high performance as suggested by numerical analysis, but also has the desired optimality with regard to grouping pursuit and prediction, as shown by our theoretical results.
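The abstract does not state the form of the piecewise linear penalty. As a rough illustration only, the sketch below assumes a truncated absolute-value penalty, min(|z|, tau), applied to every pairwise difference of coefficients. The function names and the threshold `tau` are hypothetical, and this is not the homotopy algorithm itself. It shows why such a penalty is nonconvex yet can be written as a difference of two convex functions, which is the property that difference convex programming relies on.

```python
from itertools import combinations


def truncated_abs(z, tau):
    """Assumed penalty shape: equals |z| up to tau, then stays flat (nonconvex)."""
    return min(abs(z), tau)


def grouping_penalty(beta, tau):
    """Sum the truncated penalty over all pairs of coefficients.

    Pairs with equal coefficients contribute nothing, and pairs that are far
    apart contribute only tau, so well-separated groups are not shrunk further.
    """
    return sum(truncated_abs(bj - bk, tau) for bj, bk in combinations(beta, 2))


def dc_parts(z, tau):
    """Difference-of-convex split: min(|z|, tau) = |z| - max(|z| - tau, 0)."""
    return abs(z), max(abs(z) - tau, 0.0)


# Hypothetical coefficients: one exact group (1.0, 1.0) and two nearby values.
beta = [1.0, 1.0, 3.0, 3.5]
penalty = grouping_penalty(beta, tau=1.0)

convex_part, subtracted_part = dc_parts(2.5, tau=1.0)
```

Under these assumptions, the pair (1.0, 1.0) adds zero, the pair (3.0, 3.5) adds its full difference, and every pair further apart than `tau` adds only `tau`. That cap is what keeps estimates of well-separated groups nearly unbiased.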


Year:  2010        PMID: 20689721      PMCID: PMC2913333          DOI: 10.1198/jasa.2010.tm09380

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  6 in total

1.  Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR.

Authors:  Howard D Bondell; Brian J Reich
Journal:  Biometrics       Date:  2007-06-30       Impact factor: 2.571

2.  Network-constrained regularization and variable selection for analysis of genomic data.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Bioinformatics       Date:  2008-03-01       Impact factor: 6.937

3.  Adaptive regularization using the entire solution surface.

Authors:  S Wu; X Shen; C J Geyer
Journal:  Biometrika       Date:  2009-09       Impact factor: 2.445

4.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer.

Authors:  Yixin Wang; Jan G M Klijn; Yi Zhang; Anieta M Sieuwerts; Maxime P Look; Fei Yang; Dmitri Talantov; Mieke Timmermans; Marion E Meijer-van Gelder; Jack Yu; Tim Jatkoe; Els M J J Berns; David Atkins; John A Foekens
Journal:  Lancet       Date:  2005 Feb 19-25       Impact factor: 79.321

5.  Focus on the p53 gene and cancer: advances in TP53 mutation research.

Authors:  Thierry Soussi
Journal:  Hum Mutat       Date:  2003-03       Impact factor: 4.878

6.  Network-based classification of breast cancer metastasis.

Authors:  Han-Yu Chuang; Eunjung Lee; Yu-Tsueng Liu; Doheon Lee; Trey Ideker
Journal:  Mol Syst Biol       Date:  2007-10-16       Impact factor: 11.429

  9 in total

1.  Simultaneous supervised clustering and feature selection over a graph.

Authors:  Xiaotong Shen; Hsin-Cheng Huang; Wei Pan
Journal:  Biometrika       Date:  2012-10-18       Impact factor: 2.445

2.  Feature Grouping and Selection Over an Undirected Graph.

Authors:  Sen Yang; Lei Yuan; Ying-Cheng Lai; Xiaotong Shen; Peter Wonka; Jieping Ye
Journal:  KDD       Date:  2012

3.  Cluster Analysis: Unsupervised Learning via Supervised Learning with a Non-convex Penalty.

Authors:  Wei Pan; Xiaotong Shen; Binghui Liu
Journal:  J Mach Learn Res       Date:  2013-07-01       Impact factor: 3.654

4.  Network-based penalized regression with application to genomic data.

Authors:  Sunkyung Kim; Wei Pan; Xiaotong Shen
Journal:  Biometrics       Date:  2013-07-03       Impact factor: 2.571

5.  Homogeneity Pursuit.

Authors:  Tracy Ke; Jianqing Fan; Yichao Wu
Journal:  J Am Stat Assoc       Date:  2015       Impact factor: 5.033

6.  Simultaneous grouping pursuit and feature selection over an undirected graph.

Authors:  Yunzhang Zhu; Xiaotong Shen; Wei Pan
Journal:  J Am Stat Assoc       Date:  2013-01-01       Impact factor: 5.033

7.  Fused Lasso Approach in Regression Coefficients Clustering - Learning Parameter Heterogeneity in Data Integration.

Authors:  Lu Tang; Peter X K Song
Journal:  J Mach Learn Res       Date:  2016       Impact factor: 3.654

8.  Penalized regression approaches to testing for quantitative trait-rare variant association.

Authors:  Sunkyung Kim; Wei Pan; Xiaotong Shen
Journal:  Front Genet       Date:  2014-05-13       Impact factor: 4.599

9.  Provable Convex Co-clustering of Tensors.

Authors:  Eric C Chi; Brian R Gaines; Will Wei Sun; Hua Zhou; Jian Yang
Journal:  J Mach Learn Res       Date:  2020       Impact factor: 5.177

