Literature DB >> 29056876

Fused Lasso Approach in Regression Coefficients Clustering - Learning Parameter Heterogeneity in Data Integration.

Lu Tang1, Peter X K Song1.   

Abstract

As data sets of related studies become more easily accessible, combining data sets of similar studies is often undertaken in practice to achieve a larger sample size and higher power. A major challenge arising from data integration pertains to data heterogeneity in terms of study population, study design, or study coordination. Ignoring such heterogeneity in data analysis may result in biased estimation and misleading inference. Traditional techniques of remedy to data heterogeneity include the use of interactions and random effects, which are inferior to achieving desirable statistical power or providing a meaningful interpretation, especially when a large number of smaller data sets are combined. In this paper, we propose a regularized fusion method that allows us to identify and merge inter-study homogeneous parameter clusters in regression analysis, without the use of hypothesis testing approach. Using the fused lasso, we establish a computationally efficient procedure to deal with large-scale integrated data. Incorporating the estimated parameter ordering in the fused lasso facilitates computing speed with no loss of statistical power. We conduct extensive simulation studies and provide an application example to demonstrate the performance of the new method with a comparison to the conventional methods.

Entities:  

Keywords:  Data integration; Extended BIC; Fused lasso; Generalized Linear Models

Year:  2016        PMID: 29056876      PMCID: PMC5647925     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   3.654


  15 in total

1.  Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease.

Authors:  Kirk E Lohmueller; Celeste L Pearce; Malcolm Pike; Eric S Lander; Joel N Hirschhorn
Journal:  Nat Genet       Date:  2003-01-13       Impact factor: 38.330

2.  Random-effects model for meta-analysis of clinical trials: an update.

Authors:  Rebecca DerSimonian; Raghu Kacker
Journal:  Contemp Clin Trials       Date:  2006-05-12       Impact factor: 2.226

3.  Feature Grouping and Selection Over an Undirected Graph.

Authors:  Sen Yang; Lei Yuan; Ying-Cheng Lai; Xiaotong Shen; Peter Wonka; Jieping Ye
Journal:  KDD       Date:  2012

4.  Adaptive Estimation with Partially Overlapping Models.

Authors:  Sunyoung Shin; Jason Fine; Yufeng Liu
Journal:  Stat Sin       Date:  2016-01       Impact factor: 1.261

5.  Fused lasso with the adaptation of parameter ordering in combining multiple studies with repeated measurements.

Authors:  Fei Wang; Lu Wang; Peter X-K Song
Journal:  Biometrics       Date:  2016-02-22       Impact factor: 2.571

6.  Efficacy and safety of ephedra and ephedrine for weight loss and athletic performance: a meta-analysis.

Authors:  Paul G Shekelle; Mary L Hardy; Sally C Morton; Margaret Maglione; Walter A Mojica; Marika J Suttorp; Shannon L Rhodes; Lara Jungvig; James Gagné
Journal:  JAMA       Date:  2003-03-10       Impact factor: 56.272

7.  Grouping pursuit through a regularization solution surface.

Authors:  Xiaotong Shen; Hsin-Cheng Huang
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

8.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

9.  Multivariate Meta-Analysis of Heterogeneous Studies Using Only Summary Statistics: Efficiency and Robustness.

Authors:  Dungang Liu; Regina Liu; Minge Xie
Journal:  J Am Stat Assoc       Date:  2015       Impact factor: 5.033

10.  Capturing heterogeneity in gene expression studies by surrogate variable analysis.

Authors:  Jeffrey T Leek; John D Storey
Journal:  PLoS Genet       Date:  2007-08-01       Impact factor: 5.917

View more
  5 in total

1.  Fusion Learning Algorithm to Combine Partially Heterogeneous Cox Models.

Authors:  Lu Tang; Ling Zhou; Peter X K Song
Journal:  Comput Stat       Date:  2018-07-17       Impact factor: 1.000

2.  Distributed Simultaneous Inference in Generalized Linear Models via Confidence Distribution.

Authors:  Lu Tang; Ling Zhou; Peter X-K Song
Journal:  J Multivar Anal       Date:  2019-11-28       Impact factor: 1.473

3.  Building a Data Platform for Cross-Country Urban Health Studies: the SALURBAL Study.

Authors:  D Alex Quistberg; Ana V Diez Roux; Usama Bilal; Kari Moore; Ana Ortigoza; Daniel A Rodriguez; Olga L Sarmiento; Patricia Frenz; Amélia Augusta Friche; Waleska Teixeira Caiaffa; Alejandra Vives; J Jaime Miranda
Journal:  J Urban Health       Date:  2019-04       Impact factor: 3.671

4.  Early Life Exposure in Mexico to ENvironmental Toxicants (ELEMENT) Project.

Authors:  Wei Perng; Marcela Tamayo-Ortiz; Lu Tang; Brisa N Sánchez; Alejandra Cantoral; John D Meeker; Dana C Dolinoy; Elizabeth F Roberts; Esperanza Angeles Martinez-Mier; Hector Lamadrid-Figueroa; Peter X K Song; Adrienne S Ettinger; Robert Wright; Manish Arora; Lourdes Schnaas; Deborah J Watkins; Jaclyn M Goodrich; Robin C Garcia; Maritsa Solano-Gonzalez; Luis F Bautista-Arredondo; Adriana Mercado-Garcia; Howard Hu; Mauricio Hernandez-Avila; Martha Maria Tellez-Rojo; Karen E Peterson
Journal:  BMJ Open       Date:  2019-08-26       Impact factor: 2.692

5.  Meta-Analyzing Multiple Omics Data With Robust Variable Selection.

Authors:  Zongliang Hu; Yan Zhou; Tiejun Tong
Journal:  Front Genet       Date:  2021-07-05       Impact factor: 4.599

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.