Literature DB >> 26392642

Regularization Methods for High-Dimensional Instrumental Variables Regression With an Application to Genetical Genomics.

Wei Lin1, Rui Feng1, Hongzhe Li1.   

Abstract

In genetical genomics studies, it is important to jointly analyze gene expression data and genetic variants in exploring their associations with complex traits, where the dimensionality of gene expressions and genetic variants can both be much larger than the sample size. Motivated by such modern applications, we consider the problem of variable selection and estimation in high-dimensional sparse instrumental variables models. To overcome the difficulty of high dimensionality and unknown optimal instruments, we propose a two-stage regularization framework for identifying and estimating important covariate effects while selecting and estimating optimal instruments. The methodology extends the classical two-stage least squares estimator to high dimensions by exploiting sparsity using sparsity-inducing penalty functions in both stages. The resulting procedure is efficiently implemented by coordinate descent optimization. For the representative L1 regularization and a class of concave regularization methods, we establish estimation, prediction, and model selection properties of the two-stage regularized estimators in the high-dimensional setting where the dimensionality of co-variates and instruments are both allowed to grow exponentially with the sample size. The practical performance of the proposed method is evaluated by simulation studies and its usefulness is illustrated by an analysis of mouse obesity data. Supplementary materials for this article are available online.

Entities:  

Keywords:  Causal inference; Confounding; Endogeneity; Sparse regression; Two-stage least squares; Variable selection

Year:  2015        PMID: 26392642      PMCID: PMC4573639          DOI: 10.1080/01621459.2014.908125

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  28 in total

1.  A large-sample QTL study in mice: I. Growth.

Authors:  Joao L Rocha; Eugene J Eisen; L Dale Van Vleck; Daniel Pomp
Journal:  Mamm Genome       Date:  2004-02       Impact factor: 2.957

2.  Expression quantitative trait loci: replication, tissue- and sex-specificity in mice.

Authors:  Atila van Nas; Leslie Ingram-Drake; Janet S Sinsheimer; Susanna S Wang; Eric E Schadt; Thomas Drake; Aldons J Lusis
Journal:  Genetics       Date:  2010-05-03       Impact factor: 4.562

3.  On the robustness of the adaptive lasso to model misspecification.

Authors:  W Lu; Y Goldberg; J P Fine
Journal:  Biometrika       Date:  2012-07-11       Impact factor: 2.445

4.  A Selective Overview of Variable Selection in High Dimensional Feature Space.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  Stat Sin       Date:  2010-01       Impact factor: 1.261

5.  Genetics of gene expression and its effect on disease.

Authors:  Valur Emilsson; Gudmar Thorleifsson; Bin Zhang; Amy S Leonardson; Florian Zink; Jun Zhu; Sonia Carlson; Agnar Helgason; G Bragi Walters; Steinunn Gunnarsdottir; Magali Mouy; Valgerdur Steinthorsdottir; Gudrun H Eiriksdottir; Gyda Bjornsdottir; Inga Reynisdottir; Daniel Gudbjartsson; Anna Helgadottir; Aslaug Jonasdottir; Adalbjorg Jonasdottir; Unnur Styrkarsdottir; Solveig Gretarsdottir; Kristinn P Magnusson; Hreinn Stefansson; Ragnheidur Fossdal; Kristleifur Kristjansson; Hjortur G Gislason; Tryggvi Stefansson; Bjorn G Leifsson; Unnur Thorsteinsdottir; John R Lamb; Jeffrey R Gulcher; Marc L Reitman; Augustine Kong; Eric E Schadt; Kari Stefansson
Journal:  Nature       Date:  2008-03-16       Impact factor: 49.962

Review 6.  Gastrointestinal satiety signals IV. Apolipoprotein A-IV.

Authors:  Patrick Tso; William Sun; Min Liu
Journal:  Am J Physiol Gastrointest Liver Physiol       Date:  2004-06       Impact factor: 4.052

7.  Plasma glycosylphosphatidylinositol-specific phospholipase D predicts the change in insulin sensitivity in response to a low-fat but not a low-carbohydrate diet in obese women.

Authors:  Dona L Gray; Kevin D O'Brien; David A D'Alessio; Bonnie J Brehm; Mark A Deeg
Journal:  Metabolism       Date:  2008-04       Impact factor: 8.694

8.  The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse.

Authors:  Janan T Eppig; Judith A Blake; Carol J Bult; James A Kadin; Joel E Richardson
Journal:  Nucleic Acids Res       Date:  2011-11-10       Impact factor: 16.971

9.  Patterns of cis regulatory variation in diverse human populations.

Authors:  Barbara E Stranger; Stephen B Montgomery; Antigone S Dimas; Leopold Parts; Oliver Stegle; Catherine E Ingle; Magda Sekowska; George Davey Smith; David Evans; Maria Gutierrez-Arcelus; Alkes Price; Towfique Raj; James Nisbett; Alexandra C Nica; Claude Beazley; Richard Durbin; Panos Deloukas; Emmanouil T Dermitzakis
Journal:  PLoS Genet       Date:  2012-04-19       Impact factor: 5.917

10.  Genetic and genomic analysis of a fat mass trait with complex inheritance reveals marked sex specificity.

Authors:  Susanna Wang; Nadir Yehya; Eric E Schadt; Hui Wang; Thomas A Drake; Aldons J Lusis
Journal:  PLoS Genet       Date:  2006-02-03       Impact factor: 5.917

View more
  14 in total

1.  Integrative analysis of genetical genomics data incorporating network structures.

Authors:  Bin Gao; Xu Liu; Hongzhe Li; Yuehua Cui
Journal:  Biometrics       Date:  2019-04-29       Impact factor: 2.571

2.  Outcome-adaptive lasso: Variable selection for causal inference.

Authors:  Susan M Shortreed; Ashkan Ertefaie
Journal:  Biometrics       Date:  2017-03-08       Impact factor: 2.571

3.  Causal Genetic Inference Using Haplotypes as Instrumental Variables.

Authors:  Fan Wang; Nuala J Meyer; Keith R Walley; James A Russell; Rui Feng
Journal:  Genet Epidemiol       Date:  2015-12-01       Impact factor: 2.135

4.  A directed learning strategy integrating multiple omic data improves genomic prediction.

Authors:  Xuehai Hu; Weibo Xie; Chengchao Wu; Shizhong Xu
Journal:  Plant Biotechnol J       Date:  2019-04-14       Impact factor: 9.803

5.  Regularized estimation in sparse high-dimensional multivariate regression, with application to a DNA methylation study.

Authors:  Haixiang Zhang; Yinan Zheng; Grace Yoon; Zhou Zhang; Tao Gao; Brian Joyce; Wei Zhang; Joel Schwartz; Pantel Vokonas; Elena Colicino; Andrea Baccarelli; Lifang Hou; Lei Liu
Journal:  Stat Appl Genet Mol Biol       Date:  2017-07-26

6.  DOUBLY DEBIASED LASSO: HIGH-DIMENSIONAL INFERENCE UNDER HIDDEN CONFOUNDING.

Authors:  Zijian Guo; Domagoj Ćevid; Peter Bühlmann
Journal:  Ann Stat       Date:  2022-06-16       Impact factor: 4.904

Review 7.  Statistical methods for Mendelian randomization in genome-wide association studies: A review.

Authors:  Frederick J Boehm; Xiang Zhou
Journal:  Comput Struct Biotechnol J       Date:  2022-05-14       Impact factor: 6.155

8.  An efficient and robust approach to Mendelian randomization with measured pleiotropic effects in a high-dimensional setting.

Authors:  Andrew J Grant; Stephen Burgess
Journal:  Biostatistics       Date:  2022-04-13       Impact factor: 5.899

9.  On the use of kernel machines for Mendelian randomization.

Authors:  Weiming Zhang; Debashis Ghosh
Journal:  Quant Biol       Date:  2017-11-20

10.  2D association and integrative omics analysis in rice provides systems biology view in trait analysis.

Authors:  Wenchao Zhang; Xinbin Dai; Shizhong Xu; Patrick X Zhao
Journal:  Commun Biol       Date:  2018-09-27
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.