Literature DB >> 27217599

Variable Selection with Prior Information for Generalized Linear Models via the Prior LASSO Method.

Yuan Jiang1, Yunxiao He1, Heping Zhang1.   

Abstract

LASSO is a popular statistical tool often used in conjunction with generalized linear models that can simultaneously select variables and estimate parameters. When there are many variables of interest, as in current biological and biomedical studies, the power of LASSO can be limited. Fortunately, so much biological and biomedical data have been collected and they may contain useful information about the importance of certain variables. This paper proposes an extension of LASSO, namely, prior LASSO (pLASSO), to incorporate that prior information into penalized generalized linear models. The goal is achieved by adding in the LASSO criterion function an additional measure of the discrepancy between the prior information and the model. For linear regression, the whole solution path of the pLASSO estimator can be found with a procedure similar to the Least Angle Regression (LARS). Asymptotic theories and simulation results show that pLASSO provides significant improvement over LASSO when the prior information is relatively accurate. When the prior information is less reliable, pLASSO shows great robustness to the misspecification. We illustrate the application of pLASSO using a real data set from a genome-wide association study.

Entities:  

Keywords:  Asymptotic efficiency; Oracle inequalities; Solution path; Weak oracle property

Year:  2016        PMID: 27217599      PMCID: PMC4874534          DOI: 10.1080/01621459.2015.1008363

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  19 in total

1.  Non-Concave Penalized Likelihood with NP-Dimensionality.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  IEEE Trans Inf Theory       Date:  2011-08       Impact factor: 2.501

Review 2.  Genetics of affective (mood) disorders.

Authors:  Nick Craddock; Liz Forty
Journal:  Eur J Hum Genet       Date:  2006-06       Impact factor: 4.246

3.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

4.  An ordinary differential equation based solution path algorithm.

Authors:  Yichao Wu
Journal:  J Nonparametr Stat       Date:  2011       Impact factor: 1.231

5.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

Review 6.  Meta-analysis in genome-wide association studies.

Authors:  Eleftheria Zeggini; John P A Ioannidis
Journal:  Pharmacogenomics       Date:  2009-02       Impact factor: 2.533

7.  Genome-wide association and meta-analysis of bipolar disorder in individuals of European ancestry.

Authors:  Laura J Scott; Pierandrea Muglia; Xiangyang Q Kong; Weihua Guan; Matthew Flickinger; Ruchi Upmanyu; Federica Tozzi; Jun Z Li; Margit Burmeister; Devin Absher; Robert C Thompson; Clyde Francks; Fan Meng; Athos Antoniades; Audrey M Southwick; Alan F Schatzberg; William E Bunney; Jack D Barchas; Edward G Jones; Richard Day; Keith Matthews; Peter McGuffin; John S Strauss; James L Kennedy; Lefkos Middleton; Allen D Roses; Stanley J Watson; John B Vincent; Richard M Myers; Ann E Farmer; Huda Akil; Daniel K Burns; Michael Boehnke
Journal:  Proc Natl Acad Sci U S A       Date:  2009-04-28       Impact factor: 11.205

8.  Findings from bipolar disorder genome-wide association studies replicate in a Finnish bipolar family-cohort.

Authors:  H M Ollila; P Soronen; K Silander; O M Palo; T Kieseppä; M A Kaunisto; J Lönnqvist; L Peltonen; T Partonen; T Paunio
Journal:  Mol Psychiatry       Date:  2009-04       Impact factor: 15.992

9.  An open access database of genome-wide association results.

Authors:  Andrew D Johnson; Christopher J O'Donnell
Journal:  BMC Med Genet       Date:  2009-01-22       Impact factor: 2.103

10.  Collaborative genome-wide association analysis supports a role for ANK3 and CACNA1C in bipolar disorder.

Authors:  Manuel A R Ferreira; Michael C O'Donovan; Yan A Meng; Ian R Jones; Douglas M Ruderfer; Lisa Jones; Jinbo Fan; George Kirov; Roy H Perlis; Elaine K Green; Jordan W Smoller; Detelina Grozeva; Jennifer Stone; Ivan Nikolov; Kimberly Chambert; Marian L Hamshere; Vishwajit L Nimgaonkar; Valentina Moskvina; Michael E Thase; Sian Caesar; Gary S Sachs; Jennifer Franklin; Katherine Gordon-Smith; Kristin G Ardlie; Stacey B Gabriel; Christine Fraser; Brendan Blumenstiel; Matthew Defelice; Gerome Breen; Michael Gill; Derek W Morris; Amanda Elkin; Walter J Muir; Kevin A McGhee; Richard Williamson; Donald J MacIntyre; Alan W MacLean; Clair David St; Michelle Robinson; Margaret Van Beck; Ana C P Pereira; Radhika Kandaswamy; Andrew McQuillin; David A Collier; Nicholas J Bass; Allan H Young; Jacob Lawrence; I Nicol Ferrier; Adebayo Anjorin; Anne Farmer; David Curtis; Edward M Scolnick; Peter McGuffin; Mark J Daly; Aiden P Corvin; Peter A Holmans; Douglas H Blackwood; Hugh M Gurling; Michael J Owen; Shaun M Purcell; Pamela Sklar; Nick Craddock
Journal:  Nat Genet       Date:  2008-09       Impact factor: 38.330

View more
  19 in total

1.  Consistent Estimation of Generalized Linear Models with High Dimensional Predictors via Stepwise Regression.

Authors:  Alex Pijyan; Qi Zheng; Hyokyoung G Hong; Yi Li
Journal:  Entropy (Basel)       Date:  2020-08-31       Impact factor: 2.524

2.  Assisted graphical model for gene expression data analysis.

Authors:  Xinyan Fan; Kuangnan Fang; Shuangge Ma; Shuaichao Wang; Qingzhao Zhang
Journal:  Stat Med       Date:  2019-03-10       Impact factor: 2.373

3.  Borrowing Strength and Borrowing Index for Bayesian Hierarchical Models.

Authors:  Ganggang Xu; Huirong Zhu; J Jack Lee
Journal:  Comput Stat Data Anal       Date:  2020-04       Impact factor: 1.681

4.  Identifying gene-environment interactions incorporating prior information.

Authors:  Xiaoyan Wang; Yonghong Xu; Shuangge Ma
Journal:  Stat Med       Date:  2019-01-13       Impact factor: 2.373

5.  Conditional screening for ultra-high dimensional covariates with survival outcomes.

Authors:  Hyokyoung G Hong; Jian Kang; Yi Li
Journal:  Lifetime Data Anal       Date:  2016-12-08       Impact factor: 1.588

6.  Regularized estimation in sparse high-dimensional multivariate regression, with application to a DNA methylation study.

Authors:  Haixiang Zhang; Yinan Zheng; Grace Yoon; Zhou Zhang; Tao Gao; Brian Joyce; Wei Zhang; Joel Schwartz; Pantel Vokonas; Elena Colicino; Andrea Baccarelli; Lifang Hou; Lei Liu
Journal:  Stat Appl Genet Mol Biol       Date:  2017-07-26

7.  Network-based cancer heterogeneity analysis incorporating multi-view of prior information.

Authors:  Yang Li; Shaodong Xu; Shuangge Ma; Mengyun Wu
Journal:  Bioinformatics       Date:  2022-05-13       Impact factor: 6.931

8.  GEInfo: an R package for gene-environment interaction analysis incorporating prior information.

Authors:  Xiaoyan Wang; Hongduo Liu; Shuangge Ma
Journal:  Bioinformatics       Date:  2022-04-29       Impact factor: 6.931

9.  Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes.

Authors:  Britta Velten; Wolfgang Huber
Journal:  Biostatistics       Date:  2021-04-10       Impact factor: 5.899

10.  Information-incorporated Gaussian graphical model for gene expression data.

Authors:  Huangdi Yi; Qingzhao Zhang; Cunjie Lin; Shuangge Ma
Journal:  Biometrics       Date:  2021-02-12       Impact factor: 1.701

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.