Literature DB >> 25258655

Accounting for linkage disequilibrium in genome-wide association studies: A penalized regression method.

Jin Liu1, Kai Wang2, Shuangge Ma1, Jian Huang3.   

Abstract

Penalized regression methods are becoming increasingly popular in genome-wide association studies (GWAS) for identifying genetic markers associated with disease. However, standard penalized methods such as LASSO do not take into account the possible linkage disequilibrium between adjacent markers. We propose a novel penalized approach for GWAS using a dense set of single nucleotide polymorphisms (SNPs). The proposed method uses the minimax concave penalty (MCP) for marker selection and incorporates linkage disequilibrium (LD) information by penalizing the difference of the genetic effects at adjacent SNPs with high correlation. A coordinate descent algorithm is derived to implement the proposed method. This algorithm is efficient in dealing with a large number of SNPs. A multi-split method is used to calculate the p-values of the selected SNPs for assessing their significance. We refer to the proposed penalty function as the smoothed MCP and the proposed approach as the SMCP method. Performance of the proposed SMCP method and its comparison with LASSO and MCP approaches are evaluated through simulation studies, which demonstrate that the proposed method is more accurate in selecting associated SNPs. Its applicability to real data is illustrated using heterogeneous stock mice data and a rheumatoid arthritis.

Entities:  

Keywords:  Feature selection; Genetic association; Linkage disequilibrium; Penalized regression; Single nucleotide polymorphism

Year:  2013        PMID: 25258655      PMCID: PMC4172344          DOI: 10.4310/SII.2013.v6.n1.a10

Source DB:  PubMed          Journal:  Stat Interface        ISSN: 1938-7989            Impact factor:   0.582


  10 in total

1.  Genome-wide genetic association of complex traits in heterogeneous stock mice.

Authors:  William Valdar; Leah C Solberg; Dominique Gauguier; Stephanie Burnett; Paul Klenerman; William O Cookson; Martin S Taylor; J Nicholas P Rawlins; Richard Mott; Jonathan Flint
Journal:  Nat Genet       Date:  2006-07-09       Impact factor: 38.330

2.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

3.  SparseNet: Coordinate Descent With Nonconvex Penalties.

Authors:  Rahul Mazumder; Jerome H Friedman; Trevor Hastie
Journal:  J Am Stat Assoc       Date:  2011       Impact factor: 5.033

4.  Genetic and environmental effects on complex traits in mice.

Authors:  William Valdar; Leah C Solberg; Dominique Gauguier; William O Cookson; J Nicholas P Rawlins; Richard Mott; Jonathan Flint
Journal:  Genetics       Date:  2006-08-03       Impact factor: 4.562

5.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.

Authors:  Patrick Breheny; Jian Huang
Journal:  Ann Appl Stat       Date:  2011-01-01       Impact factor: 2.083

6.  Replication of putative candidate-gene associations with rheumatoid arthritis in >4,000 samples from North America and Sweden: association of susceptibility with PTPN22, CTLA4, and PADI4.

Authors:  Robert M Plenge; Leonid Padyukov; Elaine F Remmers; Shaun Purcell; Annette T Lee; Elizabeth W Karlson; Frederick Wolfe; Daniel L Kastner; Lars Alfredsson; David Altshuler; Peter K Gregersen; Lars Klareskog; John D Rioux
Journal:  Am J Hum Genet       Date:  2005-11-01       Impact factor: 11.025

7.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

8.  A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis.

Authors:  Ann B Begovich; Victoria E H Carlton; Lee A Honigberg; Steven J Schrodi; Anand P Chokkalingam; Heather C Alexander; Kristin G Ardlie; Qiqing Huang; Ashley M Smith; Jill M Spoerke; Marion T Conn; Monica Chang; Sheng-Yung P Chang; Randall K Saiki; Joseph J Catanese; Diane U Leong; Veronica E Garcia; Linda B McAllister; Douglas A Jeffery; Annette T Lee; Franak Batliwalla; Elaine Remmers; Lindsey A Criswell; Michael F Seldin; Daniel L Kastner; Christopher I Amos; John J Sninsky; Peter K Gregersen
Journal:  Am J Hum Genet       Date:  2004-06-18       Impact factor: 11.025

Review 9.  A review of the MHC genetics of rheumatoid arthritis.

Authors:  J L Newton; S M J Harney; B P Wordsworth; M A Brown
Journal:  Genes Immun       Date:  2004-05       Impact factor: 2.676

10.  Data for Genetic Analysis Workshop 16 Problem 1, association analysis of rheumatoid arthritis data.

Authors:  Christopher I Amos; Wei Vivien Chen; Michael F Seldin; Elaine F Remmers; Kimberly E Taylor; Lindsey A Criswell; Annette T Lee; Robert M Plenge; Daniel L Kastner; Peter K Gregersen
Journal:  BMC Proc       Date:  2009-12-15
  10 in total
  7 in total

1.  Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.

Authors:  Hui Yi; Patrick Breheny; Netsanet Imam; Yongmei Liu; Ina Hoeschele
Journal:  Genetics       Date:  2014-10-28       Impact factor: 4.562

2.  A pharmacogenomic study on the pharmacokinetics of tacrolimus in healthy subjects using the DMETTM Plus platform.

Authors:  Y Choi; F Jiang; H An; H J Park; J H Choi; H Lee
Journal:  Pharmacogenomics J       Date:  2016-02-16       Impact factor: 3.550

3.  Genome-wide association studies using a penalized moving-window regression.

Authors:  Minli Bao; Kai Wang
Journal:  Bioinformatics       Date:  2017-12-15       Impact factor: 6.937

4.  A penalized robust method for identifying gene-environment interactions.

Authors:  Xingjie Shi; Jin Liu; Jian Huang; Yong Zhou; Yang Xie; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2014-02-24       Impact factor: 2.344

5.  Fast and flexible linear mixed models for genome-wide genetics.

Authors:  Daniel E Runcie; Lorin Crawford
Journal:  PLoS Genet       Date:  2019-02-08       Impact factor: 5.917

6.  Efficient cross-trait penalized regression increases prediction accuracy in large cohorts using secondary phenotypes.

Authors:  Wonil Chung; Jun Chen; Constance Turman; Sara Lindstrom; Zhaozhong Zhu; Po-Ru Loh; Peter Kraft; Liming Liang
Journal:  Nat Commun       Date:  2019-02-04       Impact factor: 14.919

7.  A Physics-Guided Neural Network for Predicting Protein-Ligand Binding Free Energy: From Host-Guest Systems to the PDBbind Database.

Authors:  Sahar Cain; Ali Risheh; Negin Forouzesh
Journal:  Biomolecules       Date:  2022-06-29
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.