Literature DB >> 25354699

Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.

Hui Yi1, Patrick Breheny2, Netsanet Imam3, Yongmei Liu4, Ina Hoeschele5.   

Abstract

The data from genome-wide association studies (GWAS) in humans are still predominantly analyzed using single-marker association methods. As an alternative to single-marker analysis (SMA), all or subsets of markers can be tested simultaneously. This approach requires a form of penalized regression (PR) as the number of SNPs is much larger than the sample size. Here we review PR methods in the context of GWAS, extend them to perform penalty parameter and SNP selection by false discovery rate (FDR) control, and assess their performance in comparison with SMA. PR methods were compared with SMA, using realistically simulated GWAS data with a continuous phenotype and real data. Based on these comparisons our analytic FDR criterion may currently be the best approach to SNP selection using PR for GWAS. We found that PR with FDR control provides substantially more power than SMA with genome-wide type-I error control but somewhat less power than SMA with Benjamini-Hochberg FDR control (SMA-BH). PR with FDR-based penalty parameter selection controlled the FDR somewhat conservatively while SMA-BH may not achieve FDR control in all situations. Differences among PR methods seem quite small when the focus is on SNP selection with FDR control. Incorporating linkage disequilibrium into the penalization by adapting penalties developed for covariates measured on graphs can improve power but also generate more false positives or wider regions for follow-up. We recommend the elastic net with a mixing weight for the Lasso penalty near 0.5 as the best method.
Copyright © 2015 by the Genetics Society of America.

Entities:  

Keywords:  false discovery rate; genome-wide association study; linkage disequilibrium; penalized regression

Mesh:

Year:  2014        PMID: 25354699      PMCID: PMC4286685          DOI: 10.1534/genetics.114.167817

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  28 in total

1.  False discovery rate in linkage and association genome screens for complex disorders.

Authors:  Chiara Sabatti; Susan Service; Nelson Freimer
Journal:  Genetics       Date:  2003-06       Impact factor: 4.562

2.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

3.  Network-constrained regularization and variable selection for analysis of genomic data.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Bioinformatics       Date:  2008-03-01       Impact factor: 6.937

Review 4.  Linkage disequilibrium in humans: models and data.

Authors:  J K Pritchard; M Przeworski
Journal:  Am J Hum Genet       Date:  2001-06-14       Impact factor: 11.025

5.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.

Authors:  Patrick Breheny; Jian Huang
Journal:  Ann Appl Stat       Date:  2011-01-01       Impact factor: 2.083

6.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

7.  HIGH DIMENSIONAL VARIABLE SELECTION.

Authors:  Larry Wasserman; Kathryn Roeder
Journal:  Ann Stat       Date:  2009-01-01       Impact factor: 4.028

8.  VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Ann Appl Stat       Date:  2010-09-01       Impact factor: 2.083

9.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

10.  PUMA: a unified framework for penalized multiple regression analysis of GWAS data.

Authors:  Gabriel E Hoffman; Benjamin A Logsdon; Jason G Mezey
Journal:  PLoS Comput Biol       Date:  2013-06-27       Impact factor: 4.475

View more
  17 in total

1.  Genetic Variant Selection: Learning Across Traits and Sites.

Authors:  Laurel Stell; Chiara Sabatti
Journal:  Genetics       Date:  2015-12-17       Impact factor: 4.562

2.  Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case-control genetic association studies.

Authors:  Tamar Sofer; Elizabeth D Schifano; David C Christiani; Xihong Lin
Journal:  Biometrics       Date:  2017-03-27       Impact factor: 2.571

3.  Prioritizing genetic variants in GWAS with lasso using permutation-assisted tuning.

Authors:  Songshan Yang; Jiawei Wen; Scott T Eckert; Yaqun Wang; Dajiang J Liu; Rongling Wu; Runze Li; Xiang Zhan
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

Review 4.  Biosignature Discovery for Substance Use Disorders Using Statistical Learning.

Authors:  James W Baurley; Christopher S McMahan; Carolyn M Ervin; Bens Pardamean; Andrew W Bergen
Journal:  Trends Mol Med       Date:  2018-02-04       Impact factor: 11.951

5.  Association of a 3' untranslated region polymorphism in proprotein convertase subtilisin/kexin type 9 with HIV viral load and CD4+ levels in HIV/hepatitis C virus coinfected women.

Authors:  Mark H Kuniholm; Hua Liang; Kathryn Anastos; Deborah Gustafson; Seble Kassaye; Marek Nowicki; Beverly E Sha; Emilia J Pawlowski; Stephen J Gange; Bradley E Aouizerat; Tatiana Pushkarsky; Michael I Bukrinsky; Vinayaka R Prasad
Journal:  AIDS       Date:  2017-11-28       Impact factor: 4.632

6.  Prediction of complex human diseases from pathway-focused candidate markers by joint estimation of marker effects: case of chronic fatigue syndrome.

Authors:  Madhuchhanda Bhattacharjee; Mangalathu S Rajeevan; Mikko J Sillanpää
Journal:  Hum Genomics       Date:  2015-06-11       Impact factor: 4.639

7.  Non-parametric genetic prediction of complex traits with latent Dirichlet process regression models.

Authors:  Ping Zeng; Xiang Zhou
Journal:  Nat Commun       Date:  2017-09-06       Impact factor: 14.919

8.  Resampling-based tests for Lasso in genome-wide association studies.

Authors:  Jaron Arbet; Matt McGue; Snigdhansu Chatterjee; Saonli Basu
Journal:  BMC Genet       Date:  2017-07-24       Impact factor: 2.797

9.  A novel linkage-disequilibrium corrected genomic relationship matrix for SNP-heritability estimation and genomic prediction.

Authors:  Boby Mathew; Jens Léon; Mikko J Sillanpää
Journal:  Heredity (Edinb)       Date:  2017-12-14       Impact factor: 3.821

10.  Performance of a blockwise approach in variable selection using linkage disequilibrium information.

Authors:  Alia Dehman; Christophe Ambroise; Pierre Neuvial
Journal:  BMC Bioinformatics       Date:  2015-05-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.