Literature DB >> 21036813

A variable selection method for genome-wide association studies.

Qianchuan He1, Dan-Yu Lin.   

Abstract

MOTIVATION: Genome-wide association studies (GWAS) involving half a million or more single nucleotide polymorphisms (SNPs) allow genetic dissection of complex diseases in a holistic manner. The common practice of analyzing one SNP at a time does not fully realize the potential of GWAS to identify multiple causal variants and to predict risk of disease. Existing methods for joint analysis of GWAS data tend to miss causal SNPs that are marginally uncorrelated with disease and have high false discovery rates (FDRs).
RESULTS: We introduce GWASelect, a statistically powerful and computationally efficient variable selection method designed to tackle the unique challenges of GWAS data. This method searches iteratively over the potential SNPs conditional on previously selected SNPs and is thus capable of capturing causal SNPs that are marginally correlated with disease as well as those that are marginally uncorrelated with disease. A special resampling mechanism is built into the method to reduce false positive findings. Simulation studies demonstrate that the GWASelect performs well under a wide spectrum of linkage disequilibrium patterns and can be substantially more powerful than existing methods in capturing causal variants while having a lower FDR. In addition, the regression models based on the GWASelect tend to yield more accurate prediction of disease risk than existing methods. The advantages of the GWASelect are illustrated with the Wellcome Trust Case-Control Consortium (WTCCC) data. AVAILABILITY: The software implementing GWASelect is available at http://www.bios.unc.edu/~lin. Access to WTCCC data: http://www.wtccc.org.uk/.

Entities:  

Mesh:

Year:  2010        PMID: 21036813      PMCID: PMC3025714          DOI: 10.1093/bioinformatics/btq600

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  21 in total

1.  Genome-wide coexpression dynamics: theory and application.

Authors:  Ker-Chau Li
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-16       Impact factor: 11.205

2.  GWAsimulator: a rapid whole-genome simulation program.

Authors:  Chun Li; Mingyao Li
Journal:  Bioinformatics       Date:  2007-11-15       Impact factor: 6.937

3.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

4.  Prediction of individual genetic risk to disease from genome-wide association studies.

Authors:  Naomi R Wray; Michael E Goddard; Peter M Visscher
Journal:  Genome Res       Date:  2007-09-04       Impact factor: 9.043

5.  Interaction of CED-6/GULP, an adapter protein involved in engulfment of apoptotic cells with CED-1 and CD91/low density lipoprotein receptor-related protein (LRP).

Authors:  Hua Poo Su; Kumiko Nakada-Tsukui; Annie-Carole Tosello-Trampont; Yonghe Li; Guojun Bu; Peter M Henson; Kodimangalam S Ravichandran
Journal:  J Biol Chem       Date:  2001-11-29       Impact factor: 5.157

6.  Ultrahigh dimensional feature selection: beyond the linear model.

Authors:  Jianqing Fan; Richard Samworth; Yichao Wu
Journal:  J Mach Learn Res       Date:  2009       Impact factor: 3.654

7.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

8.  From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.

Authors:  Zhi Wei; Kai Wang; Hui-Qi Qu; Haitao Zhang; Jonathan Bradfield; Cecilia Kim; Edward Frackleton; Cuiping Hou; Joseph T Glessner; Rosetta Chiavacci; Charles Stanley; Dimitri Monos; Struan F A Grant; Constantin Polychronakos; Hakon Hakonarson
Journal:  PLoS Genet       Date:  2009-10-09       Impact factor: 5.917

9.  Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

Authors:  Clive J Hoggart; John C Whittaker; Maria De Iorio; David J Balding
Journal:  PLoS Genet       Date:  2008-07-25       Impact factor: 5.917

10.  Isomer-specific effects of CLA on gene expression in human adipose tissue depending on PPARgamma2 P12A polymorphism: a double blind, randomized, controlled cross-over study.

Authors:  J Herrmann; D Rubin; R Häsler; U Helwig; M Pfeuffer; A Auinger; C Laue; P Winkler; S Schreiber; D Bell; J Schrezenmeir
Journal:  Lipids Health Dis       Date:  2009-08-18       Impact factor: 3.876

View more
  51 in total

1.  Detecting genome-wide epistases based on the clustering of relatively frequent items.

Authors:  Minzhu Xie; Jing Li; Tao Jiang
Journal:  Bioinformatics       Date:  2011-11-03       Impact factor: 6.937

2.  Exploiting Linkage Disequilibrium for Ultrahigh-Dimensional Genome-Wide Data with an Integrated Statistical Approach.

Authors:  Michelle Carlsen; Guifang Fu; Shaun Bushman; Christopher Corcoran
Journal:  Genetics       Date:  2015-12-12       Impact factor: 4.562

3.  A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES.

Authors:  Jiahan Li; Wei Zhong; Runze Li; Rongling Wu
Journal:  Ann Appl Stat       Date:  2014       Impact factor: 2.083

4.  A novel variational Bayes multiple locus Z-statistic for genome-wide association studies with Bayesian model averaging.

Authors:  Benjamin A Logsdon; Cara L Carty; Alexander P Reiner; James Y Dai; Charles Kooperberg
Journal:  Bioinformatics       Date:  2012-05-04       Impact factor: 6.937

5.  A model-free approach for detecting interactions in genetic association studies.

Authors:  Jiahan Li; Jun Dan; Chunlei Li; Rongling Wu
Journal:  Brief Bioinform       Date:  2013-11-21       Impact factor: 11.622

6.  Variable Selection in Heterogeneous Datasets: A Truncated-rank Sparse Linear Mixed Model with Applications to Genome-wide Association Studies.

Authors:  Haohan Wang; Bryon Aragam; Eric P Xing
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-12-18

7.  Fine mapping by composite genome-wide association analysis.

Authors:  Joaquim Casellas; Jhon Jacobo Cañas-Álvarez; Marta Fina; Jesús Piedrafita; Alessio Cecchinato
Journal:  Genet Res (Camb)       Date:  2017-06-06       Impact factor: 1.588

8.  GWASinlps: non-local prior based iterative SNP selection tool for genome-wide association studies.

Authors:  Nilotpal Sanyal; Min-Tzu Lo; Karolina Kauppi; Srdjan Djurovic; Ole A Andreassen; Valen E Johnson; Chi-Hua Chen
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

9.  An integrated approach to reduce the impact of minor allele frequency and linkage disequilibrium on variable importance measures for genome-wide data.

Authors:  Raymond Walters; Charles Laurin; Gitta H Lubke
Journal:  Bioinformatics       Date:  2012-07-30       Impact factor: 6.937

10.  The use of vector bootstrapping to improve variable selection precision in Lasso models.

Authors:  Charles Laurin; Dorret Boomsma; Gitta Lubke
Journal:  Stat Appl Genet Mol Biol       Date:  2016-08-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.