Literature DB >> 32246825

Prioritizing genetic variants in GWAS with lasso using permutation-assisted tuning.

Songshan Yang1, Jiawei Wen1, Scott T Eckert2, Yaqun Wang3, Dajiang J Liu2, Rongling Wu2, Runze Li1, Xiang Zhan2.   

Abstract

MOTIVATION: Large scale genome-wide association studies (GWAS) have resulted in the identification of a wide range of genetic variants related to a host of complex traits and disorders. Despite their success, the individual single-nucleotide polymorphism (SNP) analysis approach adopted in most current GWAS can be limited in that it is usually biologically simple to elucidate a comprehensive genetic architecture of phenotypes and statistically underpowered due to heavy multiple-testing correction burden. On the other hand, multiple-SNP analyses (e.g. gene-based or region-based SNP-set analysis) are usually more powerful to examine the joint effects of a set of SNPs on the phenotype of interest. However, current multiple-SNP approaches can only draw an overall conclusion at the SNP-set level and does not directly inform which SNPs in the SNP-set are driving the overall genotype-phenotype association.
RESULTS: In this article, we propose a new permutation-assisted tuning procedure in lasso (plasso) to identify phenotype-associated SNPs in a joint multiple-SNP regression model in GWAS. The tuning parameter of lasso determines the amount of shrinkage and is essential to the performance of variable selection. In the proposed plasso procedure, we first generate permutations as pseudo-SNPs that are not associated with the phenotype. Then, the lasso tuning parameter is delicately chosen to separate true signal SNPs and non-informative pseudo-SNPs. We illustrate plasso using simulations to demonstrate its superior performance over existing methods, and application of plasso to a real GWAS dataset gains new additional insights into the genetic control of complex traits.
AVAILABILITY AND IMPLEMENTATION: R codes to implement the proposed methodology is available at https://github.com/xyz5074/plasso. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2020        PMID: 32246825      PMCID: PMC7320616          DOI: 10.1093/bioinformatics/btaa229

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Joint identification of multiple genetic variants via elastic-net variable selection in a genome-wide association analysis.

Authors:  Seoae Cho; Kyunga Kim; Young Jin Kim; Jong-Keuk Lee; Yoon Shin Cho; Jong-Young Lee; Bok-Ghee Han; Heebal Kim; Jurg Ott; Taesung Park
Journal:  Ann Hum Genet       Date:  2010-07-14       Impact factor: 1.670

2.  The Bayesian lasso for genome-wide association studies.

Authors:  Jiahan Li; Kiranmoy Das; Guifang Fu; Runze Li; Rongling Wu
Journal:  Bioinformatics       Date:  2010-12-14       Impact factor: 6.937

3.  A novel copy number variants kernel association test with application to autism spectrum disorders studies.

Authors:  Xiang Zhan; Santhosh Girirajan; Ni Zhao; Michael C Wu; Debashis Ghosh
Journal:  Bioinformatics       Date:  2016-08-06       Impact factor: 6.937

4.  Prioritizing individual genetic variants after kernel machine testing using variable selection.

Authors:  Qianchuan He; Tianxi Cai; Yang Liu; Ni Zhao; Quaker E Harmon; Lynn M Almli; Elisabeth B Binder; Stephanie M Engel; Kerry J Ressler; Karen N Conneely; Xihong Lin; Michael C Wu
Journal:  Genet Epidemiol       Date:  2016-08-03       Impact factor: 2.135

Review 5.  From genome-wide associations to candidate causal variants by statistical fine-mapping.

Authors:  Daniel J Schaid; Wenan Chen; Nicholas B Larson
Journal:  Nat Rev Genet       Date:  2018-08       Impact factor: 53.242

6.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

7.  SNP selection in genome-wide and candidate gene studies via penalized logistic regression.

Authors:  Kristin L Ayers; Heather J Cordell
Journal:  Genet Epidemiol       Date:  2010-12       Impact factor: 2.135

8.  Resampling-based tests for Lasso in genome-wide association studies.

Authors:  Jaron Arbet; Matt McGue; Snigdhansu Chatterjee; Saonli Basu
Journal:  BMC Genet       Date:  2017-07-24       Impact factor: 2.797

9.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

10.  Evaluation of the lasso and the elastic net in genome-wide association studies.

Authors:  Patrik Waldmann; Gábor Mészáros; Birgit Gredler; Christian Fuerst; Johann Sölkner
Journal:  Front Genet       Date:  2013-12-04       Impact factor: 4.599

View more
  3 in total

1.  Robust Huber-LASSO for improved prediction of protein, metabolite and gene expression levels relying on individual genotype data.

Authors:  Heike Deutelmoser; Dominique Scherer; Hermann Brenner; Melanie Waldenberger; Karsten Suhre; Gabi Kastenmüller; Justo Lorenzo Bermejo
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

2.  Adaptive and powerful microbiome multivariate association analysis via feature selection.

Authors:  Kalins Banerjee; Jun Chen; Xiang Zhan
Journal:  NAR Genom Bioinform       Date:  2022-01-14

3.  Machine learning models outperform deep learning models, provide interpretation and facilitate feature selection for soybean trait prediction.

Authors:  Mitchell Gill; Robyn Anderson; Haifei Hu; Mohammed Bennamoun; Jakob Petereit; Babu Valliyodan; Henry T Nguyen; Jacqueline Batley; Philipp E Bayer; David Edwards
Journal:  BMC Plant Biol       Date:  2022-04-08       Impact factor: 4.215

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.