Literature DB >> 23836599

The value of statistical or bioinformatics annotation for rare variant association with quantitative trait.

Andrea E Byrnes1, Michael C Wu, Fred A Wright, Mingyao Li, Yun Li.   

Abstract

In the past few years, a plethora of methods for rare variant association with phenotype have been proposed. These methods aggregate information from multiple rare variants across genomic region(s), but there is little consensus as to which method is most effective. The weighting scheme adopted when aggregating information across variants is one of the primary determinants of effectiveness. Here we present a systematic evaluation of multiple weighting schemes through a series of simulations intended to mimic large sequencing studies of a quantitative trait. We evaluate existing phenotype-independent and phenotype-dependent methods, as well as weights estimated by penalized regression approaches including Lasso, Elastic Net, and SCAD. We find that the difference in power between phenotype-dependent schemes is negligible when high-quality functional annotations are available. When functional annotations are unavailable or incomplete, all methods suffer from power loss; however, the variable selection methods outperform the others at the cost of increased computational time. Therefore, in the absence of good annotation, we recommend variable selection methods (which can be viewed as "statistical annotation") on top of regions implicated by a phenotype-independent weighting scheme. Further, once a region is implicated, variable selection can help to identify potential causal single nucleotide polymorphisms for biological validation. These findings are supported by an analysis of a high coverage targeted sequencing study of 1,898 individuals.
© 2013 WILEY PERIODICALS, INC.

Entities:  

Keywords:  association; rare variants; variable selection; variant annotation; weighting

Mesh:

Year:  2013        PMID: 23836599      PMCID: PMC4083762          DOI: 10.1002/gepi.21747

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  39 in total

1.  SIFT: Predicting amino acid changes that affect protein function.

Authors:  Pauline C Ng; Steven Henikoff
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Multiple rare alleles contribute to low plasma levels of HDL cholesterol.

Authors:  Jonathan C Cohen; Robert S Kiss; Alexander Pertsemlidis; Yves L Marcel; Ruth McPherson; Helen H Hobbs
Journal:  Science       Date:  2004-08-06       Impact factor: 47.728

3.  An efficient Monte Carlo approach to assessing statistical significance in genomic studies.

Authors:  D Y Lin
Journal:  Bioinformatics       Date:  2004-09-28       Impact factor: 6.937

Review 4.  Genotype imputation for genome-wide association studies.

Authors:  Jonathan Marchini; Bryan Howie
Journal:  Nat Rev Genet       Date:  2010-07       Impact factor: 53.242

5.  Calibrating a coalescent simulation of human genome sequence variation.

Authors:  Stephen F Schaffner; Catherine Foo; Stacey Gabriel; David Reich; Mark J Daly; David Altshuler
Journal:  Genome Res       Date:  2005-11       Impact factor: 9.043

6.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data.

Authors:  Bingshan Li; Suzanne M Leal
Journal:  Am J Hum Genet       Date:  2008-08-07       Impact factor: 11.025

7.  Practical aspects of imputation-driven meta-analysis of genome-wide association studies.

Authors:  Paul I W de Bakker; Manuel A R Ferreira; Xiaoming Jia; Benjamin M Neale; Soumya Raychaudhuri; Benjamin F Voight
Journal:  Hum Mol Genet       Date:  2008-10-15       Impact factor: 6.150

8.  Personal genomes: The case of the missing heritability.

Authors:  Brendan Maher
Journal:  Nature       Date:  2008-11-06       Impact factor: 49.962

9.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

10.  The CoLaus study: a population-based study to investigate the epidemiology and genetic determinants of cardiovascular risk factors and metabolic syndrome.

Authors:  Mathieu Firmann; Vladimir Mayor; Pedro Marques Vidal; Murielle Bochud; Alain Pécoud; Daniel Hayoz; Fred Paccaud; Martin Preisig; Kijoung S Song; Xin Yuan; Theodore M Danoff; Heide A Stirnadel; Dawn Waterworth; Vincent Mooser; Gérard Waeber; Peter Vollenweider
Journal:  BMC Cardiovasc Disord       Date:  2008-03-17       Impact factor: 2.298

View more
  11 in total

1.  Genetic risk prediction using a spatial autoregressive model with adaptive lasso.

Authors:  Yalu Wen; Xiaoxi Shen; Qing Lu
Journal:  Stat Med       Date:  2018-05-31       Impact factor: 2.373

2.  Multikernel linear mixed model with adaptive lasso for complex phenotype prediction.

Authors:  Yalu Wen; Qing Lu
Journal:  Stat Med       Date:  2020-01-27       Impact factor: 2.373

3.  Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data.

Authors:  Zihuai He; Bin Xu; Seunggeun Lee; Iuliana Ionita-Laza
Journal:  Am J Hum Genet       Date:  2017-08-24       Impact factor: 11.025

4.  Multi-kernel linear mixed model with adaptive lasso for prediction analysis on high-dimensional multi-omics data.

Authors:  Jun Li; Qing Lu; Yalu Wen
Journal:  Bioinformatics       Date:  2020-03-01       Impact factor: 6.937

5.  A penalized linear mixed model with generalized method of moments for prediction analysis on high-dimensional multi-omics data.

Authors:  Xiaqiong Wang; Yalu Wen
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

6.  Penalized co-inertia analysis with applications to -omics data.

Authors:  Eun Jeong Min; Sandra E Safo; Qi Long
Journal:  Bioinformatics       Date:  2019-03-15       Impact factor: 6.937

7.  A Bayesian linear mixed model for prediction of complex traits.

Authors:  Yang Hai; Yalu Wen
Journal:  Bioinformatics       Date:  2020-12-17       Impact factor: 6.937

8.  RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data.

Authors:  Xiaowei Zhan; Youna Hu; Bingshan Li; Goncalo R Abecasis; Dajiang J Liu
Journal:  Bioinformatics       Date:  2016-02-15       Impact factor: 6.937

9.  Risk Prediction Modeling of Sequencing Data Using a Forward Random Field Method.

Authors:  Yalu Wen; Zihuai He; Ming Li; Qing Lu
Journal:  Sci Rep       Date:  2016-02-19       Impact factor: 4.379

10.  Exome-wide rare variant analyses of two bone mineral density phenotypes: the challenges of analyzing rare genetic variation.

Authors:  Jianping Sun; Karim Oualkacha; Vincenzo Forgetta; Hou-Feng Zheng; J Brent Richards; Daniel S Evans; Eric Orwoll; Celia M T Greenwood
Journal:  Sci Rep       Date:  2018-01-09       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.