Literature DB >> 22734045

Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test.

Tianxi Cai1, Xihong Lin, Raymond J Carroll.   

Abstract

In recent years, genome-wide association studies (GWAS) and gene-expression profiling have generated a large number of valuable datasets for assessing how genetic variations are related to disease outcomes. With such datasets, it is often of interest to assess the overall effect of a set of genetic markers, assembled based on biological knowledge. Genetic marker-set analyses have been advocated as more reliable and powerful approaches compared with the traditional marginal approaches (Curtis and others, 2005. Pathways to the analysis of microarray data. TRENDS in Biotechnology 23, 429-435; Efroni and others, 2007. Identification of key processes underlying cancer phenotypes using biologic pathway analysis. PLoS One 2, 425). Procedures for testing the overall effect of a marker-set have been actively studied in recent years. For example, score tests derived under an Empirical Bayes (EB) framework (Liu and others, 2007. Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models. Biometrics 63, 1079-1088; Liu and others, 2008. Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC bioinformatics 9, 292-2; Wu and others, 2010. Powerful SNP-set analysis for case-control genome-wide association studies. American Journal of Human Genetics 86, 929) have been proposed as powerful alternatives to the standard Rao score test (Rao, 1948. Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. Mathematical Proceedings of the Cambridge Philosophical Society, 44, 50-57). The advantages of these EB-based tests are most apparent when the markers are correlated, due to the reduction in the degrees of freedom. In this paper, we propose an adaptive score test which up- or down-weights the contributions from each member of the marker-set based on the Z-scores of their effects. Such an adaptive procedure gains power over the existing procedures when the signal is sparse and the correlation among the markers is weak. By combining evidence from both the EB-based score test and the adaptive test, we further construct an omnibus test that attains good power in most settings. The null distributions of the proposed test statistics can be approximated well either via simple perturbation procedures or via distributional approximations. Through extensive simulation studies, we demonstrate that the proposed procedures perform well in finite samples. We apply the tests to a breast cancer genetic study to assess the overall effect of the FGFR2 gene on breast cancer risk.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22734045      PMCID: PMC3440238          DOI: 10.1093/biostatistics/kxs015

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  24 in total

1.  Knowledge-based analysis of microarray gene expression data by using support vector machines.

Authors:  M P Brown; W N Grundy; D Lin; N Cristianini; C W Sugnet; T S Furey; M Ares; D Haussler
Journal:  Proc Natl Acad Sci U S A       Date:  2000-01-04       Impact factor: 11.205

2.  Truncated product method for combining P-values.

Authors:  D V Zaykin; Lev A Zhivotovsky; P H Westfall; B S Weir
Journal:  Genet Epidemiol       Date:  2002-02       Impact factor: 2.135

3.  Empirical Bayes methods for testing associations with large numbers of candidate genes in the presence of environmental risk factors, with applications to HLA associations in IDDM.

Authors:  D Thomas; B Langholz; D Clayton; J Pitkäniemi; E Tuomilehto-Wolf; J Tuomilehto
Journal:  Ann Med       Date:  1992-10       Impact factor: 4.709

4.  A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other.

Authors:  Dale R Nyholt
Journal:  Am J Hum Genet       Date:  2004-03-02       Impact factor: 11.025

5.  An efficient Monte Carlo approach to assessing statistical significance in genomic studies.

Authors:  D Y Lin
Journal:  Bioinformatics       Date:  2004-09-28       Impact factor: 6.937

6.  Testing association of a pathway with survival using gene expression data.

Authors:  Jelle J Goeman; Jan Oosting; Anne-Marie Cleton-Jansen; Jakob K Anninga; Hans C van Houwelingen
Journal:  Bioinformatics       Date:  2005-01-18       Impact factor: 6.937

Review 7.  Pathways to the analysis of microarray data.

Authors:  R Keira Curtis; Matej Oresic; Antonio Vidal-Puig
Journal:  Trends Biotechnol       Date:  2005-08       Impact factor: 19.536

8.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

9.  Robust genetic linkage analysis based on a score test of homogeneity: the weighted pairwise correlation statistic.

Authors:  D Commenges
Journal:  Genet Epidemiol       Date:  1994       Impact factor: 2.135

10.  Genome-wide association study of prostate cancer identifies a second risk locus at 8q24.

Authors:  Meredith Yeager; Nick Orr; Richard B Hayes; Kevin B Jacobs; Peter Kraft; Sholom Wacholder; Mark J Minichiello; Paul Fearnhead; Kai Yu; Nilanjan Chatterjee; Zhaoming Wang; Robert Welch; Brian J Staats; Eugenia E Calle; Heather Spencer Feigelson; Michael J Thun; Carmen Rodriguez; Demetrius Albanes; Jarmo Virtamo; Stephanie Weinstein; Fredrick R Schumacher; Edward Giovannucci; Walter C Willett; Geraldine Cancel-Tassin; Olivier Cussenot; Antoine Valeri; Gerald L Andriole; Edward P Gelmann; Margaret Tucker; Daniela S Gerhard; Joseph F Fraumeni; Robert Hoover; David J Hunter; Stephen J Chanock; Gilles Thomas
Journal:  Nat Genet       Date:  2007-04-01       Impact factor: 38.330

View more
  13 in total

1.  iGWAS: Integrative Genome-Wide Association Studies of Genetic and Genomic Data for Disease Susceptibility Using Mediation Analysis.

Authors:  Yen-Tsung Huang; Liming Liang; Miriam F Moffatt; William O C M Cookson; Xihong Lin
Journal:  Genet Epidemiol       Date:  2015-05-22       Impact factor: 2.135

2.  Kernel-machine testing coupled with a rank-truncation method for genetic pathway analysis.

Authors:  Qi Yan; Hemant K Tiwari; Nengjun Yi; Wan-Yu Lin; Guimin Gao; Xiang-Yang Lou; Xiangqin Cui; Nianjun Liu
Journal:  Genet Epidemiol       Date:  2014-05-21       Impact factor: 2.135

3.  A unified powerful set-based test for sequencing data analysis of GxE interactions.

Authors:  Yu-Ru Su; Chong-Zhi Di; Li Hsu
Journal:  Biostatistics       Date:  2016-07-28       Impact factor: 5.899

4.  Integrative analysis of micro-RNA, gene expression, and survival of glioblastoma multiforme.

Authors:  Yen-Tsung Huang; Thomas Hsu; Karl T Kelsey; Chien-Ling Lin
Journal:  Genet Epidemiol       Date:  2014-12-23       Impact factor: 2.135

5.  Kernel machine SNP-set testing under multiple candidate kernels.

Authors:  Michael C Wu; Arnab Maity; Seunggeun Lee; Elizabeth M Simmons; Quaker E Harmon; Xinyi Lin; Stephanie M Engel; Jeffrey J Molldrem; Paul M Armistead
Journal:  Genet Epidemiol       Date:  2013-03-07       Impact factor: 2.135

6.  JOINT ANALYSIS OF SNP AND GENE EXPRESSION DATA IN GENETIC ASSOCIATION STUDIES OF COMPLEX DISEASES.

Authors:  Yen-Tsung Huang; Tyler J Vanderweele; Xihong Lin
Journal:  Ann Appl Stat       Date:  2014-03-01       Impact factor: 2.083

7.  An Adaptive Genetic Association Test Using Double Kernel Machines.

Authors:  Xiang Zhan; Michael P Epstein; Debashis Ghosh
Journal:  Stat Biosci       Date:  2014-06-24

8.  SBERIA: set-based gene-environment interaction test for rare and common variants in complex diseases.

Authors:  Shuo Jiao; Li Hsu; Stéphane Bézieau; Hermann Brenner; Andrew T Chan; Jenny Chang-Claude; Loic Le Marchand; Mathieu Lemire; Polly A Newcomb; Martha L Slattery; Ulrike Peters
Journal:  Genet Epidemiol       Date:  2013-05-29       Impact factor: 2.135

9.  Gene set analysis using variance component tests.

Authors:  Yen-Tsung Huang; Xihong Lin
Journal:  BMC Bioinformatics       Date:  2013-06-28       Impact factor: 3.169

10.  Integrating Clinical and Multiple Omics Data for Prognostic Assessment across Human Cancers.

Authors:  Bin Zhu; Nan Song; Ronglai Shen; Arshi Arora; Mitchell J Machiela; Lei Song; Maria Teresa Landi; Debashis Ghosh; Nilanjan Chatterjee; Veera Baladandayuthapani; Hongyu Zhao
Journal:  Sci Rep       Date:  2017-12-05       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.