Literature DB >> 29280188

On the substructure controls in rare variant analysis: Principal components or variance components?

Yiwen Luo1,2, Arnab Maity2, Michael C Wu3, Chris Smith1, Qing Duan4, Yun Li4,5, Jung-Ying Tzeng1,2,6,7.   

Abstract

Recent studies showed that population substructure (PS) can have more complex impact on rare variant tests and that similarity-based collapsing tests (e.g., SKAT) may suffer more severely by PS than burden-based tests. In this work, we evaluate the performance of SKAT coupling with principal components (PC) or variance components (VC) based PS correction methods. We consider confounding effects caused by PS including stratified populations, admixed populations, and spatially distributed nongenetic risk; we investigate which types of variants (e.g., common, less frequent, rare, or all variants) should be used to effectively control for confounding effects. We found that (i) PC-based methods can account for confounding effects in most scenarios except for admixture, although the number of sufficient PCs depends on the PS complexity and the type of variants used. (ii) PCs based on all variants (i.e., common + less frequent + rare) tend to require equal or fewer sufficient PCs and often achieve higher power than PCs based on other variant types. (iii) VC-based methods can effectively adjust for confounding in all scenarios (even for admixture), though the type of variants should be used to construct VC may vary. (iv) VC based on all variants works consistently in all scenarios, though its power may be sometimes lower than VC based on other variant types. Given that the best-performed method and which variants to use depend on the underlying unknown confounding mechanisms, a robust strategy is to perform SKAT analyses using VC-based methods based on all variants.
© 2017 WILEY PERIODICALS, INC.

Entities:  

Keywords:  population substructure; principal components analysis; rare variant association tests; variance components

Mesh:

Year:  2017        PMID: 29280188      PMCID: PMC5851819          DOI: 10.1002/gepi.22102

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  41 in total

1.  Pooled association tests for rare variants in exon-resequencing studies.

Authors:  Alkes L Price; Gregory V Kryukov; Paul I W de Bakker; Shaun M Purcell; Jeff Staples; Lee-Jen Wei; Shamil R Sunyaev
Journal:  Am J Hum Genet       Date:  2010-05-13       Impact factor: 11.025

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  ARIEL and AMELIA: testing for an accumulation of rare variants using next-generation sequencing data.

Authors:  Jennifer L Asimit; Aaron G Day-Williams; Andrew P Morris; Eleftheria Zeggini
Journal:  Hum Hered       Date:  2012-03-22       Impact factor: 0.444

4.  A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST).

Authors:  Stephan Morgenthaler; William G Thilly
Journal:  Mutat Res       Date:  2006-11-13       Impact factor: 2.433

5.  Adjustment for population stratification via principal components in association analysis of rare variants.

Authors:  Yiwei Zhang; Weihua Guan; Wei Pan
Journal:  Genet Epidemiol       Date:  2012-10-12       Impact factor: 2.135

6.  GENE-LEVEL PHARMACOGENETIC ANALYSIS ON SURVIVAL OUTCOMES USING GENE-TRAIT SIMILARITY REGRESSION.

Authors:  Jung-Ying Tzeng; Wenbin Lu; Fang-Chi Hsu
Journal:  Ann Appl Stat       Date:  2014       Impact factor: 2.083

7.  Assessing the impact of population stratification on association studies of rare variation.

Authors:  Yunxuan Jiang; Michael P Epstein; Karen N Conneely
Journal:  Hum Hered       Date:  2013-07-31       Impact factor: 0.444

8.  Low frequency variants, collapsed based on biological knowledge, uncover complexity of population stratification in 1000 genomes project data.

Authors:  Carrie B Moore; John R Wallace; Daniel J Wolfe; Alex T Frase; Sarah A Pendergrass; Kenneth M Weiss; Marylyn D Ritchie
Journal:  PLoS Genet       Date:  2013-12-26       Impact factor: 5.917

9.  An evaluation of statistical approaches to rare variant analysis in genetic association studies.

Authors:  Andrew P Morris; Eleftheria Zeggini
Journal:  Genet Epidemiol       Date:  2010-02       Impact factor: 2.135

10.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

View more
  3 in total

1.  Taking population stratification into account by local permutations in rare-variant association studies on small samples.

Authors:  Jimmy Mullaert; Matthieu Bouaziz; Yoann Seeleuthner; Benedetta Bigio; Jean-Laurent Casanova; Alexandre Alcaïs; Laurent Abel; Aurélie Cobat
Journal:  Genet Epidemiol       Date:  2021-08-17       Impact factor: 2.135

2.  Comparison of mixed model based approaches for correcting for population substructure with application to extreme phenotype sampling.

Authors:  Maryam Onifade; Marie-Hélène Roy-Gagnon; Marie-Élise Parent; Kelly M Burkett
Journal:  BMC Genomics       Date:  2022-02-04       Impact factor: 3.969

3.  Controlling for human population stratification in rare variant association studies.

Authors:  Laurent Abel; Aurélie Cobat; Matthieu Bouaziz; Jimmy Mullaert; Benedetta Bigio; Yoann Seeleuthner; Jean-Laurent Casanova; Alexandre Alcais
Journal:  Sci Rep       Date:  2021-09-24       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.