Literature DB >> 23065775

Adjustment for population stratification via principal components in association analysis of rare variants.

Yiwei Zhang1, Weihua Guan, Wei Pan.   

Abstract

For unrelated samples, principal component (PC) analysis has been established as a simple and effective approach to adjusting for population stratification in association analysis of common variants (CVs, with minor allele frequencies MAF > 5%). However, it is less clear how it would perform in analysis of low-frequency variants (LFVs, MAF between 1% and 5%), or of rare variants (RVs, MAF < 5%). Furthermore, with next-generation sequencing data, it is unknown whether PCs should be constructed based on CVs, LFVs, or RVs. In this study, we used the 1000 Genomes Project sequence data to explore the construction of PCs and their use in association analysis of LFVs or RVs for unrelated samples. It is shown that a few top PCs based on either CVs or LFVs could separate two continental groups, European and African samples, but those based on only RVs performed less well. When applied to several association tests in simulated data with population stratification, using PCs based on either CVs or LFVs was effective in controlling Type I error rates, while nonadjustment led to inflated Type I error rates. Perhaps the most interesting observation is that, although the PCs based on LFVs could better separate the two continental groups than those based on CVs, the use of the former could lead to overadjustment in the sense of substantial power loss in the absence of population stratification; in contrast, we did not see any problem with the use of the PCs based on CVs in all our examples.
© 2012 Wiley Periodicals, Inc.

Entities:  

Mesh:

Year:  2012        PMID: 23065775      PMCID: PMC4066816          DOI: 10.1002/gepi.21691

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  39 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Association mapping, using a mixture model for complex traits.

Authors:  Xiaofeng Zhu; ShuangLin Zhang; Hongyu Zhao; Richard S Cooper
Journal:  Genet Epidemiol       Date:  2002-08       Impact factor: 2.135

3.  Pooled association tests for rare variants in exon-resequencing studies.

Authors:  Alkes L Price; Gregory V Kryukov; Paul I W de Bakker; Shaun M Purcell; Jeff Staples; Lee-Jen Wei; Shamil R Sunyaev
Journal:  Am J Hum Genet       Date:  2010-05-13       Impact factor: 11.025

4.  So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests.

Authors:  Karen N Conneely; Michael Boehnke
Journal:  Am J Hum Genet       Date:  2007-12       Impact factor: 11.025

5.  A comparison of association methods correcting for population stratification in case-control studies.

Authors:  Chengqing Wu; Andrew DeWan; Josephine Hoh; Zuoheng Wang
Journal:  Ann Hum Genet       Date:  2011-01-31       Impact factor: 1.670

6.  Evolution and functional impact of rare coding variation from deep sequencing of human exomes.

Authors:  Jacob A Tennessen; Abigail W Bigham; Timothy D O'Connor; Wenqing Fu; Eimear E Kenny; Simon Gravel; Sean McGee; Ron Do; Xiaoming Liu; Goo Jun; Hyun Min Kang; Daniel Jordan; Suzanne M Leal; Stacey Gabriel; Mark J Rieder; Goncalo Abecasis; David Altshuler; Deborah A Nickerson; Eric Boerwinkle; Shamil Sunyaev; Carlos D Bustamante; Michael J Bamshad; Joshua M Akey
Journal:  Science       Date:  2012-05-17       Impact factor: 47.728

7.  A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST).

Authors:  Stephan Morgenthaler; William G Thilly
Journal:  Mutat Res       Date:  2006-11-13       Impact factor: 2.433

8.  Genotype-based matching to correct for population stratification in large-scale case-control genetic association studies.

Authors:  Weihua Guan; Liming Liang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2009-09       Impact factor: 2.135

9.  Discovering genetic ancestry using spectral graph theory.

Authors:  Ann B Lee; Diana Luca; Lambertus Klei; Bernie Devlin; Kathryn Roeder
Journal:  Genet Epidemiol       Date:  2010-01       Impact factor: 2.135

10.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

View more
  21 in total

1.  Utilizing the Jaccard index to reveal population stratification in sequencing data: a simulation study and an application to the 1000 Genomes Project.

Authors:  Dmitry Prokopenko; Julian Hecker; Edwin K Silverman; Marcello Pagano; Markus M Nöthen; Christian Dina; Christoph Lange; Heide Loehlein Fier
Journal:  Bioinformatics       Date:  2015-12-31       Impact factor: 6.937

2.  A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS).

Authors:  Huaqing Zhao; Nandita Mitra; Peter A Kanetsky; Katherine L Nathanson; Timothy R Rebbeck
Journal:  Stat Appl Genet Mol Biol       Date:  2018-12-04

3.  Sex- and age-interacting eQTLs in human complex diseases.

Authors:  Chen Yao; Roby Joehanes; Andrew D Johnson; Tianxiao Huan; Tõnu Esko; Saixia Ying; Jane E Freedman; Joanne Murabito; Kathryn L Lunetta; Andres Metspalu; Peter J Munson; Daniel Levy
Journal:  Hum Mol Genet       Date:  2013-11-15       Impact factor: 6.150

4.  Adjusting for population stratification in a fine scale with principal components and sequencing data.

Authors:  Yiwei Zhang; Xiaotong Shen; Wei Pan
Journal:  Genet Epidemiol       Date:  2013-10-05       Impact factor: 2.135

5.  Imputation and quality control steps for combining multiple genome-wide datasets.

Authors:  Shefali S Verma; Mariza de Andrade; Gerard Tromp; Helena Kuivaniemi; Elizabeth Pugh; Bahram Namjou-Khales; Shubhabrata Mukherjee; Gail P Jarvik; Leah C Kottyan; Amber Burt; Yuki Bradford; Gretta D Armstrong; Kimberly Derr; Dana C Crawford; Jonathan L Haines; Rongling Li; David Crosslin; Marylyn D Ritchie
Journal:  Front Genet       Date:  2014-12-11       Impact factor: 4.599

6.  A powerful and adaptive association test for rare variants.

Authors:  Wei Pan; Junghi Kim; Yiwei Zhang; Xiaotong Shen; Peng Wei
Journal:  Genetics       Date:  2014-05-15       Impact factor: 4.562

7.  On the substructure controls in rare variant analysis: Principal components or variance components?

Authors:  Yiwen Luo; Arnab Maity; Michael C Wu; Chris Smith; Qing Duan; Yun Li; Jung-Ying Tzeng
Journal:  Genet Epidemiol       Date:  2017-12-26       Impact factor: 2.135

Review 8.  Identifying rare variants associated with complex traits via sequencing.

Authors:  Bingshan Li; Dajiang J Liu; Suzanne M Leal
Journal:  Curr Protoc Hum Genet       Date:  2013-07

9.  Semi-supervised spectral clustering with application to detect population stratification.

Authors:  Binghui Liu; Xiaotong Shen; Wei Pan
Journal:  Front Genet       Date:  2013-10-25       Impact factor: 4.599

10.  Whole-exome sequencing of over 4100 men of African ancestry and prostate cancer risk.

Authors:  Kristin A Rand; Nadin Rohland; Arti Tandon; Alex Stram; Xin Sheng; Ron Do; Bogdan Pasaniuc; Alex Allen; Dominique Quinque; Swapan Mallick; Loic Le Marchand; Sam Kaggwa; Alex Lubwama; Daniel O Stram; Stephen Watya; Brian E Henderson; David V Conti; David Reich; Christopher A Haiman
Journal:  Hum Mol Genet       Date:  2015-11-24       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.