Literature DB >> 21133882

Control of population stratification by correlation-selected principal components.

Seunggeun Lee1, Fred A Wright, Fei Zou.   

Abstract

In genome-wide association studies, population stratification is recognized as producing inflated type I error due to the inflation of test statistics. Principal component-based methods applied to genotypes provide information about population structure, and have been widely used to control for stratification. Here we explore the precise relationship between genotype principal components and inflation of association test statistics, thereby drawing a connection between principal component-based stratification control and the alternative approach of genomic control. Our results provide an inherent justification for the use of principal components, but call into question the popular practice of selecting principal components based on significance of eigenvalues alone. We propose a new approach, called EigenCorr, which selects principal components based on both their eigenvalues and their correlation with the (disease) phenotype. Our approach tends to select fewer principal components for stratification control than does testing of eigenvalues alone, providing substantial computational savings and improvements in power. Analyses of simulated and real data demonstrate the usefulness of the proposed approach.
© 2010, The International Biometric Society.

Entities:  

Mesh:

Year:  2010        PMID: 21133882      PMCID: PMC3117098          DOI: 10.1111/j.1541-0420.2010.01520.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  18 in total

Review 1.  Genomic control, a new approach to genetic-based association studies.

Authors:  B Devlin; K Roeder; L Wasserman
Journal:  Theor Popul Biol       Date:  2001-11       Impact factor: 1.570

2.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  The effects of human population structure on large genetic association studies.

Authors:  Jonathan Marchini; Lon R Cardon; Michael S Phillips; Peter Donnelly
Journal:  Nat Genet       Date:  2004-03-28       Impact factor: 38.330

4.  Quantification of population structure using correlated SNPs by shrinkage principal components.

Authors:  Fei Zou; Seunggeun Lee; Michael R Knowles; Fred A Wright
Journal:  Hum Hered       Date:  2010-04-23       Impact factor: 0.444

5.  Genomic Control to the extreme.

Authors:  B Devlin; Silviu-Alin Bacanu; Kathryn Roeder
Journal:  Nat Genet       Date:  2004-11       Impact factor: 38.330

6.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

7.  A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity.

Authors:  D J Balding; R A Nichols
Journal:  Genetica       Date:  1995       Impact factor: 1.082

8.  A simple and improved correction for population stratification in case-control studies.

Authors:  Michael P Epstein; Andrew S Allen; Glen A Satten
Journal:  Am J Hum Genet       Date:  2007-03-29       Impact factor: 11.025

9.  A propensity score approach to correction for bias due to population stratification using genetic and non-genetic factors.

Authors:  Huaqing Zhao; Timothy R Rebbeck; Nandita Mitra
Journal:  Genet Epidemiol       Date:  2009-12       Impact factor: 2.135

10.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

View more
  18 in total

1.  Rare Coding Variants Associated with Breast Cancer.

Authors:  Mi-Ryung Han
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

2.  Genetic variations and risk of placental abruption: A genome-wide association study and meta-analysis of genome-wide association studies.

Authors:  Tsegaselassie Workalemahu; Daniel A Enquobahrie; Bizu Gelaye; Sixto E Sanchez; Pedro J Garcia; Fasil Tekola-Ayele; Anjum Hajat; Timothy A Thornton; Cande V Ananth; Michelle A Williams
Journal:  Placenta       Date:  2018-04-16       Impact factor: 3.481

Review 3.  Rare-variant association analysis: study designs and statistical tests.

Authors:  Seunggeung Lee; Gonçalo R Abecasis; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2014-07-03       Impact factor: 11.025

4.  Maternal and Offspring Genetic Risk of Type 2 Diabetes and Offspring Birthweight Among African Ancestry Populations.

Authors:  Mohammad L Rahman; Deepika Shrestha; Tsegaselassie Workalemahu; Jing Wu; Chunming Zhu; Cuilin Zhang; Fasil Tekola-Ayele
Journal:  J Clin Endocrinol Metab       Date:  2019-11-01       Impact factor: 5.958

5.  Set-Based Tests for the Gene-Environment Interaction in Longitudinal Studies.

Authors:  Zihuai He; Min Zhang; Seunggeun Lee; Jennifer A Smith; Sharon L R Kardia; Ana V Diez Roux; Bhramar Mukherjee
Journal:  J Am Stat Assoc       Date:  2016-12-16       Impact factor: 5.033

6.  Empirical pathway analysis, without permutation.

Authors:  Yi-Hui Zhou; William T Barry; Fred A Wright
Journal:  Biostatistics       Date:  2013-02-20       Impact factor: 5.899

7.  Adjustment for population stratification via principal components in association analysis of rare variants.

Authors:  Yiwei Zhang; Weihua Guan; Wei Pan
Journal:  Genet Epidemiol       Date:  2012-10-12       Impact factor: 2.135

8.  Correcting for population structure and kinship using the linear mixed model: theory and extensions.

Authors:  Gabriel E Hoffman
Journal:  PLoS One       Date:  2013-10-28       Impact factor: 3.240

9.  Further improvements to linear mixed models for genome-wide association studies.

Authors:  Christian Widmer; Christoph Lippert; Omer Weissbrod; Nicolo Fusi; Carl Kadie; Robert Davidson; Jennifer Listgarten; David Heckerman
Journal:  Sci Rep       Date:  2014-11-12       Impact factor: 4.379

10.  Discovery of candidate genes for muscle traits based on GWAS supported by eQTL-analysis.

Authors:  Siriluck Ponsuksili; Eduard Murani; Nares Trakooljul; Manfred Schwerin; Klaus Wimmers
Journal:  Int J Biol Sci       Date:  2014-03-10       Impact factor: 6.580

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.