Literature DB >> 25536929

Principal component regression and linear mixed model in association analysis of structured samples: competitors or complements?

Yiwei Zhang1, Wei Pan.   

Abstract

Genome-wide association studies (GWAS) have been established as a major tool to identify genetic variants associated with complex traits, such as common diseases. However, GWAS may suffer from false positives and false negatives due to confounding population structures, including known or unknown relatedness. Another important issue is unmeasured environmental risk factors. Among many methods for adjusting for population structures, two approaches stand out: one is principal component regression (PCR) based on principal component analysis, which is perhaps the most popular due to its early appearance, simplicity, and general effectiveness; the other is based on a linear mixed model (LMM) that has emerged recently as perhaps the most flexible and effective, especially for samples with complex structures as in model organisms. As shown previously, the PCR approach can be regarded as an approximation to an LMM; such an approximation depends on the number of the top principal components (PCs) used, the choice of which is often difficult in practice. Hence, in the presence of population structure, the LMM appears to outperform the PCR method. However, due to the different treatments of fixed vs. random effects in the two approaches, we show an advantage of PCR over LMM: in the presence of an unknown but spatially confined environmental confounder (e.g., environmental pollution or lifestyle), the PCs may be able to implicitly and effectively adjust for the confounder whereas the LMM cannot. Accordingly, to adjust for both population structures and nongenetic confounders, we propose a hybrid method combining the use and, thus, strengths of PCR and LMM. We use real genotype data and simulated phenotypes to confirm the above points, and establish the superior performance of the hybrid method across all scenarios.
© 2015 Wiley Periodicals, Inc.

Entities:  

Keywords:  association testing; confounding; environmental risk; population stratification; probabilistic principal component analysis

Mesh:

Year:  2014        PMID: 25536929      PMCID: PMC4366301          DOI: 10.1002/gepi.21879

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  28 in total

1.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  An analytical comparison of the principal component method and the mixed effects model for association studies in the presence of cryptic relatedness and population stratification.

Authors:  Kai Wang; Xijian Hu; Yingwei Peng
Journal:  Hum Hered       Date:  2013-07-31       Impact factor: 0.444

4.  A comparison of association methods correcting for population stratification in case-control studies.

Authors:  Chengqing Wu; Andrew DeWan; Josephine Hoh; Zuoheng Wang
Journal:  Ann Hum Genet       Date:  2011-01-31       Impact factor: 1.670

5.  A simple and improved correction for population stratification in case-control studies.

Authors:  Michael P Epstein; Andrew S Allen; Glen A Satten
Journal:  Am J Hum Genet       Date:  2007-03-29       Impact factor: 11.025

6.  Genotype-based matching to correct for population stratification in large-scale case-control genetic association studies.

Authors:  Weihua Guan; Liming Liang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2009-09       Impact factor: 2.135

7.  Discovering genetic ancestry using spectral graph theory.

Authors:  Ann B Lee; Diana Luca; Lambertus Klei; Bernie Devlin; Kathryn Roeder
Journal:  Genet Epidemiol       Date:  2010-01       Impact factor: 2.135

8.  Advantages and pitfalls in the application of mixed-model association methods.

Authors:  Jian Yang; Noah A Zaitlen; Michael E Goddard; Peter M Visscher; Alkes L Price
Journal:  Nat Genet       Date:  2014-02       Impact factor: 38.330

9.  The benefits of selecting phenotype-specific variants for applications of mixed models in genomics.

Authors:  Christoph Lippert; Gerald Quon; Eun Yong Kang; Carl M Kadie; Jennifer Listgarten; David Heckerman
Journal:  Sci Rep       Date:  2013       Impact factor: 4.379

10.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

View more
  14 in total

1.  Quality control, imputation and analysis of genome-wide genotyping data from the Illumina HumanCoreExome microarray.

Authors:  Jonathan R I Coleman; Jack Euesden; Hamel Patel; Amos A Folarin; Stephen Newhouse; Gerome Breen
Journal:  Brief Funct Genomics       Date:  2015-10-05       Impact factor: 4.241

2.  A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS).

Authors:  Huaqing Zhao; Nandita Mitra; Peter A Kanetsky; Katherine L Nathanson; Timothy R Rebbeck
Journal:  Stat Appl Genet Mol Biol       Date:  2018-12-04

3.  Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection.

Authors:  Zhao-Hua Lu; Hongtu Zhu; Rebecca C Knickmeyer; Patrick F Sullivan; Stephanie N Williams; Fei Zou
Journal:  Genet Epidemiol       Date:  2015-10-30       Impact factor: 2.135

4.  On the substructure controls in rare variant analysis: Principal components or variance components?

Authors:  Yiwen Luo; Arnab Maity; Michael C Wu; Chris Smith; Qing Duan; Yun Li; Jung-Ying Tzeng
Journal:  Genet Epidemiol       Date:  2017-12-26       Impact factor: 2.135

5.  Including diverse and admixed populations in genetic epidemiology research.

Authors:  Amke Caliebe; Fasil Tekola-Ayele; Burcu F Darst; Xuexia Wang; Yeunjoo E Song; Jiang Gui; Ronnie A Sebro; David J Balding; Mohamad Saad; Marie-Pierre Dubé
Journal:  Genet Epidemiol       Date:  2022-07-16       Impact factor: 2.344

6.  Genes, exposures, and interactions on preterm birth risk: an exploratory study in an Argentine population.

Authors:  Dario E Elias; Maria R Santos; Hebe Campaña; Fernando A Poletta; Silvina L Heisecke; Juan A Gili; Julia Ratowiecki; Viviana Cosentino; Rocio Uranga; Diana Rojas Málaga; Alice Brinckmann Oliveira Netto; Ana Carolina Brusius-Facchin; César Saleme; Mónica Rittler; Hugo B Krupitzki; Jorge S Lopez Camelo; Lucas G Gimenez
Journal:  J Community Genet       Date:  2022-08-17

Review 7.  Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations.

Authors:  Roseann E Peterson; Karoline Kuchenbaecker; Raymond K Walters; Chia-Yen Chen; Alice B Popejoy; Sathish Periyasamy; Max Lam; Conrad Iyegbe; Rona J Strawbridge; Leslie Brick; Caitlin E Carey; Alicia R Martin; Jacquelyn L Meyers; Jinni Su; Junfang Chen; Alexis C Edwards; Allan Kalungi; Nastassja Koen; Lerato Majara; Emanuel Schwarz; Jordan W Smoller; Eli A Stahl; Patrick F Sullivan; Evangelos Vassos; Bryan Mowry; Miguel L Prieto; Alfredo Cuellar-Barboza; Tim B Bigdeli; Howard J Edenberg; Hailiang Huang; Laramie E Duncan
Journal:  Cell       Date:  2019-10-10       Impact factor: 41.582

Review 8.  Population genetic considerations for using biobanks as international resources in the pandemic era and beyond.

Authors:  Hannah Carress; Daniel John Lawson; Eran Elhaik
Journal:  BMC Genomics       Date:  2021-05-17       Impact factor: 3.969

9.  Accuracy of heritability estimations in presence of hidden population stratification.

Authors:  Claire Dandine-Roulland; Céline Bellenguez; Stéphanie Debette; Philippe Amouyel; Emmanuelle Génin; Hervé Perdry
Journal:  Sci Rep       Date:  2016-05-25       Impact factor: 4.379

10.  An eQTL variant of ZXDC is associated with IFN-γ production following Mycobacterium tuberculosis antigen-specific stimulation.

Authors:  Fabienne Jabot-Hanin; Aurélie Cobat; Jacqueline Feinberg; Marianna Orlova; Jonathan Niay; Caroline Deswarte; Christine Poirier; Ioannis Theodorou; Jacinta Bustamante; Stéphanie Boisson-Dupuis; Jean-Laurent Casanova; Alexandre Alcaïs; Eileen G Hoal; Christophe Delacourt; Erwin Schurr; Laurent Abel
Journal:  Sci Rep       Date:  2017-10-09       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.