Literature DB >> 24123217

Adjusting for population stratification in a fine scale with principal components and sequencing data.

Yiwei Zhang1, Xiaotong Shen, Wei Pan.   

Abstract

Population stratification is of primary interest in genetic studies to infer human evolution history and to avoid spurious findings in association testing. Although it is well studied with high-density single nucleotide polymorphisms (SNPs) in genome-wide association studies (GWASs), next-generation sequencing brings both new opportunities and challenges to uncovering population structures in finer scales. Several recent studies have noticed different confounding effects from variants of different minor allele frequencies (MAFs). In this paper, using a low-coverage sequencing dataset from the 1000 Genomes Project, we compared a popular method, principal component analysis (PCA), with a recently proposed spectral clustering technique, called spectral dimensional reduction (SDR), in detecting and adjusting for population stratification at the level of ethnic subgroups. We investigated the varying performance of adjusting for population stratification with different types and sets of variants when testing on different types of variants. One main conclusion is that principal components based on all variants or common variants were generally most effective in controlling inflations caused by population stratification; in particular, contrary to many speculations on the effectiveness of rare variants, we did not find much added value with the use of only rare variants. In addition, SDR was confirmed to be more robust than PCA, especially when applied to rare variants.
© 2013 WILEY PERIODICALS, INC.

Entities:  

Keywords:  1000 Genomes Project; association testing; common variants; principal component analysis; rare variants; spectral analysis

Mesh:

Year:  2013        PMID: 24123217      PMCID: PMC3864649          DOI: 10.1002/gepi.21764

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  30 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

4.  A comparison of association methods correcting for population stratification in case-control studies.

Authors:  Chengqing Wu; Andrew DeWan; Josephine Hoh; Zuoheng Wang
Journal:  Ann Hum Genet       Date:  2011-01-31       Impact factor: 1.670

5.  A simple and improved correction for population stratification in case-control studies.

Authors:  Michael P Epstein; Andrew S Allen; Glen A Satten
Journal:  Am J Hum Genet       Date:  2007-03-29       Impact factor: 11.025

6.  Genotype-based matching to correct for population stratification in large-scale case-control genetic association studies.

Authors:  Weihua Guan; Liming Liang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2009-09       Impact factor: 2.135

7.  Using BioBin to explore rare variant population stratification.

Authors:  Carrie B Moore; John R Wallace; Alex T Frase; Sarah A Pendergrass; Marylyn D Ritchie
Journal:  Pac Symp Biocomput       Date:  2013

8.  Adjustment for population stratification via principal components in association analysis of rare variants.

Authors:  Yiwei Zhang; Weihua Guan; Wei Pan
Journal:  Genet Epidemiol       Date:  2012-10-12       Impact factor: 2.135

9.  Accounting for population stratification in practice: a comparison of the main strategies dedicated to genome-wide association studies.

Authors:  Matthieu Bouaziz; Christophe Ambroise; Mickael Guedj
Journal:  PLoS One       Date:  2011-12-21       Impact factor: 3.240

10.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

View more
  12 in total

1.  A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS).

Authors:  Huaqing Zhao; Nandita Mitra; Peter A Kanetsky; Katherine L Nathanson; Timothy R Rebbeck
Journal:  Stat Appl Genet Mol Biol       Date:  2018-12-04

Review 2.  Rare-variant association analysis: study designs and statistical tests.

Authors:  Seunggeung Lee; Gonçalo R Abecasis; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2014-07-03       Impact factor: 11.025

3.  Power considerations for λ inflation factor in meta-analyses of genome-wide association studies.

Authors:  Georgios Georgiopoulos; Evangelos Evangelou
Journal:  Genet Res (Camb)       Date:  2016-05-19       Impact factor: 1.588

4.  On the substructure controls in rare variant analysis: Principal components or variance components?

Authors:  Yiwen Luo; Arnab Maity; Michael C Wu; Chris Smith; Qing Duan; Yun Li; Jung-Ying Tzeng
Journal:  Genet Epidemiol       Date:  2017-12-26       Impact factor: 2.135

5.  Principal component regression and linear mixed model in association analysis of structured samples: competitors or complements?

Authors:  Yiwei Zhang; Wei Pan
Journal:  Genet Epidemiol       Date:  2014-12-23       Impact factor: 2.135

6.  Taking population stratification into account by local permutations in rare-variant association studies on small samples.

Authors:  Jimmy Mullaert; Matthieu Bouaziz; Yoann Seeleuthner; Benedetta Bigio; Jean-Laurent Casanova; Alexandre Alcaïs; Laurent Abel; Aurélie Cobat
Journal:  Genet Epidemiol       Date:  2021-08-17       Impact factor: 2.135

7.  Quality Control for the Illumina HumanExome BeadChip.

Authors:  Robert P Igo; Jessica N Cooke Bailey; Jane Romm; Jonathan L Haines; Janey L Wiggs
Journal:  Curr Protoc Hum Genet       Date:  2016-07-01

Review 8.  Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations.

Authors:  Roseann E Peterson; Karoline Kuchenbaecker; Raymond K Walters; Chia-Yen Chen; Alice B Popejoy; Sathish Periyasamy; Max Lam; Conrad Iyegbe; Rona J Strawbridge; Leslie Brick; Caitlin E Carey; Alicia R Martin; Jacquelyn L Meyers; Jinni Su; Junfang Chen; Alexis C Edwards; Allan Kalungi; Nastassja Koen; Lerato Majara; Emanuel Schwarz; Jordan W Smoller; Eli A Stahl; Patrick F Sullivan; Evangelos Vassos; Bryan Mowry; Miguel L Prieto; Alfredo Cuellar-Barboza; Tim B Bigdeli; Howard J Edenberg; Hailiang Huang; Laramie E Duncan
Journal:  Cell       Date:  2019-10-10       Impact factor: 41.582

Review 9.  The impact of rare and low-frequency genetic variants in common disease.

Authors:  Lorenzo Bomba; Klaudia Walter; Nicole Soranzo
Journal:  Genome Biol       Date:  2017-04-27       Impact factor: 13.583

10.  On rare variants in principal component analysis of population stratification.

Authors:  Shengqing Ma; Gang Shi
Journal:  BMC Genet       Date:  2020-03-17       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.