Literature DB >> 20413978

Quantification of population structure using correlated SNPs by shrinkage principal components.

Fei Zou1, Seunggeun Lee, Michael R Knowles, Fred A Wright.   

Abstract

BACKGROUND/AIMS: Association studies using unrelated individuals have become the most popular design for mapping complex traits. One of the major challenges of association mapping is avoiding spurious association due to population stratification. Principal component analysis (PCA) on genome-wide marker genotypes is one of the most popular population stratification control methods. It implicitly assumes that the markers are in linkage equilibrium, a condition that is rarely satisfied and that we plan to relax.
METHODS: We carefully examined the impact of linkage disequilibrium (LD) on PCA, and proposed a simple modification of the standard PCA to automatically adjust for the correlations among markers.
RESULTS: We demonstrated that LD patterns in genome-wide association datasets can distort the techniques for stratification control, showing 'subpopulations' reflecting localized LD phenomena rather than plausible population structure. We showed that the proposed method effectively removes the artifactual effect of LD patterns, and successfully recovers underlying population structure that is not apparent from standard PCA.
CONCLUSION: PCA is highly influenced by sets of SNPs with high LD, obscuring the true population substructure. Our shrinkage PCA applies to all available markers, regardless of the LD patterns. The proposed method is easier to implement than most existing LD adjusted PCA methods. Copyright 2010 S. Karger AG, Basel.

Mesh:

Substances:

Year:  2010        PMID: 20413978      PMCID: PMC2912642          DOI: 10.1159/000288706

Source DB:  PubMed          Journal:  Hum Hered        ISSN: 0001-5652            Impact factor:   0.444


  36 in total

Review 1.  Association study designs for complex diseases.

Authors:  L R Cardon; J I Bell
Journal:  Nat Rev Genet       Date:  2001-02       Impact factor: 53.242

2.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

3.  The power of genomic control.

Authors:  S A Bacanu; B Devlin; K Roeder
Journal:  Am J Hum Genet       Date:  2000-05-08       Impact factor: 11.025

4.  Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model.

Authors:  G A Satten; W D Flanders; Q Yang
Journal:  Am J Hum Genet       Date:  2001-01-19       Impact factor: 11.025

Review 5.  The future of genetic case-control studies.

Authors:  N J Schork; D Fallin; B Thiel; X Xu; U Broeckel; H J Jacob; D Cohen
Journal:  Adv Genet       Date:  2001       Impact factor: 1.944

Review 6.  Candidate gene case-control association studies: advantages and potential pitfalls.

Authors:  A K Daly; C P Day
Journal:  Br J Clin Pharmacol       Date:  2001-11       Impact factor: 4.335

Review 7.  Genetic association mapping at the crossroads: which test and why? Overview and practical guidelines.

Authors:  Thomas G Schulze; Francis J McMahon
Journal:  Am J Med Genet       Date:  2002-01-08

8.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

9.  Lactase haplotype diversity in the Old World.

Authors:  E J Hollox; M Poulter; M Zvarik; V Ferak; A Krause; T Jenkins; N Saha; A I Kozlov; D M Swallow
Journal:  Am J Hum Genet       Date:  2000-11-28       Impact factor: 11.025

10.  Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels.

Authors:  Richa Saxena; Benjamin F Voight; Valeriya Lyssenko; Noël P Burtt; Paul I W de Bakker; Hong Chen; Jeffrey J Roix; Sekar Kathiresan; Joel N Hirschhorn; Mark J Daly; Thomas E Hughes; Leif Groop; David Altshuler; Peter Almgren; Jose C Florez; Joanne Meyer; Kristin Ardlie; Kristina Bengtsson Boström; Bo Isomaa; Guillaume Lettre; Ulf Lindblad; Helen N Lyon; Olle Melander; Christopher Newton-Cheh; Peter Nilsson; Marju Orho-Melander; Lennart Råstam; Elizabeth K Speliotes; Marja-Riitta Taskinen; Tiinamaija Tuomi; Candace Guiducci; Anna Berglund; Joyce Carlson; Lauren Gianniny; Rachel Hackett; Liselotte Hall; Johan Holmkvist; Esa Laurila; Marketa Sjögren; Maria Sterner; Aarti Surti; Margareta Svensson; Malin Svensson; Ryan Tewhey; Brendan Blumenstiel; Melissa Parkin; Matthew Defelice; Rachel Barry; Wendy Brodeur; Jody Camarata; Nancy Chia; Mary Fava; John Gibbons; Bob Handsaker; Claire Healy; Kieu Nguyen; Casey Gates; Carrie Sougnez; Diane Gage; Marcia Nizzari; Stacey B Gabriel; Gung-Wei Chirn; Qicheng Ma; Hemang Parikh; Delwood Richardson; Darrell Ricke; Shaun Purcell
Journal:  Science       Date:  2007-04-26       Impact factor: 47.728

View more
  27 in total

1.  Controlling Population Structure in Human Genetic Association Studies with Samples of Unrelated Individuals.

Authors:  Nianjun Liu; Hongyu Zhao; Amit Patki; Nita A Limdi; David B Allison
Journal:  Stat Interface       Date:  2011       Impact factor: 0.582

2.  Allowing for population stratification in association analysis.

Authors:  Huaizhen Qin; Xiaofeng Zhu
Journal:  Methods Mol Biol       Date:  2012

3.  A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS).

Authors:  Huaqing Zhao; Nandita Mitra; Peter A Kanetsky; Katherine L Nathanson; Timothy R Rebbeck
Journal:  Stat Appl Genet Mol Biol       Date:  2018-12-04

4.  Efficient Estimation of Realized Kinship from Single Nucleotide Polymorphism Genotypes.

Authors:  Bowen Wang; Serge Sverdlov; Elizabeth Thompson
Journal:  Genetics       Date:  2017-01-18       Impact factor: 4.562

5.  Imputation and quality control steps for combining multiple genome-wide datasets.

Authors:  Shefali S Verma; Mariza de Andrade; Gerard Tromp; Helena Kuivaniemi; Elizabeth Pugh; Bahram Namjou-Khales; Shubhabrata Mukherjee; Gail P Jarvik; Leah C Kottyan; Amber Burt; Yuki Bradford; Gretta D Armstrong; Kimberly Derr; Dana C Crawford; Jonathan L Haines; Rongling Li; David Crosslin; Marylyn D Ritchie
Journal:  Front Genet       Date:  2014-12-11       Impact factor: 4.599

6.  Improved heritability estimation from genome-wide SNPs.

Authors:  Doug Speed; Gibran Hemani; Michael R Johnson; David J Balding
Journal:  Am J Hum Genet       Date:  2012-12-07       Impact factor: 11.025

7.  Development of single-nucleotide polymorphism-based phylum-specific PCR amplification technique: application to the community analysis using ciliates as a reference organism.

Authors:  Jae-Ho Jung; Sanghee Kim; Seongho Ryu; Min-Seok Kim; Ye-Seul Baek; Se-Joo Kim; Joong-Ki Choi; Joong-Ki Park; Gi-Sik Min
Journal:  Mol Cells       Date:  2012-09-06       Impact factor: 5.034

8.  Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia.

Authors:  Kevin J Galinsky; Gaurav Bhatia; Po-Ru Loh; Stoyan Georgiev; Sayan Mukherjee; Nick J Patterson; Alkes L Price
Journal:  Am J Hum Genet       Date:  2016-02-25       Impact factor: 11.025

9.  Eigenvalue significance testing for genetic association.

Authors:  Yi-Hui Zhou; J S Marron; Fred A Wright
Journal:  Biometrics       Date:  2017-08-29       Impact factor: 2.571

10.  Control of population stratification by correlation-selected principal components.

Authors:  Seunggeun Lee; Fred A Wright; Fei Zou
Journal:  Biometrics       Date:  2010-12-06       Impact factor: 2.571

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.