Stratification-score matching improves correction for confounding by population stratification in case-control association studies.

Michael P Epstein, Richard Duncan, K Alaine Broadaway, Min He, Andrew S Allen, Glen A Satten.

Abstract

Proper control of confounding due to population stratification is crucial for valid analysis of case-control association studies. Fine matching of cases and controls based on genetic ancestry is an increasingly popular strategy to correct for such confounding, both in genome-wide association studies (GWASs) and in studies that employ next-generation sequencing, where matching can be used when selecting a subset of participants from a GWAS for rare-variant analysis. Existing matching methods match on measures of genetic ancestry that combine multiple components of ancestry into a scalar quantity. However, we show that including nonconfounding ancestry components in a matching criterion can lead to inaccurate matches, and hence to improper control of confounding. To resolve this issue, we propose a novel method that assigns cases and controls to matched strata based on the stratification score (Epstein et al. [2007] Am J Hum Genet 80:921-930), which is the probability of disease given genomic variables. Matching on the stratification score leads to more accurate matches because case participants are matched to control participants who have a similar risk of disease given ancestry information. We illustrate our matching method using the African-American arm of the GAIN GWAS of schizophrenia. In this study, we observe that confounding due to stratification can be resolved by our matching approach but not by other existing matching procedures. We also use simulated data to show that our matching approach can provide a more appropriate correction for population stratification than existing matching approaches.
© 2012 Wiley Periodicals, Inc.
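
The idea in the abstract, estimating each participant's probability of disease given ancestry variables and then grouping cases and controls with similar values, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes ancestry is summarized by a few principal components, estimates the score with a plain logistic regression, and forms strata from score quantiles as a stand-in for the paper's matching procedure. All function names (fit_logistic, stratification_score, assign_strata) and the simulated data are hypothetical.

```python
import numpy as np


def fit_logistic(X, y, n_iter=25, ridge=1e-6):
    """Fit logistic regression (with intercept) by Newton-Raphson."""
    Xd = np.column_stack([np.ones(len(X)), X])
    beta = np.zeros(Xd.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-Xd @ beta))
        w = p * (1.0 - p)
        grad = Xd.T @ (y - p)
        # Small ridge term keeps the Hessian invertible (an assumption
        # made for numerical stability, not part of the source).
        hess = (Xd * w[:, None]).T @ Xd + ridge * np.eye(Xd.shape[1])
        beta = beta + np.linalg.solve(hess, grad)
    return beta


def stratification_score(X, y):
    """Estimated probability of disease given the ancestry variables X."""
    beta = fit_logistic(X, y)
    Xd = np.column_stack([np.ones(len(X)), X])
    return 1.0 / (1.0 + np.exp(-Xd @ beta))


def assign_strata(score, n_strata=5):
    """Assign each participant to a stratum by score quantile.

    Quantile strata are a simplification chosen for illustration.
    """
    edges = np.quantile(score, np.linspace(0, 1, n_strata + 1)[1:-1])
    return np.searchsorted(edges, score, side="right")


# Simulated example: two ancestry components, of which only the first
# affects disease risk (the second is a nonconfounding component).
rng = np.random.default_rng(0)
n = 2000
pcs = rng.normal(size=(n, 2))
risk = 1.0 / (1.0 + np.exp(-(-0.5 + 1.5 * pcs[:, 0])))
y = rng.binomial(1, risk)

score = stratification_score(pcs, y)
strata = assign_strata(score, n_strata=5)

for s in range(5):
    in_s = strata == s
    print(f"stratum {s}: cases={int(y[in_s].sum())}, "
          f"controls={int((1 - y[in_s]).sum())}")
```

Because the score is driven by the components that predict disease, the second simulated component contributes little to stratum membership, which is the behaviour the abstract contrasts with matching on a scalar summary of all ancestry components. A downstream analysis would then condition on stratum, for example with conditional logistic regression.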

Year:  2012        PMID: 22714934      PMCID: PMC3671578          DOI: 10.1002/gepi.21611

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  34 in total

1.  Qualitative semi-parametric test for genetic associations in case-control designs under structured populations.

Authors:  H-S Chen; X Zhu; H Zhao; S Zhang
Journal:  Ann Hum Genet       Date:  2003-05       Impact factor: 1.670

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  New models of collaboration in genome-wide association studies: the Genetic Association Information Network.

Authors:  Teri A Manolio; Laura Lyman Rodriguez; Lisa Brooks; Gonçalo Abecasis; Dennis Ballinger; Mark Daly; Peter Donnelly; Stephen V Faraone; Kelly Frazer; Stacey Gabriel; Pablo Gejman; Alan Guttmacher; Emily L Harris; Thomas Insel; John R Kelsoe; Eric Lander; Norma McCowin; Matthew D Mailman; Elizabeth Nabel; James Ostell; Elizabeth Pugh; Stephen Sherry; Patrick F Sullivan; John F Thompson; James Warram; David Wholley; Patrice M Milos; Francis S Collins
Journal:  Nat Genet       Date:  2007-09       Impact factor: 38.330

4.  A simple and improved correction for population stratification in case-control studies.

Authors:  Michael P Epstein; Andrew S Allen; Glen A Satten
Journal:  Am J Hum Genet       Date:  2007-03-29       Impact factor: 11.025

5.  Genotype-based matching to correct for population stratification in large-scale case-control genetic association studies.

Authors:  Weihua Guan; Liming Liang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2009-09       Impact factor: 2.135

6.  Discovering genetic ancestry using spectral graph theory.

Authors:  Ann B Lee; Diana Luca; Lambertus Klei; Bernie Devlin; Kathryn Roeder
Journal:  Genet Epidemiol       Date:  2010-01       Impact factor: 2.135

7.  Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels.

Authors:  Richa Saxena; Benjamin F Voight; Valeriya Lyssenko; Noël P Burtt; Paul I W de Bakker; Hong Chen; Jeffrey J Roix; Sekar Kathiresan; Joel N Hirschhorn; Mark J Daly; Thomas E Hughes; Leif Groop; David Altshuler; Peter Almgren; Jose C Florez; Joanne Meyer; Kristin Ardlie; Kristina Bengtsson Boström; Bo Isomaa; Guillaume Lettre; Ulf Lindblad; Helen N Lyon; Olle Melander; Christopher Newton-Cheh; Peter Nilsson; Marju Orho-Melander; Lennart Råstam; Elizabeth K Speliotes; Marja-Riitta Taskinen; Tiinamaija Tuomi; Candace Guiducci; Anna Berglund; Joyce Carlson; Lauren Gianniny; Rachel Hackett; Liselotte Hall; Johan Holmkvist; Esa Laurila; Marketa Sjögren; Maria Sterner; Aarti Surti; Margareta Svensson; Malin Svensson; Ryan Tewhey; Brendan Blumenstiel; Melissa Parkin; Matthew Defelice; Rachel Barry; Wendy Brodeur; Jody Camarata; Nancy Chia; Mary Fava; John Gibbons; Bob Handsaker; Claire Healy; Kieu Nguyen; Casey Gates; Carrie Sougnez; Diane Gage; Marcia Nizzari; Stacey B Gabriel; Gung-Wei Chirn; Qicheng Ma; Hemang Parikh; Delwood Richardson; Darrell Ricke; Shaun Purcell
Journal:  Science       Date:  2007-04-26       Impact factor: 47.728

8.  A novel haplotype-sharing approach for genome-wide case-control association studies implicates the calpastatin gene in Parkinson's disease.

Authors:  Andrew S Allen; Glen A Satten
Journal:  Genet Epidemiol       Date:  2009-12       Impact factor: 2.135

9.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

10.  Effect of population stratification on the identification of significant single-nucleotide polymorphisms in genome-wide association studies.

Authors:  Sara M Sarasua; Julianne S Collins; Dhelia M Williamson; Glen A Satten; Andrew S Allen
Journal:  BMC Proc       Date:  2009-12-15
  11 in total

1.  A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS).

Authors:  Huaqing Zhao; Nandita Mitra; Peter A Kanetsky; Katherine L Nathanson; Timothy R Rebbeck
Journal:  Stat Appl Genet Mol Biol       Date:  2018-12-04

2.  Contributions of Rare Gene Variants to Familial and Sporadic FSGS.

Authors:  Minxian Wang; Justin Chun; Giulio Genovese; Andrea U Knob; Ava Benjamin; Maris S Wilkins; David J Friedman; Gerald B Appel; Richard P Lifton; Shrikant Mane; Martin R Pollak
Journal:  J Am Soc Nephrol       Date:  2019-07-15       Impact factor: 10.121

3.  A fast and noise-resilient approach to detect rare-variant associations with deep sequencing data for complex disorders.

Authors:  Yee Him Cheung; Gao Wang; Suzanne M Leal; Shuang Wang
Journal:  Genet Epidemiol       Date:  2012-08-03       Impact factor: 2.135

4.  Multi-Ancestry Genome-Wide Association Study of Spontaneous Clearance of Hepatitis C Virus.

Authors:  Candelaria Vergara; Chloe L Thio; Eric Johnson; Alex H Kral; Thomas R O'Brien; James J Goedert; Alessandra Mangia; Valeria Piazzolla; Shruti H Mehta; Gregory D Kirk; Arthur Y Kim; Georg M Lauer; Raymond T Chung; Andrea L Cox; Marion G Peters; Salim I Khakoo; Laurent Alric; Matthew E Cramp; Sharyne M Donfield; Brian R Edlin; Michael P Busch; Graeme Alexander; Hugo R Rosen; Edward L Murphy; Rachel Latanich; Genevieve L Wojcik; Margaret A Taub; Ana Valencia; David L Thomas; Priya Duggal
Journal:  Gastroenterology       Date:  2018-12-26       Impact factor: 22.682

5.  Novel genetic matching methods for handling population stratification in genome-wide association studies.

Authors:  André Lacour; Vitalia Schüller; Dmitriy Drichel; Christine Herold; Frank Jessen; Markus Leber; Wolfgang Maier; Markus M Noethen; Alfredo Ramirez; Tatsiana Vaitsiakhovich; Tim Becker
Journal:  BMC Bioinformatics       Date:  2015-03-14       Impact factor: 3.169

6.  Identification of genetic risk factors in the Chinese population implicates a role of immune system in Alzheimer's disease pathogenesis.

Authors:  Xiaopu Zhou; Yu Chen; Kin Y Mok; Qianhua Zhao; Keliang Chen; Yuewen Chen; John Hardy; Yun Li; Amy K Y Fu; Qihao Guo; Nancy Y Ip
Journal:  Proc Natl Acad Sci U S A       Date:  2018-02-05       Impact factor: 11.205

7.  Sparse conditional logistic regression for analyzing large-scale matched data from epidemiological studies: a simple algorithm.

Authors:  Marta Avalos; Hélène Pouyes; Yves Grandvalet; Ludivine Orriols; Emmanuel Lagarde
Journal:  BMC Bioinformatics       Date:  2015-04-17       Impact factor: 3.169

8.  Families or Unrelated: The Evolving Debate in Genetic Association Studies.

Authors:  David W Fardo; Richard Charnigo; Michael P Epstein
Journal:  J Biom Biostat       Date:  2012-06-01

9.  The relationship between three-dimensional knee MRI bone shape and total knee replacement-a case control study: data from the Osteoarthritis Initiative.

Authors:  Andrew J Barr; Bright Dube; Elizabeth M A Hensor; Sarah R Kingsbury; George Peat; Mike A Bowes; Linda D Sharples; Philip G Conaghan
Journal:  Rheumatology (Oxford)       Date:  2016-05-15       Impact factor: 7.580

Review 10.  Extracorporeal Life Support: The Next Step in Moderate to Severe ARDS-A Review and Meta-Analysis of the Literature.

Authors:  Diamanto Aretha; Fotini Fligou; Panagiotis Kiekkas; Vasilis Karamouzos; Gregorios Voyagis
Journal:  Biomed Res Int       Date:  2019-09-29       Impact factor: 3.411
