Literature DB >> 25810074

Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness.

Matthew P Conomos1, Michael B Miller, Timothy A Thornton.   

Abstract

Population structure inference with genetic data has been motivated by a variety of applications in population genetics and genetic association studies. Several approaches have been proposed for the identification of genetic ancestry differences in samples where study participants are assumed to be unrelated, including principal components analysis (PCA), multidimensional scaling (MDS), and model-based methods for proportional ancestry estimation. Many genetic studies, however, include individuals with some degree of relatedness, and existing methods for inferring genetic ancestry fail in related samples. We present a method, PC-AiR, for robust population structure inference in the presence of known or cryptic relatedness. PC-AiR utilizes genome-screen data and an efficient algorithm to identify a diverse subset of unrelated individuals that is representative of all ancestries in the sample. The PC-AiR method directly performs PCA on the identified ancestry representative subset and then predicts components of variation for all remaining individuals based on genetic similarities. In simulation studies and in applications to real data from Phase III of the HapMap Project, we demonstrate that PC-AiR provides a substantial improvement over existing approaches for population structure inference in related samples. We also demonstrate significant efficiency gains, where a single axis of variation from PC-AiR provides better prediction of ancestry in a variety of structure settings than using 10 (or more) components of variation from widely used PCA and MDS approaches. Finally, we illustrate that PC-AiR can provide improved population stratification correction over existing methods in genetic association studies with population structure and relatedness.
© 2015 WILEY PERIODICALS, INC.

Entities:  

Keywords:  GWAS; PCA; admixture; cryptic relatedness; pedigrees

Mesh:

Year:  2015        PMID: 25810074      PMCID: PMC4836868          DOI: 10.1002/gepi.21896

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  30 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  Estimation of individual admixture: analytical and study design considerations.

Authors:  Hua Tang; Jie Peng; Pei Wang; Neil J Risch
Journal:  Genet Epidemiol       Date:  2005-05       Impact factor: 2.135

4.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

5.  A unified association analysis approach for family and unrelated samples correcting for stratification.

Authors:  Xiaofeng Zhu; Shengchao Li; Richard S Cooper; Robert C Elston
Journal:  Am J Hum Genet       Date:  2008-02       Impact factor: 11.025

6.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

7.  A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity.

Authors:  D J Balding; R A Nichols
Journal:  Genetica       Date:  1995       Impact factor: 1.082

8.  Local and global ancestry inference and applications to genetic association analysis for admixed populations.

Authors:  Timothy A Thornton; Justo Lorenzo Bermejo
Journal:  Genet Epidemiol       Date:  2014-09       Impact factor: 2.135

9.  Worldwide human relationships inferred from genome-wide patterns of variation.

Authors:  Jun Z Li; Devin M Absher; Hua Tang; Audrey M Southwick; Amanda M Casto; Sohini Ramachandran; Howard M Cann; Gregory S Barsh; Marcus Feldman; Luigi L Cavalli-Sforza; Richard M Myers
Journal:  Science       Date:  2008-02-22       Impact factor: 47.728

10.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

View more
  131 in total

1.  Genetic Diversity and Association Studies in US Hispanic/Latino Populations: Applications in the Hispanic Community Health Study/Study of Latinos.

Authors:  Matthew P Conomos; Cecelia A Laurie; Adrienne M Stilp; Stephanie M Gogarten; Caitlin P McHugh; Sarah C Nelson; Tamar Sofer; Lindsay Fernández-Rhodes; Anne E Justice; Mariaelisa Graff; Kristin L Young; Amanda A Seyerle; Christy L Avery; Kent D Taylor; Jerome I Rotter; Gregory A Talavera; Martha L Daviglus; Sylvia Wassertheil-Smoller; Neil Schneiderman; Gerardo Heiss; Robert C Kaplan; Nora Franceschini; Alex P Reiner; John R Shaffer; R Graham Barr; Kathleen F Kerr; Sharon R Browning; Brian L Browning; Bruce S Weir; M Larissa Avilés-Santa; George J Papanicolaou; Thomas Lumley; Adam A Szpiro; Kari E North; Ken Rice; Timothy A Thornton; Cathy C Laurie
Journal:  Am J Hum Genet       Date:  2016-01-07       Impact factor: 11.025

2.  Model-free Estimation of Recent Genetic Relatedness.

Authors:  Matthew P Conomos; Alexander P Reiner; Bruce S Weir; Timothy A Thornton
Journal:  Am J Hum Genet       Date:  2016-01-07       Impact factor: 11.025

3.  Genome-wide association study of dental caries in the Hispanic Communities Health Study/Study of Latinos (HCHS/SOL).

Authors:  Jean Morrison; Cathy C Laurie; Mary L Marazita; Anne E Sanders; Steven Offenbacher; Christian R Salazar; Matthew P Conomos; Timothy Thornton; Deepti Jain; Cecelia A Laurie; Kathleen F Kerr; George Papanicolaou; Kent Taylor; Linda M Kaste; James D Beck; John R Shaffer
Journal:  Hum Mol Genet       Date:  2015-12-11       Impact factor: 6.150

4.  Genome-wide Association Study of Platelet Count Identifies Ancestry-Specific Loci in Hispanic/Latino Americans.

Authors:  Ursula M Schick; Deepti Jain; Chani J Hodonsky; Jean V Morrison; James P Davis; Lisa Brown; Tamar Sofer; Matthew P Conomos; Claudia Schurmann; Caitlin P McHugh; Sarah C Nelson; Swarooparani Vadlamudi; Adrienne Stilp; Anna Plantinga; Leslie Baier; Stephanie A Bien; Stephanie M Gogarten; Cecelia A Laurie; Kent D Taylor; Yongmei Liu; Paul L Auer; Nora Franceschini; Adam Szpiro; Ken Rice; Kathleen F Kerr; Jerome I Rotter; Robert L Hanson; George Papanicolaou; Stephen S Rich; Ruth J F Loos; Brian L Browning; Sharon R Browning; Bruce S Weir; Cathy C Laurie; Karen L Mohlke; Kari E North; Timothy A Thornton; Alex P Reiner
Journal:  Am J Hum Genet       Date:  2016-01-21       Impact factor: 11.025

5.  Evaluation of population stratification adjustment using genome-wide or exonic variants.

Authors:  Yuning Chen; Gina M Peloso; Ching-Ti Liu; Anita L DeStefano; Josée Dupuis
Journal:  Genet Epidemiol       Date:  2020-06-30       Impact factor: 2.135

6.  Integrated analysis of genomics, longitudinal metabolomics, and Alzheimer's risk factors among 1,111 cohort participants.

Authors:  Burcu F Darst; Qiongshi Lu; Sterling C Johnson; Corinne D Engelman
Journal:  Genet Epidemiol       Date:  2019-05-18       Impact factor: 2.135

7.  Genetic association testing using the GENESIS R/Bioconductor package.

Authors:  Stephanie M Gogarten; Tamar Sofer; Han Chen; Chaoyu Yu; Jennifer A Brody; Timothy A Thornton; Kenneth M Rice; Matthew P Conomos
Journal:  Bioinformatics       Date:  2019-12-15       Impact factor: 6.937

8.  Genome-Wide Association Study of Heavy Smoking and Daily/Nondaily Smoking in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).

Authors:  Nancy L Saccone; Leslie S Emery; Tamar Sofer; Stephanie M Gogarten; Diane M Becker; Erwin P Bottinger; Li-Shiun Chen; Robert C Culverhouse; Weimin Duan; Dana B Hancock; H Dean Hosgood; Eric O Johnson; Ruth J F Loos; Tin Louie; George Papanicolaou; Krista M Perreira; Erik J Rodriquez; Claudia Schurmann; Adrienne M Stilp; Adam A Szpiro; Gregory A Talavera; Kent D Taylor; James F Thrasher; Lisa R Yanek; Cathy C Laurie; Eliseo J Pérez-Stable; Laura J Bierut; Robert C Kaplan
Journal:  Nicotine Tob Res       Date:  2018-03-06       Impact factor: 4.244

9.  Genetic heterogeneity of Alzheimer's disease in subjects with and without hypertension.

Authors:  Alireza Nazarian; Konstantin G Arbeev; Arseniy P Yashkin; Alexander M Kulminski
Journal:  Geroscience       Date:  2019-05-05       Impact factor: 7.713

Review 10.  Genetics of Obesity in Diverse Populations.

Authors:  Kristin L Young; Mariaelisa Graff; Lindsay Fernandez-Rhodes; Kari E North
Journal:  Curr Diab Rep       Date:  2018-11-19       Impact factor: 4.810

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.