Literature DB >> 26027497

Improved ancestry estimation for both genotyping and sequencing data using projection procrustes analysis and genotype imputation.

Chaolong Wang1, Xiaowei Zhan2, Liming Liang3, Gonçalo R Abecasis4, Xihong Lin5.   

Abstract

Accurate estimation of individual ancestry is important in genetic association studies, especially when a large number of samples are collected from multiple sources. However, existing approaches developed for genome-wide SNP data do not work well with modest amounts of genetic data, such as in targeted sequencing or exome chip genotyping experiments. We propose a statistical framework to estimate individual ancestry in a principal component ancestry map generated by a reference set of individuals. This framework extends and improves upon our previous method for estimating ancestry using low-coverage sequence reads (LASER 1.0) to analyze either genotyping or sequencing data. In particular, we introduce a projection Procrustes analysis approach that uses high-dimensional principal components to estimate ancestry in a low-dimensional reference space. Using extensive simulations and empirical data examples, we show that our new method (LASER 2.0), combined with genotype imputation on the reference individuals, can substantially outperform LASER 1.0 in estimating fine-scale genetic ancestry. Specifically, LASER 2.0 can accurately estimate fine-scale ancestry within Europe using either exome chip genotypes or targeted sequencing data with off-target coverage as low as 0.05×. Under the framework of LASER 2.0, we can estimate individual ancestry in a shared reference space for samples assayed at different loci or by different techniques. Therefore, our ancestry estimation method will accelerate discovery in disease association studies not only by helping model ancestry within individual studies but also by facilitating combined analysis of genetic data from multiple sources.
Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Mesh:

Year:  2015        PMID: 26027497      PMCID: PMC4457959          DOI: 10.1016/j.ajhg.2015.04.018

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  37 in total

1.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

2.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

3.  CONVERGENCE AND PREDICTION OF PRINCIPAL COMPONENT SCORES IN HIGH-DIMENSIONAL SETTINGS.

Authors:  Seunggeun Lee; Fei Zou; Fred A Wright
Journal:  Ann Stat       Date:  2010-01-01       Impact factor: 4.028

4.  A model-based approach for analysis of spatial structure in genetic data.

Authors:  Wen-Yun Yang; John Novembre; Eleazar Eskin; Eran Halperin
Journal:  Nat Genet       Date:  2012-05-20       Impact factor: 38.330

Review 5.  Sequencing studies in human genetics: design and interpretation.

Authors:  David B Goldstein; Andrew Allen; Jonathan Keebler; Elliott H Margulies; Steven Petrou; Slavé Petrovski; Shamil Sunyaev
Journal:  Nat Rev Genet       Date:  2013-06-11       Impact factor: 53.242

6.  Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease.

Authors:  Manuel A Rivas; Mélissa Beaudoin; Agnes Gardet; Christine Stevens; Yashoda Sharma; Clarence K Zhang; Gabrielle Boucher; Stephan Ripke; David Ellinghaus; Noel Burtt; Tim Fennell; Andrew Kirby; Anna Latiano; Philippe Goyette; Todd Green; Jonas Halfvarson; Talin Haritunians; Joshua M Korn; Finny Kuruvilla; Caroline Lagacé; Benjamin Neale; Ken Sin Lo; Phil Schumm; Leif Törkvist; Marla C Dubinsky; Steven R Brant; Mark S Silverberg; Richard H Duerr; David Altshuler; Stacey Gabriel; Guillaume Lettre; Andre Franke; Mauro D'Amato; Dermot P B McGovern; Judy H Cho; John D Rioux; Ramnik J Xavier; Mark J Daly
Journal:  Nat Genet       Date:  2011-10-09       Impact factor: 38.330

7.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

8.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

9.  Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants.

Authors:  Wenqing Fu; Timothy D O'Connor; Goo Jun; Hyun Min Kang; Goncalo Abecasis; Suzanne M Leal; Stacey Gabriel; Mark J Rieder; David Altshuler; Jay Shendure; Deborah A Nickerson; Michael J Bamshad; Joshua M Akey
Journal:  Nature       Date:  2012-11-28       Impact factor: 49.962

10.  A quantitative comparison of the similarity between genes and geography in worldwide human populations.

Authors:  Chaolong Wang; Sebastian Zöllner; Noah A Rosenberg
Journal:  PLoS Genet       Date:  2012-08-23       Impact factor: 5.917

View more
  55 in total

1.  Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors.

Authors:  Huipeng Li; Elise T Courtois; Debarka Sengupta; Yuliana Tan; Kok Hao Chen; Jolene Jie Lin Goh; Say Li Kong; Clarinda Chua; Lim Kiat Hon; Wah Siew Tan; Mark Wong; Paul Jongjoon Choi; Lawrence J K Wee; Axel M Hillmer; Iain Beehuat Tan; Paul Robson; Shyam Prabhakar
Journal:  Nat Genet       Date:  2017-03-20       Impact factor: 38.330

2.  Rare loss of function variants in candidate genes and risk of colorectal cancer.

Authors:  Elisabeth A Rosenthal; Brian H Shirts; Laura M Amendola; Martha Horike-Pyne; Peggy D Robertson; Fuki M Hisama; Robin L Bennett; Michael O Dorschner; Deborah A Nickerson; Ian B Stanaway; Rami Nassir; Kathy T Vickers; Christopher Li; William M Grady; Ulrike Peters; Gail P Jarvik
Journal:  Hum Genet       Date:  2018-09-28       Impact factor: 4.132

3.  Early farmers from across Europe directly descended from Neolithic Aegeans.

Authors:  Zuzana Hofmanová; Susanne Kreutzer; Garrett Hellenthal; Christian Sell; Yoan Diekmann; David Díez-Del-Molino; Lucy van Dorp; Saioa López; Athanasios Kousathanas; Vivian Link; Karola Kirsanow; Lara M Cassidy; Rui Martiniano; Melanie Strobel; Amelie Scheu; Kostas Kotsakis; Paul Halstead; Sevi Triantaphyllou; Nina Kyparissi-Apostolika; Dushka Urem-Kotsou; Christina Ziota; Fotini Adaktylou; Shyamalika Gopalan; Dean M Bobo; Laura Winkelbach; Jens Blöcher; Martina Unterländer; Christoph Leuenberger; Çiler Çilingiroğlu; Barbara Horejs; Fokke Gerritsen; Stephen J Shennan; Daniel G Bradley; Mathias Currat; Krishna R Veeramah; Daniel Wegmann; Mark G Thomas; Christina Papageorgopoulou; Joachim Burger
Journal:  Proc Natl Acad Sci U S A       Date:  2016-06-06       Impact factor: 11.205

Review 4.  The African diaspora: history, adaptation and health.

Authors:  Charles N Rotimi; Fasil Tekola-Ayele; Jennifer L Baker; Daniel Shriner
Journal:  Curr Opin Genet Dev       Date:  2016-09-16       Impact factor: 5.578

5.  Rare variant association test in family-based sequencing studies.

Authors:  Xuefeng Wang; Zhenyu Zhang; Nathan Morris; Tianxi Cai; Seunggeun Lee; Chaolong Wang; Timothy W Yu; Christopher A Walsh; Xihong Lin
Journal:  Brief Bioinform       Date:  2017-11-01       Impact factor: 11.622

6.  Genes for Good: Engaging the Public in Genetics Research via Social Media.

Authors:  Katharine Brieger; Gregory J M Zajac; Anita Pandit; Johanna R Foerster; Kevin W Li; Aubrey C Annis; Ellen M Schmidt; Chris P Clark; Karly McMorrow; Wei Zhou; Jingjing Yang; Alan M Kwong; Andrew P Boughton; Jinxi Wu; Chris Scheller; Tanvi Parikh; Alejandro de la Vega; David M Brazel; Maia Frieser; Gianna Rea-Sandin; Lars G Fritsche; Scott I Vrieze; Gonçalo R Abecasis
Journal:  Am J Hum Genet       Date:  2019-06-13       Impact factor: 11.025

Review 7.  Population Stratification in Genetic Association Studies.

Authors:  Jacklyn N Hellwege; Jacob M Keaton; Ayush Giri; Xiaoyi Gao; Digna R Velez Edwards; Todd L Edwards
Journal:  Curr Protoc Hum Genet       Date:  2017-10-18

8.  Polygenic risk score identifies associations between sleep duration and diseases determined from an electronic medical record biobank.

Authors:  Hassan S Dashti; Susan Redline; Richa Saxena
Journal:  Sleep       Date:  2019-03-01       Impact factor: 5.849

9.  Isolation-by-distance-and-time in a stepping-stone model.

Authors:  Nicolas Duforet-Frebourg; Montgomery Slatkin
Journal:  Theor Popul Biol       Date:  2015-11-21       Impact factor: 1.570

10.  Fast and robust ancestry prediction using principal component analysis.

Authors:  Daiwei Zhang; Rounak Dey; Seunggeun Lee
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.