MOTIVATION: It is becoming increasingly evident that the analysis of genotype data from recently admixed populations is providing important insights into medical genetics and population history. Such analyses have been used to identify novel disease loci, to understand recombination rate variation and to detect recent selection events. The utility of such studies crucially depends on accurate and unbiased estimation of the ancestry at every genomic locus in recently admixed populations. Although various methods have been proposed and shown to be extremely accurate in two-way admixtures (e.g. African Americans), only a few approaches have been proposed and thoroughly benchmarked on multi-way admixtures (e.g. Latino populations of the Americas). RESULTS: To address these challenges we introduce here methods for local ancestry inference which leverage the structure of linkage disequilibrium in the ancestral population (LAMP-LD), and incorporate the constraint of Mendelian segregation when inferring local ancestry in nuclear family trios (LAMP-HAP). Our algorithms uniquely combine hidden Markov models (HMMs) of haplotype diversity within a novel window-based framework to achieve superior accuracy as compared with published methods. Further, unlike previous methods, the structure of our HMM does not depend on the number of reference haplotypes but on a fixed constant, and it is thereby capable of utilizing large datasets while remaining highly efficient and robust to over-fitting. Through simulations and analysis of real data from 489 nuclear trio families from the mainland US, Puerto Rico and Mexico, we demonstrate that our methods achieve superior accuracy compared with published methods for local ancestry inference in Latinos.
MOTIVATION: It is becoming increasingly evident that the analysis of genotype data from recently admixed populations is providing important insights into medical genetics and population history. Such analyses have been used to identify novel disease loci, to understand recombination rate variation and to detect recent selection events. The utility of such studies crucially depends on accurate and unbiased estimation of the ancestry at every genomic locus in recently admixed populations. Although various methods have been proposed and shown to be extremely accurate in two-way admixtures (e.g. African Americans), only a few approaches have been proposed and thoroughly benchmarked on multi-way admixtures (e.g. Latino populations of the Americas). RESULTS: To address these challenges we introduce here methods for local ancestry inference which leverage the structure of linkage disequilibrium in the ancestral population (LAMP-LD), and incorporate the constraint of Mendelian segregation when inferring local ancestry in nuclear family trios (LAMP-HAP). Our algorithms uniquely combine hidden Markov models (HMMs) of haplotype diversity within a novel window-based framework to achieve superior accuracy as compared with published methods. Further, unlike previous methods, the structure of our HMM does not depend on the number of reference haplotypes but on a fixed constant, and it is thereby capable of utilizing large datasets while remaining highly efficient and robust to over-fitting. Through simulations and analysis of real data from 489 nuclear trio families from the mainland US, Puerto Rico and Mexico, we demonstrate that our methods achieve superior accuracy compared with published methods for local ancestry inference in Latinos.
Authors: Esteban González Burchard; Luisa N Borrell; Shweta Choudhry; Mariam Naqvi; Hui-Ju Tsai; Jose R Rodriguez-Santana; Rocio Chapela; Scott D Rogers; Rui Mei; William Rodriguez-Cintron; Jose F Arena; Rick Kittles; Eliseo J Perez-Stable; Elad Ziv; Neil Risch Journal: Am J Public Health Date: 2005-10-27 Impact factor: 9.308
Authors: Alkes L Price; Nick Patterson; Fuli Yu; David R Cox; Alicja Waliszewska; Gavin J McDonald; Arti Tandon; Christine Schirmer; Julie Neubauer; Gabriel Bedoya; Constanza Duque; Alberto Villegas; Maria Catira Bortolini; Francisco M Salzano; Carla Gallo; Guido Mazzotti; Marcela Tello-Ruiz; Laura Riba; Carlos A Aguilar-Salinas; Samuel Canizales-Quinteros; Marta Menjivar; William Klitz; Brian Henderson; Christopher A Haiman; Cheryl Winkler; Teresa Tusie-Luna; Andres Ruiz-Linares; David Reich Journal: Am J Hum Genet Date: 2007-04-13 Impact factor: 11.025
Authors: David Reich; Nick Patterson; Philip L De Jager; Gavin J McDonald; Alicja Waliszewska; Arti Tandon; Robin R Lincoln; Cari DeLoa; Scott A Fruhan; Philippe Cabre; Odile Bera; Gilbert Semana; M Ann Kelly; David A Francis; Kristin Ardlie; Omar Khan; Bruce A C Cree; Stephen L Hauser; Jorge R Oksenberg; David A Hafler Journal: Nat Genet Date: 2005-09-25 Impact factor: 38.330
Authors: Brenna M Henn; Laura R Botigué; Simon Gravel; Wei Wang; Abra Brisbin; Jake K Byrnes; Karima Fadhlaoui-Zid; Pierre A Zalloua; Andres Moreno-Estrada; Jaume Bertranpetit; Carlos D Bustamante; David Comas Journal: PLoS Genet Date: 2012-01-12 Impact factor: 5.917
Authors: James Y Zou; Danny S Park; Esteban G Burchard; Dara G Torgerson; Maria Pino-Yanes; Yun S Song; Sriram Sankararaman; Eran Halperin; Noah Zaitlen Journal: Proc Natl Acad Sci U S A Date: 2015-10-19 Impact factor: 11.205
Authors: Katherine A Drake; Dara G Torgerson; Christopher R Gignoux; Joshua M Galanter; Lindsey A Roth; Scott Huntsman; Celeste Eng; Sam S Oh; Sook Wah Yee; Lawrence Lin; Carlos D Bustamante; Andrés Moreno-Estrada; Karla Sandoval; Adam Davis; Luisa N Borrell; Harold J Farber; Rajesh Kumar; Pedro C Avila; Emerita Brigino-Buenaventura; Rocio Chapela; Jean G Ford; Michael A Lenoir; Fred Lurmann; Kelley Meade; Denise Serebrisky; Shannon Thyne; William Rodríguez-Cintrón; Saunak Sen; José R Rodríguez-Santana; Ryan D Hernandez; Kathleen M Giacomini; Esteban G Burchard Journal: J Allergy Clin Immunol Date: 2013-08-29 Impact factor: 10.793
Authors: Giulio Genovese; Robert E Handsaker; Heng Li; Eimear E Kenny; Steven A McCarroll Journal: Am J Hum Genet Date: 2013-08-08 Impact factor: 11.025
Authors: Carl D Langefeld; Mary E Comeau; Maggie C Y Ng; Meijian Guan; Latchezar Dimitrov; Poorva Mudgal; Mitzie H Spainhour; Bruce A Julian; Jeffrey C Edberg; Jennifer A Croker; Jasmin Divers; Pamela J Hicks; Donald W Bowden; Gary C Chan; Lijun Ma; Nicholette D Palmer; Robert P Kimberly; Barry I Freedman Journal: Kidney Int Date: 2018-06-07 Impact factor: 10.612
Authors: Maria Pino-Yanes; Christopher R Gignoux; Joshua M Galanter; Albert M Levin; Catarina D Campbell; Celeste Eng; Scott Huntsman; Katherine K Nishimura; Pierre-Antoine Gourraud; Kiana Mohajeri; Brian J O'Roak; Donglei Hu; Rasika A Mathias; Elizabeth A Nguyen; Lindsey A Roth; Badri Padhukasahasram; Andres Moreno-Estrada; Karla Sandoval; Cheryl A Winkler; Fred Lurmann; Adam Davis; Harold J Farber; Kelley Meade; Pedro C Avila; Denise Serebrisky; Rocio Chapela; Jean G Ford; Michael A Lenoir; Shannon M Thyne; Emerita Brigino-Buenaventura; Luisa N Borrell; William Rodriguez-Cintron; Saunak Sen; Rajesh Kumar; Jose R Rodriguez-Santana; Carlos D Bustamante; Fernando D Martinez; Benjamin A Raby; Scott T Weiss; Dan L Nicolae; Carole Ober; Deborah A Meyers; Eugene R Bleecker; Steven J Mack; Ryan D Hernandez; Evan E Eichler; Kathleen C Barnes; L Keoki Williams; Dara G Torgerson; Esteban G Burchard Journal: J Allergy Clin Immunol Date: 2014-12-06 Impact factor: 10.793