Phasing of many thousands of genotyped samples.

Amy L Williams, Nick Patterson, Joseph Glessner, Hakon Hakonarson, David Reich.

Abstract

Haplotypes are an important resource for a large number of applications in human genetics, but computationally inferred haplotypes are subject to switch errors that decrease their utility. The accuracy of computationally inferred haplotypes increases with sample size, and although ever larger genotypic data sets are being generated, the fact that existing methods require substantial computational resources limits their applicability to data sets containing tens or hundreds of thousands of samples. Here, we present HAPI-UR (haplotype inference for unrelated samples), an algorithm that is designed to handle unrelated and/or trio and duo family data, that has accuracy comparable to or greater than existing methods, and that is computationally efficient and can be applied to 100,000 samples or more. We use HAPI-UR to phase a data set with 58,207 samples and show that it achieves practical runtime and that switch errors decrease with sample size even with the use of samples from multiple ethnicities. Using a data set with 16,353 samples, we compare HAPI-UR to Beagle, MaCH, IMPUTE2, and SHAPEIT and show that HAPI-UR runs 18× faster than all methods and has a lower switch-error rate than do other methods except for Beagle; with the use of consensus phasing, running HAPI-UR three times gives a slightly lower switch-error rate than Beagle does and is more than six times faster. We demonstrate results similar to those from Beagle on another data set with a higher marker density. Lastly, we show that HAPI-UR has better runtime scaling properties than does Beagle so that for larger data sets, HAPI-UR will be practical and will have an even larger runtime advantage. HAPI-UR is available online (see Web Resources).
Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
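
The abstract's two central quantities, the switch-error rate and consensus phasing over several runs, can be illustrated with a minimal Python sketch. This is not HAPI-UR's implementation: the function names, the 0/1 allele coding, the assumption of no missing data, and the simple majority vote over consecutive heterozygous sites are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# A phased sample: two haplotypes of 0/1 alleles (assumed coding, no missing data).
Haplotypes = Tuple[Sequence[int], Sequence[int]]


def het_sites(hap: Haplotypes) -> List[int]:
    """Indices where the two haplotypes carry different alleles."""
    a, b = hap
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


def switch_error_rate(inferred: Haplotypes, truth: Haplotypes) -> float:
    """Fraction of consecutive heterozygous-site pairs whose relative phase is wrong."""
    sites = het_sites(truth)
    if len(sites) < 2:
        return 0.0
    # Orientation: does inferred haplotype 1 agree with true haplotype 1 here?
    orient = [inferred[0][i] == truth[0][i] for i in sites]
    # A switch error is a change of orientation between adjacent het sites.
    switches = sum(1 for p, q in zip(orient, orient[1:]) if p != q)
    return switches / (len(sites) - 1)


def consensus_phase(runs: Sequence[Haplotypes]) -> Tuple[List[int], List[int]]:
    """Majority vote across runs on the relative phase of consecutive het sites.

    Assumes every run phases the same genotypes; use an odd number of runs
    to avoid ties (a tie is resolved here as "opposite phase").
    """
    h1, h2 = list(runs[0][0]), list(runs[0][1])
    sites = het_sites(runs[0])
    for prev, cur in zip(sites, sites[1:]):
        # Relative phase is label-invariant: compare alleles on one haplotype.
        same_votes = sum(1 for r in runs if r[0][prev] == r[0][cur])
        same = 2 * same_votes > len(runs)
        allele = h1[prev] if same else 1 - h1[prev]
        h1[cur], h2[cur] = allele, 1 - allele
    return h1, h2
```

In this sketch, three runs that each contain a switch error at a different position yield a consensus with no switch errors, because each erroneous relative phase is outvoted two to one.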

Year: 2012    PMID: 22883141    PMCID: PMC3415548    DOI: 10.1016/j.ajhg.2012.06.013

Source DB: PubMed    Journal: Am J Hum Genet    ISSN: 0002-9297    Impact factor: 11.025


