Literature DB >> 16465620

A comparison of phasing algorithms for trios and unrelated individuals.

Jonathan Marchini1, David Cutler, Nick Patterson, Matthew Stephens, Eleazar Eskin, Eran Halperin, Shin Lin, Zhaohui S Qin, Heather M Munro, Goncalo R Abecasis, Peter Donnelly.   

Abstract

Knowledge of haplotype phase is valuable for many analysis methods in the study of disease, population, and evolutionary genetics. Considerable research effort has been devoted to the development of statistical and computational methods that infer haplotype phase from genotype data. Although a substantial number of such methods have been developed, they have focused principally on inference from unrelated individuals, and comparisons between methods have been rather limited. Here, we describe the extension of five leading algorithms for phase inference for handling father-mother-child trios. We performed a comprehensive assessment of the methods applied to both trios and to unrelated individuals, with a focus on genomic-scale problems, using both simulated data and data from the HapMap project. The most accurate algorithm was PHASE (v2.1). For this method, the percentages of genotypes whose phase was incorrectly inferred were 0.12%, 0.05%, and 0.16% for trios from simulated data, HapMap Centre d'Etude du Polymorphisme Humain (CEPH) trios, and HapMap Yoruban trios, respectively, and 5.2% and 5.9% for unrelated individuals in simulated data and the HapMap CEPH data, respectively. The other methods considered in this work had comparable but slightly worse error rates. The error rates for trios are similar to the levels of genotyping error and missing data expected. We thus conclude that all the methods considered will provide highly accurate estimates of haplotypes when applied to trio data sets. Running times differ substantially between methods. Although it is one of the slowest methods, PHASE (v2.1) was used to infer haplotypes for the 1 million-SNP HapMap data set. Finally, we evaluated methods of estimating the value of r(2) between a pair of SNPs and concluded that all methods estimated r(2) well when the estimated value was >or=0.8.

Entities:  

Mesh:

Year:  2006        PMID: 16465620      PMCID: PMC1380287          DOI: 10.1086/500808

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  45 in total

1.  A new statistical method for haplotype reconstruction from population data.

Authors:  M Stephens; N J Smith; P Donnelly
Journal:  Am J Hum Genet       Date:  2001-03-09       Impact factor: 11.025

2.  Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data.

Authors:  D Fallin; N J Schork
Journal:  Am J Hum Genet       Date:  2000-08-22       Impact factor: 11.025

3.  Bounds on the minimum number of recombination events in a sample history.

Authors:  Simon R Myers; Robert C Griffiths
Journal:  Genetics       Date:  2003-01       Impact factor: 4.562

4.  Partition-ligation-expectation-maximization algorithm for haplotype inference with single-nucleotide polymorphisms.

Authors:  Zhaohui S Qin; Tianhua Niu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2002-11       Impact factor: 11.025

5.  A comparison of bayesian methods for haplotype reconstruction from population genotype data.

Authors:  Matthew Stephens; Peter Donnelly
Journal:  Am J Hum Genet       Date:  2003-10-20       Impact factor: 11.025

6.  Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.

Authors:  Christopher S Carlson; Michael A Eberle; Mark J Rieder; Qian Yi; Leonid Kruglyak; Deborah A Nickerson
Journal:  Am J Hum Genet       Date:  2003-12-15       Impact factor: 11.025

7.  Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data.

Authors:  Na Li; Matthew Stephens
Journal:  Genetics       Date:  2003-12       Impact factor: 4.562

8.  Little loss of information due to unknown phase for fine-scale linkage-disequilibrium mapping with single-nucleotide-polymorphism genotype data.

Authors:  A P Morris; J C Whittaker; D J Balding
Journal:  Am J Hum Genet       Date:  2004-04-07       Impact factor: 11.025

9.  Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power.

Authors:  Juliet M Chapman; Jason D Cooper; John A Todd; David G Clayton
Journal:  Hum Hered       Date:  2003       Impact factor: 0.444

10.  A practical algorithm for optimal inference of haplotypes from diploid populations.

Authors:  D Gusfield
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000
View more
  126 in total

1.  A linear complexity phasing method for thousands of genomes.

Authors:  Olivier Delaneau; Jonathan Marchini; Jean-François Zagury
Journal:  Nat Methods       Date:  2011-12-04       Impact factor: 28.547

2.  HapCompass: a fast cycle basis algorithm for accurate haplotype assembly of sequence data.

Authors:  Derek Aguiar; Sorin Istrail
Journal:  J Comput Biol       Date:  2012-06       Impact factor: 1.479

3.  High-resolution detection of identity by descent in unrelated individuals.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2010-03-18       Impact factor: 11.025

Review 4.  Genotype imputation for genome-wide association studies.

Authors:  Jonathan Marchini; Bryan Howie
Journal:  Nat Rev Genet       Date:  2010-07       Impact factor: 53.242

Review 5.  Haplotyping methods for pedigrees.

Authors:  Guimin Gao; David B Allison; Ina Hoeschele
Journal:  Hum Hered       Date:  2009-01-27       Impact factor: 0.444

6.  Haplotype analysis of CYP11A1 identifies promoter variants associated with breast cancer risk.

Authors:  Brian L Yaspan; Joan P Breyer; Qiuyin Cai; Qi Dai; J Bradford Elmore; Isaac Amundson; Kevin M Bradley; Xiao-Ou Shu; Yu-Tang Gao; William D Dupont; Wei Zheng; Jeffrey R Smith
Journal:  Cancer Res       Date:  2007-06-15       Impact factor: 12.701

7.  Genome-Wide Association Study Identifies African-Specific Susceptibility Loci in African Americans With Inflammatory Bowel Disease.

Authors:  Steven R Brant; David T Okou; Claire L Simpson; David J Cutler; Talin Haritunians; Jonathan P Bradfield; Pankaj Chopra; Jarod Prince; Ferdouse Begum; Archana Kumar; Chengrui Huang; Suresh Venkateswaran; Lisa W Datta; Zhi Wei; Kelly Thomas; Lisa J Herrinton; Jan-Micheal A Klapproth; Antonio J Quiros; Jenifer Seminerio; Zhenqiu Liu; Jonathan S Alexander; Robert N Baldassano; Sharon Dudley-Brown; Raymond K Cross; Themistocles Dassopoulos; Lee A Denson; Tanvi A Dhere; Gerald W Dryden; John S Hanson; Jason K Hou; Sunny Z Hussain; Jeffrey S Hyams; Kim L Isaacs; Howard Kader; Michael D Kappelman; Jeffry Katz; Richard Kellermayer; Barbara S Kirschner; John F Kuemmerle; John H Kwon; Mark Lazarev; Ellen Li; David Mack; Peter Mannon; Dedrick E Moulton; Rodney D Newberry; Bankole O Osuntokun; Ashish S Patel; Shehzad A Saeed; Stephan R Targan; John F Valentine; Ming-Hsi Wang; Martin Zonca; John D Rioux; Richard H Duerr; Mark S Silverberg; Judy H Cho; Hakon Hakonarson; Michael E Zwick; Dermot P B McGovern; Subra Kugathasan
Journal:  Gastroenterology       Date:  2016-09-28       Impact factor: 22.682

8.  High-throughput haplotype determination over long distances by haplotype fusion PCR and ligation haplotyping.

Authors:  Daniel J Turner; Matthew E Hurles
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

9.  Diversity and evolution of 11 innate immune genes in Bos taurus taurus and Bos taurus indicus cattle.

Authors:  Christopher M Seabury; Paul M Seabury; Jared E Decker; Robert D Schnabel; Jeremy F Taylor; James E Womack
Journal:  Proc Natl Acad Sci U S A       Date:  2009-12-14       Impact factor: 11.205

Review 10.  Missing data imputation and haplotype phase inference for genome-wide association studies.

Authors:  Sharon R Browning
Journal:  Hum Genet       Date:  2008-10-11       Impact factor: 4.132

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.