
Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Sharon R Browning, Brian L Browning.

Abstract

Whole-genome association studies present many new statistical and computational challenges due to the large quantity of data obtained. One of these challenges is haplotype inference; methods for haplotype inference designed for small data sets from candidate-gene studies do not scale well to the large number of individuals genotyped in whole-genome association studies. We present a new method and software for inference of haplotype phase and missing data that can accurately phase data from whole-genome association studies, and we present the first comparison of haplotype-inference methods for real and simulated data sets with thousands of genotyped individuals. We find that our method outperforms existing methods in terms of both speed and accuracy for large data sets with thousands of individuals and densely spaced genetic markers, and we use our method to phase a real data set of 3,002 individuals genotyped for 490,032 markers in 3.1 days of computing time, with 99% of masked alleles imputed correctly. Our method is implemented in the Beagle software package, which is freely available.
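
The "masked alleles imputed correctly" figure above refers to hiding known alleles, imputing them, and counting how many are recovered. The following is a minimal sketch of that kind of evaluation, not the authors' procedure: the function names are hypothetical, alleles are coded as 0/1, and the imputer is a naive per-marker major-allele baseline standing in for Beagle's localized haplotype clustering.

```python
import random


def mask_alleles(alleles, fraction, rng):
    """Hide a random fraction of alleles (samples x markers, coded 0/1).

    Returns a masked copy (hidden entries set to None) and the hidden positions.
    """
    positions = [(i, j) for i, row in enumerate(alleles) for j in range(len(row))]
    n_mask = max(1, round(fraction * len(positions)))
    hidden = rng.sample(positions, n_mask)
    masked = [row[:] for row in alleles]
    for i, j in hidden:
        masked[i][j] = None
    return masked, hidden


def impute_major_allele(masked):
    """Baseline imputer (an assumption, NOT Beagle): fill each missing entry
    with the most common observed allele at that marker (0 on ties or no data)."""
    imputed = [row[:] for row in masked]
    for j in range(len(masked[0])):
        observed = [row[j] for row in masked if row[j] is not None]
        major = 1 if observed and 2 * sum(observed) > len(observed) else 0
        for row in imputed:
            if row[j] is None:
                row[j] = major
    return imputed


def masked_accuracy(truth, imputed, hidden):
    """Fraction of hidden alleles that the imputer recovered correctly."""
    correct = sum(truth[i][j] == imputed[i][j] for i, j in hidden)
    return correct / len(hidden)


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy data: 200 haplotypes x 50 markers with a skewed allele frequency.
    truth = [[1 if rng.random() < 0.1 else 0 for _ in range(50)] for _ in range(200)]
    masked, hidden = mask_alleles(truth, 0.01, rng)
    imputed = impute_major_allele(masked)
    print(f"masked-allele accuracy: {masked_accuracy(truth, imputed, hidden):.3f}")
```

A real evaluation would swap `impute_major_allele` for the phasing and imputation method under test while keeping the masking and scoring steps unchanged.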

Year:  2007        PMID: 17924348      PMCID: PMC2265661          DOI: 10.1086/521987

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


