Literature DB >> 15615859

GERBIL: Genotype resolution and block identification using likelihood.

Gad Kimmel1, Ron Shamir.   

Abstract

The abundance of genotype data generated by individual and international efforts carries the promise of revolutionizing disease studies and the association of phenotypes with individual polymorphisms. A key challenge is providing an accurate resolution (phasing) of the genotypes into haplotypes. We present here results on a method for genotype phasing in the presence of recombination. Our analysis is based on a stochastic model for recombination-poor regions ("blocks"), in which haplotypes are generated from a small number of core haplotypes, allowing for mutations, rare recombinations, and errors. We formulate genotype resolution and block partitioning as a maximum-likelihood problem and solve it by an expectation-maximization algorithm. The algorithm was implemented in a software package called GERBIL (genotype resolution and block identification using likelihood), which is efficient and simple to use. We tested GERBIL on four large-scale sets of genotypes. It outperformed two state-of-the-art phasing algorithms. The phase algorithm was slightly more accurate than GERBIL when allowed to run with default parameters, but required two orders of magnitude more time. When using comparable running times, GERBIL was consistently more accurate. For data sets with hundreds of genotypes, the time required by phase becomes prohibitive. We conclude that GERBIL has a clear advantage for studies that include many hundreds of genotypes and, in particular, for large-scale disease studies.

Entities:  

Mesh:

Year:  2004        PMID: 15615859      PMCID: PMC544046          DOI: 10.1073/pnas.0404730102

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  20 in total

1.  Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms.

Authors:  Tianhua Niu; Zhaohui S Qin; Xiping Xu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2001-11-26       Impact factor: 11.025

2.  A dynamic programming algorithm for haplotype block partitioning.

Authors:  Kui Zhang; Minghua Deng; Ting Chen; Michael S Waterman; Fengzhu Sun
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-28       Impact factor: 11.205

3.  The structure of haplotype blocks in the human genome.

Authors:  Stacey B Gabriel; Stephen F Schaffner; Huy Nguyen; Jamie M Moore; Jessica Roy; Brendan Blumenstiel; John Higgins; Matthew DeFelice; Amy Lochner; Maura Faggart; Shau Neen Liu-Cordero; Charles Rotimi; Adebowale Adeyemo; Richard Cooper; Ryk Ward; Eric S Lander; Mark J Daly; David Altshuler
Journal:  Science       Date:  2002-05-23       Impact factor: 47.728

Review 4.  Progress in high throughput SNP genotyping methods.

Authors:  Z Tsuchihashi; N C Dracopoli
Journal:  Pharmacogenomics J       Date:  2002       Impact factor: 3.550

5.  A comparison of bayesian methods for haplotype reconstruction from population genotype data.

Authors:  Matthew Stephens; Peter Donnelly
Journal:  Am J Hum Genet       Date:  2003-10-20       Impact factor: 11.025

6.  Haplotyping as perfect phylogeny: a direct approach.

Authors:  Vineet Bafna; Dan Gusfield; Giuseppe Lancia; Shibu Yooseph
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

7.  MENDELIAN PROPORTIONS IN A MIXED POPULATION.

Authors:  G H Hardy
Journal:  Science       Date:  1908-07-10       Impact factor: 47.728

8.  A comparison of linkage disequilibrium measures for fine-scale mapping.

Authors:  B Devlin; N Risch
Journal:  Genomics       Date:  1995-09-20       Impact factor: 5.736

9.  An E-M algorithm and testing strategy for multiple-locus haplotypes.

Authors:  J C Long; R C Williams; M Urbanek
Journal:  Am J Hum Genet       Date:  1995-03       Impact factor: 11.025

10.  SNPing away at complex diseases: analysis of single-nucleotide polymorphisms around APOE in Alzheimer disease.

Authors:  E R Martin; E H Lai; J R Gilbert; A R Rogala; A J Afshari; J Riley; K L Finch; J F Stevens; K J Livak; B D Slotterbeck; S H Slifer; L L Warren; P M Conneally; D E Schmechel; I Purvis; M A Pericak-Vance; A D Roses; J M Vance
Journal:  Am J Hum Genet       Date:  2000-06-21       Impact factor: 11.025

View more
  45 in total

Review 1.  An overview of population genetic data simulation.

Authors:  Xiguo Yuan; David J Miller; Junying Zhang; David Herrington; Yue Wang
Journal:  J Comput Biol       Date:  2011-12-09       Impact factor: 1.479

2.  Fast and accurate inference of local ancestry in Latino populations.

Authors:  Yael Baran; Bogdan Pasaniuc; Sriram Sankararaman; Dara G Torgerson; Christopher Gignoux; Celeste Eng; William Rodriguez-Cintron; Rocio Chapela; Jean G Ford; Pedro C Avila; Jose Rodriguez-Santana; Esteban Gonzàlez Burchard; Eran Halperin
Journal:  Bioinformatics       Date:  2012-04-11       Impact factor: 6.937

3.  A coalescence-guided hierarchical Bayesian method for haplotype inference.

Authors:  Yu Zhang; Tianhua Niu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2006-06-28       Impact factor: 11.025

4.  A fast method for computing high-significance disease association in large population-based studies.

Authors:  Gad Kimmel; Ron Shamir
Journal:  Am J Hum Genet       Date:  2006-07-24       Impact factor: 11.025

5.  Haplotype reconstruction using perfect phylogeny and sequence data.

Authors:  Anatoly Efros; Eran Halperin
Journal:  BMC Bioinformatics       Date:  2012-04-19       Impact factor: 3.169

6.  A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase.

Authors:  Paul Scheet; Matthew Stephens
Journal:  Am J Hum Genet       Date:  2006-02-17       Impact factor: 11.025

7.  Association mapping and significance estimation via the coalescent.

Authors:  Gad Kimmel; Richard M Karp; Michael I Jordan; Eran Halperin
Journal:  Am J Hum Genet       Date:  2008-11-20       Impact factor: 11.025

8.  SNP imputation in association studies.

Authors:  Eran Halperin; Dietrich A Stephan
Journal:  Nat Biotechnol       Date:  2009-04       Impact factor: 54.908

9.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2007-09-21       Impact factor: 11.025

10.  Strengthening the reporting of genetic association studies (STREGA): an extension of the STROBE Statement.

Authors:  Julian Little; Julian P T Higgins; John P A Ioannidis; David Moher; France Gagnon; Erik von Elm; Muin J Khoury; Barbara Cohen; George Davey-Smith; Jeremy Grimshaw; Paul Scheet; Marta Gwinn; Robin E Williamson; Guang Yong Zou; Kim Hutchings; Candice Y Johnson; Valerie Tait; Miriam Wiens; Jean Golding; Cornelia van Duijn; John McLaughlin; Andrew Paterson; George Wells; Isabel Fortier; Matthew Freedman; Maja Zecevic; Richard King; Claire Infante-Rivard; Alex Stewart; Nick Birkett
Journal:  Hum Genet       Date:  2009-02-01       Impact factor: 4.132

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.