Literature DB >> 16532393

A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase.

Paul Scheet1, Matthew Stephens.   

Abstract

We present a statistical model for patterns of genetic variation in samples of unrelated individuals from natural populations. This model is based on the idea that, over short regions, haplotypes in a population tend to cluster into groups of similar haplotypes. To capture the fact that, because of recombination, this clustering tends to be local in nature, our model allows cluster memberships to change continuously along the chromosome according to a hidden Markov model. This approach is flexible, allowing for both "block-like" patterns of linkage disequilibrium (LD) and gradual decline in LD with distance. The resulting model is also fast and, as a result, is practicable for large data sets (e.g., thousands of individuals typed at hundreds of thousands of markers). We illustrate the utility of the model by applying it to dense single-nucleotide-polymorphism genotype data for the tasks of imputing missing genotypes and estimating haplotypic phase. For imputing missing genotypes, methods based on this model are as accurate or more accurate than existing methods. For haplotype estimation, the point estimates are slightly less accurate than those from the best existing methods (e.g., for unrelated Centre d'Etude du Polymorphisme Humain individuals from the HapMap project, switch error was 0.055 for our method vs. 0.051 for PHASE) but require a small fraction of the computational cost. In addition, we demonstrate that the model accurately reflects uncertainty in its estimates, in that probabilities computed using the model are approximately well calibrated. The methods described in this article are implemented in a software package, fastPHASE, which is available from the Stephens Lab Web site.

Mesh:

Year:  2006        PMID: 16532393      PMCID: PMC1424677          DOI: 10.1086/502802

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  18 in total

1.  A new statistical method for haplotype reconstruction from population data.

Authors:  M Stephens; N J Smith; P Donnelly
Journal:  Am J Hum Genet       Date:  2001-03-09       Impact factor: 11.025

2.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

3.  Haplotype inference in random population samples.

Authors:  Shin Lin; David J Cutler; Michael E Zwick; Aravinda Chakravarti
Journal:  Am J Hum Genet       Date:  2002-10-17       Impact factor: 11.025

4.  Detecting immigration by using multilocus genotypes.

Authors:  B Rannala; J L Mountain
Journal:  Proc Natl Acad Sci U S A       Date:  1997-08-19       Impact factor: 11.205

5.  GERBIL: Genotype resolution and block identification using likelihood.

Authors:  Gad Kimmel; Ron Shamir
Journal:  Proc Natl Acad Sci U S A       Date:  2004-12-22       Impact factor: 11.205

6.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

7.  Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation.

Authors:  Matthew Stephens; Paul Scheet
Journal:  Am J Hum Genet       Date:  2005-01-31       Impact factor: 11.025

8.  Coalescent-based association mapping and fine mapping of complex trait loci.

Authors:  Sebastian Zöllner; Jonathan K Pritchard
Journal:  Genetics       Date:  2004-10-16       Impact factor: 4.562

9.  A block-free hidden Markov model for genotypes and its application to disease association.

Authors:  Gad Kimmel; Ron Shamir
Journal:  J Comput Biol       Date:  2005-12       Impact factor: 1.479

10.  A comparison of phasing algorithms for trios and unrelated individuals.

Authors:  Jonathan Marchini; David Cutler; Nick Patterson; Matthew Stephens; Eleazar Eskin; Eran Halperin; Shin Lin; Zhaohui S Qin; Heather M Munro; Goncalo R Abecasis; Peter Donnelly
Journal:  Am J Hum Genet       Date:  2006-01-26       Impact factor: 11.025

View more
  857 in total

1.  Haploscope: a tool for the graphical display of haplotype structure in populations.

Authors:  F Anthony San Lucas; Noah A Rosenberg; Paul Scheet
Journal:  Genet Epidemiol       Date:  2011-12-06       Impact factor: 2.135

2.  Rare versus common variants in pharmacogenetics: SLCO1B1 variation and methotrexate disposition.

Authors:  Laura B Ramsey; Gitte H Bruun; Wenjian Yang; Lisa R Treviño; Selina Vattathil; Paul Scheet; Cheng Cheng; Gary L Rosner; Kathleen M Giacomini; Yiping Fan; Alex Sparreboom; Torben S Mikkelsen; Thomas J Corydon; Ching-Hon Pui; William E Evans; Mary V Relling
Journal:  Genome Res       Date:  2011-12-06       Impact factor: 9.043

3.  A linear complexity phasing method for thousands of genomes.

Authors:  Olivier Delaneau; Jonathan Marchini; Jean-François Zagury
Journal:  Nat Methods       Date:  2011-12-04       Impact factor: 28.547

4.  Fast and accurate inference of local ancestry in Latino populations.

Authors:  Yael Baran; Bogdan Pasaniuc; Sriram Sankararaman; Dara G Torgerson; Christopher Gignoux; Celeste Eng; William Rodriguez-Cintron; Rocio Chapela; Jean G Ford; Pedro C Avila; Jose Rodriguez-Santana; Esteban Gonzàlez Burchard; Eran Halperin
Journal:  Bioinformatics       Date:  2012-04-11       Impact factor: 6.937

5.  Integrating genetic and gene expression evidence into genome-wide association analysis of gene sets.

Authors:  Qing Xiong; Nicola Ancona; Elizabeth R Hauser; Sayan Mukherjee; Terrence S Furey
Journal:  Genome Res       Date:  2011-09-22       Impact factor: 9.043

6.  Genome-wide interrogation of germline genetic variation associated with treatment response in childhood acute lymphoblastic leukemia.

Authors:  Jun J Yang; Cheng Cheng; Wenjian Yang; Deqing Pei; Xueyuan Cao; Yiping Fan; Stanley B Pounds; Geoffrey Neale; Lisa R Treviño; Deborah French; Dario Campana; James R Downing; William E Evans; Ching-Hon Pui; Meenakshi Devidas; W P Bowman; Bruce M Camitta; Cheryl L Willman; Stella M Davies; Michael J Borowitz; William L Carroll; Stephen P Hunger; Mary V Relling
Journal:  JAMA       Date:  2009-01-28       Impact factor: 56.272

7.  The Genomic Impacts of Drift and Selection for Hybrid Performance in Maize.

Authors:  Justin P Gerke; Jode W Edwards; Katherine E Guill; Jeffrey Ross-Ibarra; Michael D McMullen
Journal:  Genetics       Date:  2015-09-18       Impact factor: 4.562

8.  Association between urokinase haplotypes and outcome from infection-associated acute lung injury.

Authors:  John Arcaroli; Jeff Sankoff; Nianjun Liu; David B Allison; James Maloney; Edward Abraham
Journal:  Intensive Care Med       Date:  2007-11-10       Impact factor: 17.440

9.  Genotyping error detection in samples of unrelated individuals without replicate genotyping.

Authors:  Nianjun Liu; Dabao Zhang; Hongyu Zhao
Journal:  Hum Hered       Date:  2008-12-15       Impact factor: 0.444

10.  Evidence of positive selection for a glycogen synthase (GYS1) mutation in domestic horse populations.

Authors:  Annette M McCoy; Robert Schaefer; Jessica L Petersen; Peter L Morrell; Megan A Slamka; James R Mickelson; Stephanie J Valberg; Molly E McCue
Journal:  J Hered       Date:  2013-11-08       Impact factor: 2.645

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.