Literature DB >> 23383995

Hap-seq: an optimal algorithm for haplotype phasing with imputation using sequencing data.

Dan He1, Buhm Han, Eleazar Eskin.   

Abstract

Inference of haplotypes, or the sequence of alleles along each chromosome, is a fundamental problem in genetics and is important for many analyses, including admixture mapping, identifying regions of identity by descent, and imputation. Traditionally, haplotypes are inferred from genotype data obtained from microarrays using information on population haplotype frequencies inferred from either a large sample of genotyped individuals or a reference dataset such as the HapMap. Since the availability of large reference datasets, modern approaches for haplotype phasing along these lines are closely related to imputation methods. When applied to data obtained from sequencing studies, a straightforward way to obtain haplotypes is to first infer genotypes from the sequence data and then apply an imputation method. However, this approach does not take into account that alleles on the same sequence read originate from the same chromosome. Haplotype assembly approaches take advantage of this insight and predict haplotypes by assigning the reads to chromosomes in such a way that minimizes the number of conflicts between the reads and the predicted haplotypes. Unfortunately, assembly approaches require very high sequencing coverage and are usually not able to fully reconstruct the haplotypes. In this work, we present a novel approach, Hap-seq, which is simultaneously an imputation and assembly method that combines information from a reference dataset with the information from the reads using a likelihood framework. Our method applies a dynamic programming algorithm to identify the predicted haplotype, which maximizes the joint likelihood of the haplotype with respect to the reference dataset and the haplotype with respect to the observed reads. We show that our method requires only low sequencing coverage and can reconstruct haplotypes containing both common and rare alleles with higher accuracy compared to the state-of-the-art imputation methods.

Entities:  

Mesh:

Year:  2013        PMID: 23383995      PMCID: PMC3576919          DOI: 10.1089/cmb.2012.0091

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  19 in total

1.  A new statistical method for haplotype reconstruction from population data.

Authors:  M Stephens; N J Smith; P Donnelly
Journal:  Am J Hum Genet       Date:  2001-03-09       Impact factor: 11.025

2.  Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

Authors:  N Patil; A J Berno; D A Hinds; W A Barrett; J M Doshi; C R Hacker; C R Kautzer; D H Lee; C Marjoribanks; D P McDonough; B T Nguyen; M C Norris; J B Sheehan; N Shen; D Stern; R P Stokowski; D J Thomas; M O Trulson; K R Vyas; K A Frazer; S P Fodor; D R Cox
Journal:  Science       Date:  2001-11-23       Impact factor: 47.728

3.  High-resolution detection of identity by descent in unrelated individuals.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2010-03-18       Impact factor: 11.025

Review 4.  Inference of haplotypes from PCR-amplified samples of diploid populations.

Authors:  A G Clark
Journal:  Mol Biol Evol       Date:  1990-03       Impact factor: 16.240

5.  A fast, powerful method for detecting identity by descent.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2011-02-11       Impact factor: 11.025

6.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.

Authors:  Yun Li; Cristen J Willer; Jun Ding; Paul Scheet; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2010-12       Impact factor: 2.135

7.  EMINIM: an adaptive and memory-efficient algorithm for genotype imputation.

Authors:  Hyun Min Kang; Noah A Zaitlen; Eleazar Eskin
Journal:  J Comput Biol       Date:  2010-03       Impact factor: 1.479

8.  Methods for high-density admixture mapping of disease genes.

Authors:  Nick Patterson; Neil Hattangadi; Barton Lane; Kirk E Lohmueller; David A Hafler; Jorge R Oksenberg; Stephen L Hauser; Michael W Smith; Stephen J O'Brien; David Altshuler; Mark J Daly; David Reich
Journal:  Am J Hum Genet       Date:  2004-04-14       Impact factor: 11.025

9.  Haplotype reconstruction from genotype data using Imperfect Phylogeny.

Authors:  Eran Halperin; Eleazar Eskin
Journal:  Bioinformatics       Date:  2004-02-26       Impact factor: 6.937

10.  Efficient reconstruction of haplotype structure via perfect phylogeny.

Authors:  Eleazar Eskin; Eran Halperin; Richard M Karp
Journal:  J Bioinform Comput Biol       Date:  2003-04       Impact factor: 1.122

View more
  5 in total

1.  Maximum parsimony xor haplotyping by sparse dictionary selection.

Authors:  Abdulkadir Elmas; Guido H Jajamovich; Xiaodong Wang
Journal:  BMC Genomics       Date:  2013-09-23       Impact factor: 3.969

Review 2.  Digenic inheritance in medical genetics.

Authors:  Alejandro A Schäffer
Journal:  J Med Genet       Date:  2013-06-19       Impact factor: 6.318

3.  Integrating dilution-based sequencing and population genotypes for single individual haplotyping.

Authors:  Hirotaka Matsumoto; Hisanori Kiryu
Journal:  BMC Genomics       Date:  2014-08-28       Impact factor: 3.969

4.  Reference-based phasing using the Haplotype Reference Consortium panel.

Authors:  Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A Reshef; Hilary K Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Goncalo R Abecasis; Richard Durbin; Alkes L Price
Journal:  Nat Genet       Date:  2016-10-03       Impact factor: 38.330

5.  Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.

Authors:  Abolfazl Hashemi; Banghua Zhu; Haris Vikalo
Journal:  BMC Genomics       Date:  2018-03-21       Impact factor: 3.969

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.