Literature DB >> 30590428

CSHAP: efficient haplotype frequency estimation based on sparse representation.

Yinsheng Zhou1, Han Zhang2, Yaning Yang1.   

Abstract

MOTIVATION: Estimating haplotype frequencies from genotype data plays an important role in genetic analysis. In silico methods are usually computationally involved since phase information is not available. Due to tight linkage disequilibrium and low recombination rates, the number of haplotypes observed in human populations is far less than all the possibilities. This motivates us to solve the estimation problem by maximizing the sparsity of existing haplotypes. Here, we propose a new algorithm by applying the compressive sensing (CS) theory in the field of signal processing, compressive sensing haplotype inference (CSHAP), to solve the sparse representation of haplotype frequencies based on allele frequencies and between-allele co-variances.
RESULTS: Our proposed approach can handle both individual genotype data and pooled DNA data with hundreds of loci. The CSHAP exhibits the same accuracy compared with the state-of-the-art methods, but runs several orders of magnitude faster. CSHAP can also handle with missing genotype data imputations efficiently.
AVAILABILITY AND IMPLEMENTATION: The CSHAP is implemented in R, the source code and the testing datasets are available at http://home.ustc.edu.cn/∼zhouys/CSHAP/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 30590428      PMCID: PMC6931353          DOI: 10.1093/bioinformatics/bty1040

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms.

Authors:  Tianhua Niu; Zhaohui S Qin; Xiping Xu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2001-11-26       Impact factor: 11.025

2.  Partition-ligation-expectation-maximization algorithm for haplotype inference with single-nucleotide polymorphisms.

Authors:  Zhaohui S Qin; Tianhua Niu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2002-11       Impact factor: 11.025

3.  Efficiency of single-nucleotide polymorphism haplotype estimation from pooled DNA.

Authors:  Yaning Yang; Jingshan Zhang; Josephine Hoh; Fumihiko Matsuda; Peng Xu; Mark Lathrop; Jurg Ott
Journal:  Proc Natl Acad Sci U S A       Date:  2003-05-30       Impact factor: 11.205

4.  Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation.

Authors:  Matthew Stephens; Paul Scheet
Journal:  Am J Hum Genet       Date:  2005-01-31       Impact factor: 11.025

5.  A coalescence-guided hierarchical Bayesian method for haplotype inference.

Authors:  Yu Zhang; Tianhua Niu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2006-06-28       Impact factor: 11.025

6.  PoooL: an efficient method for estimating haplotype frequencies from large DNA pools.

Authors:  Han Zhang; Hsin-Chou Yang; Yaning Yang
Journal:  Bioinformatics       Date:  2008-06-23       Impact factor: 6.937

Review 7.  Haplotype phasing: existing methods and new developments.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Nat Rev Genet       Date:  2011-09-16       Impact factor: 53.242

8.  Inference of haplotypes from samples of diploid populations: complexity and algorithms.

Authors:  D Gusfield
Journal:  J Comput Biol       Date:  2001       Impact factor: 1.479

9.  Sequence variation in the human angiotensin converting enzyme.

Authors:  M J Rieder; S L Taylor; A G Clark; D A Nickerson
Journal:  Nat Genet       Date:  1999-05       Impact factor: 38.330

10.  Genotype imputation with thousands of genomes.

Authors:  Bryan Howie; Jonathan Marchini; Matthew Stephens
Journal:  G3 (Bethesda)       Date:  2011-11-01       Impact factor: 3.154

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.