Literature DB >> 25304780

Accurate estimation of haplotype frequency from pooled sequencing data and cost-effective identification of rare haplotype carriers by overlapping pool sequencing.

Chang-Chang Cao1, Xiao Sun1.   

Abstract

MOTIVATION: A variety of hypotheses have been proposed for finding the missing heritability of complex diseases in genome-wide association studies. Studies have focused on the value of haplotype to improve the power of detecting associations with disease. To facilitate haplotype-based association analysis, it is necessary to accurately estimate haplotype frequencies of pooled samples.
RESULTS: Taking advantage of databases that contain prior haplotypes, we present Ehapp based on the algorithm for solving the system of linear equations to estimate the frequencies of haplotypes from pooled sequencing data. Effects of various factors in sequencing on the performance are evaluated using simulated data. Our method could estimate the frequencies of haplotypes with only about 3% average relative difference for pooled sequencing of the mixture of 10 haplotypes with total coverage of 50×. When unknown haplotypes exist, our method maintains excellent performance for haplotypes with actual frequencies >0.05. Comparisons with present method on simulated data in conjunction with publicly available Illumina sequencing data indicate that our method is state of the art for many sequencing study designs. We also demonstrate the feasibility of applying overlapping pool sequencing to identify rare haplotype carriers cost-effectively.
AVAILABILITY AND IMPLEMENTATION: Ehapp (in Perl) for the Linux platforms is available online (http://bioinfo.seu.edu.cn/Ehapp/). CONTACT: xsun@seu.edu.cn SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2014        PMID: 25304780     DOI: 10.1093/bioinformatics/btu670

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  Accurate Allele Frequencies from Ultra-low Coverage Pool-Seq Samples in Evolve-and-Resequence Experiments.

Authors:  Susanne Tilk; Alan Bergland; Aaron Goodman; Paul Schmidt; Dmitri Petrov; Sharon Greenblum
Journal:  G3 (Bethesda)       Date:  2019-12-03       Impact factor: 3.154

2.  A joint use of pooling and imputation for genotyping SNPs.

Authors:  Camille Clouard; Kristiina Ausmees; Carl Nettelblad
Journal:  BMC Bioinformatics       Date:  2022-10-13       Impact factor: 3.307

3.  Detecting selected haplotype blocks in evolve and resequence experiments.

Authors:  Kathrin A Otte; Christian Schlötterer
Journal:  Mol Ecol Resour       Date:  2020-09-06       Impact factor: 7.090

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.