Literature DB >> 17895275

HAPLOPOOL: improving haplotype frequency estimation through DNA pools and phylogenetic modeling.

Bonnie Kirkpatrick1, Carlos Santos Armendariz, Richard M Karp, Eran Halperin.   

Abstract

MOTIVATION: The search for genetic variants that are linked to complex diseases such as cancer, Parkinson's;, or Alzheimer's; disease, may lead to better treatments. Since haplotypes can serve as proxies for hidden variants, one method of finding the linked variants is to look for case-control associations between the haplotypes and disease. Finding these associations requires a high-quality estimation of the haplotype frequencies in the population. To this end, we present, HaploPool, a method of estimating haplotype frequencies from blocks of consecutive SNPs.
RESULTS: HaploPool leverages the efficiency of DNA pools and estimates the population haplotype frequencies from pools of disjoint sets, each containing two or three unrelated individuals. We study the trade-off between pooling efficiency and accuracy of haplotype frequency estimates. For a fixed genotyping budget, HaploPool performs favorably on pools of two individuals as compared with a state-of-the-art non-pooled phasing method, PHASE. Of independent interest, HaploPool can be used to phase non-pooled genotype data with an accuracy approaching that of PHASE. We compared our algorithm to three programs that estimate haplotype frequencies from pooled data. HaploPool is an order of magnitude more efficient (at least six times faster), and considerably more accurate than previous methods. In contrast to previous methods, HaploPool performs well with missing data, genotyping errors and long haplotype blocks (of between 5 and 25 SNPs).

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17895275     DOI: 10.1093/bioinformatics/btm435

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  Multimarker analysis and imputation of multiple platform pooling-based genome-wide association studies.

Authors:  Nils Homer; Waibhav D Tembe; Szabolcs Szelinger; Margot Redman; Dietrich A Stephan; John V Pearson; Stanley F Nelson; David Craig
Journal:  Bioinformatics       Date:  2008-07-10       Impact factor: 6.937

2.  Rapid inexpensive genome-wide association using pooled whole blood.

Authors:  Jamie E Craig; Alex W Hewitt; Amy E McMellon; Anjali K Henders; Lingjun Ma; Leanne Wallace; Shiwani Sharma; Kathryn P Burdon; Peter M Visscher; Grant W Montgomery; Stuart MacGregor
Journal:  Genome Res       Date:  2009-10-03       Impact factor: 9.043

3.  Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA.

Authors:  Guido H Jajamovich; Alexandros Iliadis; Dimitris Anastassiou; Xiaodong Wang
Journal:  BMC Bioinformatics       Date:  2013-09-08       Impact factor: 3.169

4.  A sequential Monte Carlo framework for haplotype inference in CNV/SNP genotype data.

Authors:  Alexandros Iliadis; Dimitris Anastassiou; Xiaodong Wang
Journal:  EURASIP J Bioinform Syst Biol       Date:  2014-04-24

5.  An efficient pipeline to generate data for studies in plastid population genomics and phylogeography.

Authors:  Brendan F Kohrn; Jessica M Persinger; Mitchell B Cruzan
Journal:  Appl Plant Sci       Date:  2017-11-14       Impact factor: 1.936

6.  Fast and accurate haplotype frequency estimation for large haplotype vectors from pooled DNA data.

Authors:  Alexandros Iliadis; Dimitris Anastassiou; Xiaodong Wang
Journal:  BMC Genet       Date:  2012-10-30       Impact factor: 2.797

7.  Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.).

Authors:  Mikhail Ozerov; Anti Vasemägi; Vidar Wennevik; Eero Niemelä; Sergey Prusov; Matthew Kent; Juha-Pekka Vähä
Journal:  BMC Genomics       Date:  2013-01-16       Impact factor: 3.969

8.  Maximum likelihood estimation of frequencies of known haplotypes from pooled sequence data.

Authors:  Darren Kessner; Thomas L Turner; John Novembre
Journal:  Mol Biol Evol       Date:  2013-01-30       Impact factor: 16.240

9.  An EM algorithm based on an internal list for estimating haplotype distributions of rare variants from pooled genotype data.

Authors:  Anthony Y C Kuk; Xiang Li; Jinfeng Xu
Journal:  BMC Genet       Date:  2013-09-13       Impact factor: 2.797

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.