Literature DB >> 16379533

An approximation algorithm for haplotype inference by maximum parsimony.

Yao-Ting Huang1, Kun-Mao Chao, Ting Chen.   

Abstract

This paper studies haplotype inference by maximum parsimony using population data. We define the optimal haplotype inference (OHI) problem as given a set of genotypes and a set of related haplotypes, find a minimum subset of haplotypes that can resolve all the genotypes. We prove that OHI is NP-hard and can be formulated as an integer quadratic programming (IQP) problem. To solve the IQP problem, we propose an iterative semidefinite programming-based approximation algorithm, (called SDPHapInfer). We show that this algorithm finds a solution within a factor of O(log n) of the optimal solution, where n is the number of genotypes. This algorithm has been implemented and tested on a variety of simulated and biological data. In comparison with three other methods, (1) HAPAR, which was implemented based on the branching and bound algorithm, (2) HAPLOTYPER, which was implemented based on the expectation-maximization algorithm, and (3) PHASE, which combined the Gibbs sampling algorithm with an approximate coalescent prior, the experimental results indicate that SDPHapInfer and HAPLOTYPER have similar error rates. In addition, the results generated by PHASE have lower error rates on some data but higher error rates on others. The error rates of HAPAR are higher than the others on biological data. In terms of efficiency, SDPHapInfer, HAPLOTYPER, and PHASE output a solution in a stable and consistent way, and they run much faster than HAPAR when the number of genotypes becomes large.

Mesh:

Substances:

Year:  2005        PMID: 16379533     DOI: 10.1089/cmb.2005.12.1261

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  4 in total

1.  Haplotype reconstruction using perfect phylogeny and sequence data.

Authors:  Anatoly Efros; Eran Halperin
Journal:  BMC Bioinformatics       Date:  2012-04-19       Impact factor: 3.169

2.  How frugal is Mother Nature with haplotypes?

Authors:  Sharlee Climer; Gerold Jäger; Alan R Templeton; Weixiong Zhang
Journal:  Bioinformatics       Date:  2008-11-04       Impact factor: 6.937

3.  Inference of chromosome-specific copy numbers using population haplotypes.

Authors:  Yao-Ting Huang; Min-Han Wu
Journal:  BMC Bioinformatics       Date:  2011-05-24       Impact factor: 3.169

4.  Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat.

Authors:  Vincent Ranwez; Yan Holtz; Gautier Sarah; Morgane Ardisson; Sylvain Santoni; Sylvain Glémin; Muriel Tavaud-Pirra; Jacques David
Journal:  BMC Bioinformatics       Date:  2013-10-15       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.