A practical algorithm for optimal inference of haplotypes from diploid populations.

D Gusfield.

Abstract

The next phase of human genomics will involve large-scale screens of populations for significant DNA polymorphisms, notably single nucleotide polymorphisms (SNPs). Dense human SNP maps are currently under construction. However, the utility of those maps and screens will be limited by the fact that humans are diploid and that it is presently difficult to obtain separate data on the two "copies". Hence genotype (blended) SNP data will be collected, and the desired haplotype (partitioned) data must then be (partially) inferred. A particular non-deterministic inference algorithm was proposed and studied before SNP data were available, and has more recently been applied extensively to the first available SNP data. In this paper, we ask whether an efficient, deterministic variant of that method can be obtained that optimizes the resulting inferences. Although we have shown elsewhere that the optimization problem is NP-hard, we present here a practical approach based on (integer) linear programming. The method either returns the optimal answer together with a declaration that it is optimal, or declares that it has failed to find the optimum. The approach works quickly and correctly, finding the optimum on all simulated data tested; these data are expected to be more demanding than realistic biological data.
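The distinction between genotype (blended) and haplotype (partitioned) data can be made concrete with a small sketch. The 0/1/2 string encoding, the function names, and the single inference step shown here are assumptions chosen for illustration; the abstract does not specify the paper's representation or its exact inference rule, and this is not the integer linear programming method itself.

```python
from typing import Optional


def blend(h1: str, h2: str) -> str:
    """Combine two haplotypes (strings over '0'/'1') into one genotype.

    Assumed encoding: a site where both copies agree keeps that allele;
    a site where they differ is recorded as '2' (heterozygous, phase unknown).
    """
    if len(h1) != len(h2):
        raise ValueError("haplotypes must have equal length")
    return "".join(a if a == b else "2" for a, b in zip(h1, h2))


def resolve(genotype: str, known: str) -> Optional[str]:
    """Try to explain `genotype` as `known` plus one partner haplotype.

    Returns the partner haplotype if `known` is compatible with the
    genotype, otherwise None. This is one hypothetical inference step,
    not the full algorithm discussed in the abstract.
    """
    if len(genotype) != len(known):
        raise ValueError("genotype and haplotype must have equal length")
    partner = []
    for g, k in zip(genotype, known):
        if g == "2":
            # Heterozygous site: the partner must carry the other allele.
            partner.append("1" if k == "0" else "0")
        elif g == k:
            # Homozygous site: both copies carry the same allele.
            partner.append(k)
        else:
            # Known haplotype contradicts a homozygous site.
            return None
    return "".join(partner)


def count_resolutions(genotype: str) -> int:
    """Number of unordered haplotype pairs consistent with a genotype."""
    k = genotype.count("2")
    return 1 if k == 0 else 2 ** (k - 1)


if __name__ == "__main__":
    g = blend("0110", "0011")
    print(g)
    print(resolve(g, "0110"))
    print(count_resolutions(g))
```

Under this encoding, a genotype with k heterozygous sites has 2^(k-1) possible haplotype pairs when k is at least 1, which is why the haplotypes must be inferred rather than read off directly.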

Year: 2000; PMID: 10977079

Source DB: PubMed; Journal: Proc Int Conf Intell Syst Mol Biol; ISSN: 1553-0833


  4 in total

1.  A comparison of phasing algorithms for trios and unrelated individuals.

Authors:  Jonathan Marchini; David Cutler; Nick Patterson; Matthew Stephens; Eleazar Eskin; Eran Halperin; Shin Lin; Zhaohui S Qin; Heather M Munro; Goncalo R Abecasis; Peter Donnelly
Journal:  Am J Hum Genet       Date:  2006-01-26       Impact factor: 11.025

2.  How frugal is Mother Nature with haplotypes?

Authors:  Sharlee Climer; Gerold Jäger; Alan R Templeton; Weixiong Zhang
Journal:  Bioinformatics       Date:  2008-11-04       Impact factor: 6.937

3.  Heterozygous genome assembly via binary classification of homologous sequence.

Authors:  Paul M Bodily; M Fujimoto; Cameron Ortega; Nozomu Okuda; Jared C Price; Mark J Clement; Quinn Snell
Journal:  BMC Bioinformatics       Date:  2015-04-23       Impact factor: 3.169

4.  ISHAPE: new rapid and accurate software for haplotyping.

Authors:  Olivier Delaneau; Cédric Coulonges; Pierre-Yves Boelle; George Nelson; Jean-Louis Spadoni; Jean-François Zagury
Journal:  BMC Bioinformatics       Date:  2007-06-15       Impact factor: 3.169
