Computing the minimum recombinant haplotype configuration from incomplete genotype data on a pedigree by integer linear programming.

Jing Li, Tao Jiang.

Abstract

We study the problem of reconstructing haplotype configurations from genotypes on pedigree data with missing alleles under the Mendelian law of inheritance and the minimum-recombination principle, which is important for the construction of haplotype maps and genetic linkage/association analyses. Our previous results show that the problem of finding a minimum-recombinant haplotype configuration (MRHC) is in general NP-hard. This paper presents an effective integer linear programming (ILP) formulation of the MRHC problem with missing data and a branch-and-bound strategy that utilizes a partial order relationship and some other special relationships among variables to decide the branching order. Nontrivial lower and upper bounds on the optimal number of recombinants are introduced at each branching node to effectively prune the search tree. When multiple solutions exist, a best haplotype configuration is selected based on a maximum likelihood approach. The paper also shows for the first time how to incorporate marker interval distance into a rule-based haplotyping algorithm. Our results on simulated data show that the algorithm could recover haplotypes with 50 loci from a pedigree of size 29 in seconds on a Pentium IV computer. Its accuracy is more than 99.8% for data with no missing alleles and 98.3% for data with 20% missing alleles in terms of correctly recovered phase information at each marker locus. A comparison with a statistical approach SimWalk2 on simulated data shows that the ILP algorithm runs much faster than SimWalk2 and reports better or comparable haplotypes on average than the first and second runs of SimWalk2. As an application of the algorithm to real data, we present some test results on reconstructing haplotypes from a genome-scale SNP dataset consisting of 12 pedigrees that have 0.8% to 14.5% missing alleles.
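The minimum-recombination objective described in the abstract can be illustrated on a toy nuclear family. The sketch below is not the authors' ILP formulation or their branch-and-bound strategy: it is an exhaustive search, exponential in the number of heterozygous loci, and it assumes complete genotypes (no missing alleles). All function names and the example data are hypothetical.

```python
from itertools import product

def min_switches(parent_haps, child_hap):
    """Fewest recombinations needed to copy child_hap from the two
    parental haplotypes; None if some allele matches neither haplotype."""
    inf = float("inf")
    cost = [0, 0]  # cheapest way to be copying from haplotype 0 / 1 so far
    for i, allele in enumerate(child_hap):
        new = [inf, inf]
        for s in (0, 1):
            if parent_haps[s][i] == allele:
                # stay on the same source, or switch (one recombination)
                new[s] = min(cost[s], cost[1 - s] + 1)
        cost = new
    best = min(cost)
    return None if best == inf else best

def phasings(genotype):
    """Yield every ordered (paternal, maternal) haplotype pair that is
    consistent with an unordered genotype given as (allele, allele) per locus."""
    choices = [[(a, b)] if a == b else [(a, b), (b, a)] for a, b in genotype]
    for combo in product(*choices):
        yield tuple(x for x, _ in combo), tuple(y for _, y in combo)

def min_recombinant_family(father, mother, children):
    """Minimum total number of recombinants over all phasings of a
    nuclear family; None if the genotypes violate Mendelian inheritance."""
    best = None
    for f in phasings(father):
        for m in phasings(mother):
            total = 0
            for child in children:
                # given the parents' phase, each child is solved independently
                costs = []
                for pat, mat in phasings(child):
                    a, b = min_switches(f, pat), min_switches(m, mat)
                    if a is not None and b is not None:
                        costs.append(a + b)
                if not costs:
                    total = None
                    break
                total += min(costs)
            if total is not None and (best is None or total < best):
                best = total
    return best

# Hypothetical two-locus example: the third child forces one recombinant.
father = [(1, 2), (1, 2)]
mother = [(1, 1), (1, 1)]
children = [
    [(1, 1), (1, 1)],
    [(1, 2), (1, 2)],
    [(1, 1), (1, 2)],
]
result = min_recombinant_family(father, mother, children)
```

Here `result` is 1: no phasing of the father explains all three children without at least one recombination. Exhaustive enumeration of this kind is only feasible for very small inputs, which is consistent with the abstract's note that the general problem is NP-hard.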

Year: 2005    PMID: 16108713    DOI: 10.1089/cmb.2005.12.719

Source DB: PubMed    Journal: J Comput Biol    ISSN: 1066-5277    Impact factor: 1.479


  30 in total

1.  Genotype-dependent responses to levels of sibling competition over maternal resources in mice.

Authors:  R Hager; J M Cheverud; J B Wolf
Journal:  Heredity (Edinb)       Date:  2011-11-30       Impact factor: 3.821

2.  Obesity-insulin targeted genes in the 3p26-25 region in human studies and LG/J and SM/J mice.

Authors:  Aldi T Kraja; Heather A Lawson; Donna K Arnett; Ingrid B Borecki; Ulrich Broeckel; Lisa de las Fuentes; Steven C Hunt; Michael A Province; James Cheverud; D C Rao
Journal:  Metabolism       Date:  2012-03-03       Impact factor: 8.694

3.  Polymorphisms of estrogen-biosynthesis genes CYP17 and CYP19 may influence age at menarche: a genetic association study in Caucasian females.

Authors:  Yan Guo; Dong-Hai Xiong; Tie-Lin Yang; Yan-Fang Guo; Robert R Recker; Hong-Wen Deng
Journal:  Hum Mol Genet       Date:  2006-06-16       Impact factor: 6.150

4.  Multipoint linkage analysis with many multiallelic or dense diallelic markers: Markov chain-Monte Carlo provides practical approaches for genome scans on general pedigrees.

Authors:  Ellen M Wijsman; Joseph H Rothstein; Elizabeth A Thompson
Journal:  Am J Hum Genet       Date:  2006-09-20       Impact factor: 11.025

5.  Pronounced inter- and intrachromosomal variation in linkage disequilibrium across the zebra finch genome.

Authors:  Jessica Stapley; Tim R Birkhead; Terry Burke; Jon Slate
Journal:  Genome Res       Date:  2010-03-31       Impact factor: 9.043

6.  Efficient haplotype inference from pedigrees with missing data using linear systems with disjoint-set data structures.

Authors:  Xin Li; Jing Li
Journal:  Comput Syst Bioinformatics Conf       Date:  2008

7.  Haplotyping methods for pedigrees. (Review)

Authors:  Guimin Gao; David B Allison; Ina Hoeschele
Journal:  Hum Hered       Date:  2009-01-27       Impact factor: 0.444

8.  Detecting genome-wide haplotype polymorphism by combined use of Mendelian constraints and local population structure.

Authors:  Xin Li; Yixuan Chen; Jing Li
Journal:  Pac Symp Biocomput       Date:  2010

9.  Efficient genome ancestry inference in complex pedigrees with inbreeding.

Authors:  Eric Yi Liu; Qi Zhang; Leonard McMillan; Fernando Pardo-Manuel de Villena; Wei Wang
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

10.  Linked region detection using high-density SNP genotype data via the minimum recombinant model of pedigree haplotype inference.

Authors:  Lusheng Wang; Zhanyong Wang; Wanling Yang
Journal:  BMC Bioinformatics       Date:  2009-07-15       Impact factor: 3.169
