| Literature DB >> 19761594 |
Francois Besnier1, Orjan Carlborg.
Abstract
BACKGROUND: The linkage phase, or haplotype, is an extra level of information that in addition to genotype and pedigree can be useful for reconstructing the inheritance pattern of the alleles in a pedigree, and computing for example Identity By Descent probabilities. If a haplotype is provided, the precision of estimated IBD probabilities increases, as long as the haplotype is estimated without errors. It is therefore important to only use haplotypes that are strongly supported by the available data for IBD estimation, to avoid introducing new errors due to erroneous linkage phases.Entities:
Mesh:
Year: 2009 PMID: 19761594 PMCID: PMC2754495 DOI: 10.1186/1471-2156-10-57
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Figure 1Implementation of the genetic algorithm. a) In a full sib family, the haplotype of individual I is to be estimated. Since no parent is available, the first heterozygous marker is phased arbitrarily allele "1" being of paternal origin and allele "2" being of maternal origin. The three offspring have been phased by a deterministic approach. b) Step1: Five GA individuals are randomly sampled (a, b, c, d, e). Each sample is a possible haplotype of the non-phased markers of individual I. c) Step2: Fitness of the GA individuals is computed as a likelihood function of the number of recombinations observed in the offspring chromosomes. d) Step3: The GA individuals with higher fitness (c, d, e) are selected and crossed in a simulated diploid reproduction mechanism that generates the sample of GA individuals for the next generation.
Figure 2Computational strategy for deep pedigree. Successive steps used to estimate haplotypes in any n-generational complex pedigree.
Figure 3Description of the algorithm used for estimation of haplotypes in a three generations pedigree. Haplotypes are indicated by boxes; grey box contain maternally inherited alleles when marker phase is known, whereas two white boxes contain haplotype segments with unknown phase origin. -a haplotype estimated after the first deterministic (step 1). -b haplotype estimated in generation 1 from the offspring (step 3). -c haplotype estimated in generation 2 from their parents (step 4). -d haplotype estimated in generation 3 from their parents (step 4).
Figure 4Accuracy of the haplotyping algorithm as function of the stringency criterion. a) Percentage of correct heterozygous marker phases in the estimated haplotypes. b) Percentage of haplotype error (plain dark line) and percentage of non estimated marker phases (dashed line).
haplotyping accuracy for the different compared methods
| AIL pedigree | |||
| GA- 0.95 | 95.10 | 0.20 | 4.80 |
| GA- 0.5 | 97.10 | 2.60 | 0.15 |
| Hapsim 1 | 96.70 | 3.30 | 0.00 |
| Merlin 2 | 83.00 | 1.80 | 14.20 |
| QTL-MAS pedigree | |||
| GA- 0.95 | 87.00 | 0.05 | 13.00 |
| GA- 0.5 | 90.00 | 1.00 | 9.00 |
| Hapsim 1 | 97.00 | 3.00 | 0.00 |
| Merlin 2 | 95.00 | 1.30 | 3.70 |
We compare three methods: our GA based method with high stringency (0.95), and low stringency (0.50), a deterministic method, and a likelihood based descent tree (Merlin), The results are given in percent of the non obvious heterozygous markers (non homozygous parents).
1Hernández-Sánchez and Knott (2009) [7]
2Abecasis et al (2002)[12]