| Literature DB >> 21624166 |
Abstract
BACKGROUND: We have previously demonstrated an approach for efficient computation of genotype probabilities, and more generally probabilities of allele inheritance in inbred as well as outbred populations. That work also included an extension for haplotype inference, or phasing, using Hidden Markov Models. Computational phasing of multi-thousand marker datasets has not become common as of yet. In this communication, we further investigate the method presented earlier for such problems, in a multi-generational dataset simulated for QTL detection.Entities:
Year: 2011 PMID: 21624166 PMCID: PMC3103195 DOI: 10.1186/1753-6561-5-S3-S10
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Convergence. Left axis (solid line) illustrating scaled sum of total logarithm probabilities over all analysis, up to iteration 25. In addition, block symbols (right vertical axis) show the number of downstream inversion events, most of which were occurring in the founder generation. After iteration 8, only minor adjustments were determined by the algorithm.
Figure 2Use of inversion. During optimisation, a scan is performed in each analysis for inverting the skewness values determining phase downstream of each marker. Based on this scan, knots or bubbles of incorrect phase can be identified and resolved. The two lines are illustrating iteration 2 and 3, respectively, for chromosome 1 in the founder individual with ID 3. In iteration 2, a large range of markers has converged towards an inverted phase versus the correct one. This region is identified and inverted for iteration 3.
Accuracy per generation on the level of markers and individuals
| Generation | Perfectly phased individuals | Correctly phased markers |
|---|---|---|
| 1 | 100.0% | 100.0% |
| 2 | 93.78% | 99.60% |
| 3 | 97.69% | 99.98% |
| 4 | 98.67% | 99.99% |
| 5 | 98.33% | 99.97% |
| All | 97.68% | 99.93% |
Accuracy in produced phasing per generation. A perfectly phased individual has no heterozygotic marker with incorrect phase over the 5 chromosomes. Only a very limited amount of markers were incorrectly phased. The total proportion of individuals affected by errors was below 10% in all generations.