| Literature DB >> 19278546 |
Jules Hernández-Sánchez1, Sara Knott.
Abstract
BACKGROUND: Haplotypes can increase the power of gene detection over genotypes and are essential to estimate linkage disequilibrium.Entities:
Year: 2009 PMID: 19278546 PMCID: PMC2654501 DOI: 10.1186/1753-6561-3-s1-s7
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Mating system and family size from a paternal perspective.
| 1 | 252 | 257 | 2 | 3 | 8 |
| 2 | 24 | 48 | 3 | 2 | 8 |
| 3 | 4 | 14 | 4 | 3 | 22 |
| 4 | 4 | 19 | 5 | 3 | 24 |
| 5 | 1 | 7 | 6 | 3 | 28 |
| 9 | 1 | 10 | 8 | 2 | 28 |
| 11 | 5 | 61 | 10 | 5 | 75 |
| 12 | 1 | 12 | 11 | 2 | 32 |
| 13 | 2 | 26 | 12 | 4 | 72 |
| 14 | 1 | 14 | 13 | 5 | 99 |
| 16 | 2 | 42 | 14 | 3 | 71 |
| 17 | 3 | 60 | 15 | 2 | 46 |
| 18 | 1 | 19 | 16 | 1 | 25 |
| 21 | 1 | 22 | 17 | 2 | 42 |
| 26 | 2 | 65 | 18 | 2 | 60 |
| 31 | 1 | 35 | 20 | 1 | 29 |
| 21 | 1 | 33 | |||
| 23 | 1 | 37 | |||
| 8 | 8 | 156 | 24 | 1 | 42 |
| 9 | 13 | 298 | 25 | 1 | 39 |
| 10 | 69 | 5246 | 27 | 2 | 93 |
F denotes number of females, M denotes number of males mated to F different females, and O denotes the total number of offspring in those families.
Mating system and family size from a maternal perspective.
| 1 | 184 | 190 | 1 | 236 | 359 |
| 2 | 51 | 110 | 2 | 94 | 298 |
| 3 | 22 | 70 | 3 | 41 | 199 |
| 4 | 11 | 50 | 4 | 9 | 57 |
| 5 | 2 | 10 | |||
| 6 | 3 | 18 | |||
| 7 | 1 | 9 | 1 | 871 | 5700 |
| 8 | 4 | 40 | |||
| 9 | 6 | 64 | |||
| 10 | 6 | 62 | |||
| 11 | 3 | 39 | |||
| 14 | 1 | 15 | |||
| 15 | 2 | 34 | |||
F denotes number of females, M denotes number of males mated to F different females, and O denotes the total number of offspring in those families.
Figure 1Pedigree structures. This figure shows the number of individuals (y-axes) against number of ancestors (x-axis), e.g. there are 359 sheep, 272 pigs and 239 individuals in the QTLMAS data with no ancestors (they are founders). The left y-axis is for sheep (clear bars) and pigs (grey bars), and the right y-axis for QTLMAS data (dark bars with top markers).
Measuring haplotype reconstruction uncertainty via simulations.
| Sheep | 1070 | 12 | 0 | 90 | 7.6 | 34 | 51 | 82.5 | 99.7 | 0.001 |
| Pigs | 1186 | 33 | 28 | 15 | 0.5 | 27.5 | 67 | 78.1 | 99.9 | 0.0003 |
| Sim. | 5939 | 1000 | 1000 | 100 | 0.1 | 98.75 | 98.75 | 94.1 | 99.2 | 0.0002 |
There were 100 simulations per data set. N is the pedigree size. M is the number of markers. SNP is the number of SNP among all markers. Length is the totalcM of the region considered. Av. is the average inter-locus distance incM. G is the percentage of all individuals in pedigree with all loci fully genotyped. 1/2 G is the percentage of all individuals in pedigree with at least 1/2 of loci fully genotyped. Phased is the percentage of phased alleles (or loci) from full genotypes. Correct is the percentage of correctly phased alleles from all phased. CV is the coefficient of variation of correct phases across simulations.
Comparison between fully or partially implementing MRHC.
| Sheep | Pigs | Simulated | ||||
| 4 | 6 | 4 | 6 | 4 | 6 | |
| 82.5 | 85.1 | 78.1 | 82.3 | 93.5 | 95.3 | |
| 99.7 | 99.3 | 99.9 | 99.6 | 98.8 | 98.6 | |
| 1.9 | 5 | 5.6 | 25.2 | 3460 | 16339 | |
The original MRHC algorithm contains 6 rules. The first four can be implemented on their own to phase most alleles. Phased was the percentage of phased alleles originally present in full genotypes. Correct is the percentage of correctly phased alleles from all phased. Time measures average computing requirement in seconds. One hundred simulations were used to assess performance in pigs and sheep, and true and predicted haplotypes were compared in the simulated data.