Hae-Won Uh, Jeanine J Houwing-Duistermaat, Hein Putter, Hans C van Houwelingen.
Abstract
Assigning haplotypes in a case-control study is a challenging problem. We proposed a method to quantify the information loss due to missing phase information. We determined which individuals were responsible for the information loss, and calculated how much information could be gained if the ambiguous individuals were resolved by adding parental information.
Year: 2005 PMID: 16451564 PMCID: PMC1866831 DOI: 10.1186/1471-2156-6-S1-S108
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Information loss per haplotype, based on the diagonal of the information matrix, in 100 cases, together with the R² measure.
| Haplotype nr | SNPs | Loss (cases) | max^a (cases) | %^b (cases) | R² (cases) |
| 1 | 111 | 9.51 | 37.18 | 25.57 | 0.7443 |
| 2 | 112 | 5.74 | 18.64 | 30.79 | 0.6921 |
| 3 | 121 | 8.11 | 43.28 | 18.74 | 0.8126 |
| 4 | 122 | 4.34 | 9.05 | 47.93 | 0.5206 |
| 5 | 211 | 4.49 | 10.11 | 44.39 | 0.5560 |
| 6 | 212 | 2.69 | 5.02 | 53.58 | 0.4641 |
| 7 | 221 | 5.35 | 16.07 | 33.32 | 0.6668 |
| 8 | 222 | 3.54 | 20.77 | 17.06 | 0.8293 |
^a max: the maximum information that is contained in a haplotype.
^b %: the information loss relative to the maximum information.
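The relationship between the columns of this table can be checked with a few lines of arithmetic. The sketch below assumes that the percentage column is `Loss / max` and that R² is one minus that fraction; the source does not state these formulas, but the tabulated values appear consistent with them. The names `table` and `relative_loss` are illustrative only.

```python
# Loss and max per haplotype, copied from the table above.
table = {
    "111": (9.51, 37.18),
    "112": (5.74, 18.64),
    "121": (8.11, 43.28),
    "122": (4.34, 9.05),
    "211": (4.49, 10.11),
    "212": (2.69, 5.02),
    "221": (5.35, 16.07),
    "222": (3.54, 20.77),
}


def relative_loss(loss, max_info):
    """Fraction of the maximum information that is lost (assumed definition)."""
    return loss / max_info


for hap, (loss, max_info) in table.items():
    frac = relative_loss(loss, max_info)
    # Assumed relation: R2 = 1 - relative loss.
    print(f"{hap}: loss = {100 * frac:.2f}%, R2 = {1 - frac:.4f}")
```

Because the published Loss and max values are rounded to two decimals, the recomputed percentages can differ from the table in the second decimal place.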
Loss of information in 100 cases, per individual and per haplotype. '1' and '2' represent the homozygotes 1/1 and 2/2; 'H' represents the heterozygote 1/2.
| Group | Genotype | No. | 111 | 112 | 121 | 122 | 211 | 212 | 221 | 222 | Tot. Loss^a |
| 1 | HHH | 11 | 0.241 | 0.152 | 0.139 | 0.049 | 0.049 | 0.139 | 0.152 | 0.241 | 1.163 |
| 2 | H1H | 4 | 0.249 | 0.249 | | | 0.249 | 0.249 | | | 0.996 |
| 3 | HH1 | 12 | 0.246 | | 0.246 | | 0.246 | | 0.246 | | 0.984 |
| 4 | 1HH | 15 | 0.194 | 0.194 | 0.194 | 0.194 | | | | | 0.774 |
| 5 | H2H | 8 | | | 0.091 | 0.091 | | | 0.091 | 0.091 | 0.363 |
| 6 | HH2 | 2 | | 0.083 | | 0.083 | | 0.083 | | 0.083 | 0.331 |
| 7 | 2HH | 0 | | | | | 0.195 | 0.195 | 0.195 | 0.195 | 0.780 |
| OK | | 48 | | | | | | | | | |
| Total | | 100 | 9.506 | 5.739 | 8.113 | 4.337 | 4.490 | 2.690 | 5.354 | 3.545 | 43.77 |
^a Tot. Loss: the loss of information per individual, summed over the eight haplotypes.
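The Total row can be reproduced from the per-individual losses and the group sizes. The sketch below is a consistency check, not the authors' computation. It assumes that each group with two heterozygous SNPs contributes only to the four haplotypes compatible with its genotype, and it uses a group-to-genotype assignment (group 2 = 'H1H', 3 = 'HH1', 4 = '1HH', 5 = 'H2H', 6 = 'HH2', 7 = '2HH') inferred from the column totals rather than stated in the source. The helper name `compatible_haplotypes` is hypothetical.

```python
from itertools import product


def compatible_haplotypes(genotype):
    """All haplotypes consistent with a genotype such as 'H1H' ('H' = heterozygote)."""
    options = [("1", "2") if g == "H" else (g,) for g in genotype]
    return ["".join(h) for h in product(*options)]


# (genotype, number of cases, per-individual loss for each compatible haplotype).
# ASSUMPTION: the genotype labels are inferred from the column totals,
# not read from the source table.
groups = [
    ("HHH", 11, [0.241, 0.152, 0.139, 0.049, 0.049, 0.139, 0.152, 0.241]),
    ("H1H", 4, [0.249] * 4),
    ("HH1", 12, [0.246] * 4),
    ("1HH", 15, [0.194] * 4),
    ("H2H", 8, [0.091] * 4),
    ("HH2", 2, [0.083] * 4),
    ("2HH", 0, [0.195] * 4),
]

# Accumulate group size times per-individual loss for every haplotype.
totals = {h: 0.0 for h in compatible_haplotypes("HHH")}
for genotype, count, losses in groups:
    for hap, loss in zip(compatible_haplotypes(genotype), losses):
        totals[hap] += count * loss

for hap, total in totals.items():
    print(hap, round(total, 3))
```

Under these assumptions the recomputed column totals agree with the published Total row to within about 0.01, which is the size of discrepancy expected from three-decimal rounding of the per-individual losses.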
Figure 1. Forward stepwise selection of the informative individuals based on D-optimality in 100 cases, by maximizing the determinant of the information matrix. (1) The group identification denotes the genotypes at the SNPs: '1' and '2' represent the homozygotes 1/1 and 2/2, and 'H' a heterozygote 1/2. (2) The y-axis labels are ordered by A-optimality (the 'HHH' group ranks highest and would be selected first, then 'H1H', 'HH1', etc.); the red points are ordered by D-optimality. Under D-optimality the first individuals to be selected are in the 'H1H' group, not 'HHH', so the two measures disagree. The jumps between groups indicate the correlation between parameters.
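The disagreement between the two orderings can be illustrated with a toy example. The sketch below is a generic greedy forward selection, not the authors' implementation. It assumes that each candidate group adds a small symmetric "gain" matrix to the current information matrix, picks at each step the candidate that maximizes the determinant (D-optimality), and compares this with a ranking by the trace of the gain, used here only as a simple stand-in for the A-optimality ordering. All matrices and the labels `group_A` and `group_B` are made up for illustration.

```python
def det(m):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in m]
    n = len(a)
    d = 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) < 1e-12:
            return 0.0
        if p != c:
            a[c], a[p] = a[p], a[c]
            d = -d
        d *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return d


def add(x, y):
    """Element-wise sum of two matrices of equal shape."""
    return [[xi + yi for xi, yi in zip(rx, ry)] for rx, ry in zip(x, y)]


def trace(m):
    return sum(m[i][i] for i in range(len(m)))


def forward_select_d(base, gains):
    """Greedy order: at each step add the candidate maximizing det(current + gain)."""
    current = [row[:] for row in base]
    remaining = dict(gains)
    order = []
    while remaining:
        best = max(remaining, key=lambda k: det(add(current, remaining[k])))
        current = add(current, remaining.pop(best))
        order.append(best)
    return order


# Hypothetical numbers: two strongly correlated parameters.
base = [[1.0, 0.9], [0.9, 1.0]]
gains = {
    "group_A": [[0.5, 0.45], [0.45, 0.5]],   # larger trace, reinforces the correlation
    "group_B": [[0.4, -0.3], [-0.3, 0.4]],   # smaller trace, reduces the correlation
}

d_order = forward_select_d(base, gains)
trace_order = sorted(gains, key=lambda k: -trace(gains[k]))
print("D-optimal order:", d_order)
print("trace order:", trace_order)
```

In this toy setting the trace ranking puts `group_A` first, while the determinant criterion selects `group_B` first because it reduces the correlation between the parameters. This is the same kind of discrepancy the caption describes between the 'HHH' and 'H1H' groups.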