| Literature DB >> 16451612 |
Pamela A McCaskie1, Kim W Carter, Simon R McCaskie, Lyle J Palmer.
Abstract
We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes.Entities:
Mesh:
Year: 2005 PMID: 16451612 PMCID: PMC1866783 DOI: 10.1186/1471-2156-6-S1-S151
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Figure 1Plots of the LD coefficient D' for region 2 in the Aipotu population. A, Full dataset with no missing values; B, 1% missing genotypic data; C, 5% missing genotypic data; D, 10% missing genotypic data.
Figure 2Varying proportions of strong D' with increasing degrees of missing data. Histograms of the proportion of pair-wise comparisons exhibiting a D' value greater than 0.75 for varying degrees of missing data.
Haplotype results for varying degrees of missing data
| OR (95% CI) | SE (logOR) (95% CI) | ||
| H12121 | |||
| Complete data | 0.52 (0.49, 0.57) | -0.64 (-0.71, -0.57) | 0.011 (0.005, 0.022) |
| 1% Missing data | 0.57 (0.53, 0.61) | -0.56 (-0.64, -0.49) | 0.027 (0.012, 0.052) |
| 5% Missing data | 0.58 (0.53, 0.63) | -0.54 (-0.63, -0.46) | 0.034 (0.013, 0.070) |
| 10% Missing data | 0.52 (0.46, 0.58) | -0.66 (-0.77, -0.54) | 0.010 (0.002, 0.029) |
| H12221 | |||
| Complete data | 0.62 (0.58, 0.65) | -0.48 (-0.54, -0.43) | 0.021 (0.010, 0.037) |
| 1% Missing data | 0.61 (0.57, 0.64) | -0.50 (-0.57, -0.44) | 0.018 (0.007, 0.034) |
| 5% Missing data | 0.59 (0.54, 0.64) | -0.52 (-0.61, -0.44) | 0.014 (0.004, 0.034) |
| 10% Missing data | 0.59 (0.53, 0.65) | -0.53 (-0.63, -0.43) | 0.014 (0.003, 0.039) |