| Literature DB >> 18466512 |
Weihua Zhang1,2, Winston Lau3, Cheng Hu4, Tai-Yue Kuo3.
Abstract
We studied the impact of marker density on the accuracy of association mapping using Genetic Analysis Workshop 15 simulated dense single-nucleotide polymorphism (SNP) data on chromosome 6. A total of 1500 cases and 2000 unaffected controls genotyped for 17,820 SNPs were analyzed. We applied the approach that combines information from multiple SNPs under the framework of the Malecot model and composite likelihood to non-overlapping regions of the chromosome. We successfully detected the associations with disease Loci C and D and predicted their locations as small as zero distance to Locus C when it was "typed" and 112 kb from the untyped rare Locus D. Reducing marker density decreased the accuracy of location estimates. However, the predicted locations were robust to variations in the number of SNPs. Generally, the linkage disequilibrium (LD) map reflecting distances between markers in relation to LD produced higher accuracy than the physical map. We also demonstrated that SNP selection based on equal LD distance outperforms that based on equal physical distance or SNP tagging. Furthermore, ignoring rare SNPs diminished the ability to detect rare causal variants.Entities:
Year: 2007 PMID: 18466512 PMCID: PMC2367596 DOI: 10.1186/1753-6561-1-s1-s166
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Association mapping of disease Loci C and D on chromosome 6
| Regiona | No. SNPs | ΛA - ΛD | χ23 | |||
| 26 | 128 | 24882 | 68 | 14 | 0.002471 | 0.311346 |
| 27 | 239 | 26121 | 793 | 40 | <10-7 | 0.000001 |
| 28 | 348 | 31299 | 5176 | 128 | <10-12 | <0.000001 |
| 29 | 176 | 32540 | 33058 | 322 | <10-12 | <0.000001 |
| 30 | 153 | 33962 | 295 | 25 | 0.000017 | 0.002129 |
| 31 | 134 | 35638 | 103 | 15 | 0.001929 | 0.243096 |
| 32 | 127 | 37776 | 141 | 27 | 0.000007 | 0.000926 |
| 33 | 147 | 39432 | 50 | 13 | 0.005554 | 0.699754 |
aA segment of consecutive regions of 10 LDUs showing nominal significant association with RA (p < 0.05). See Methods for the meaning of other symbols. Loci C and D were in regions 29 and 32 at locations of 32485 and 37234 kb, respectively.
bPc is Bonferroni corrected p-value for multiple tests of 126 regions (p × 126).
Candidate regions of disease Loci C and D with rare SNPs included
| Location error with rare SNPs | |||||||
| Locus | Map | ΛA - ΛD | χ23 | Included | Removed | ||
| C | LD | 32557 | 30693 | 360 | <10-12 | 72 | 55 |
| LD, tilt | 32518 | 30799 | 197 | <10-12 | 34 | 21 | |
| kb | 32506 | 28632 | 496 | <10-12 | 22 | 14 | |
| D | LD | 37358 | 154 | 22 | 0.000017 | 124 | 542 |
| LD, tilt | 37368 | 156 | 28 | 0.000003 | 130 | 546 | |
| kb | 37368 | 148 | 17 | 0.000666 | 130 | 954 | |
Density and accuracy for Locus C – SNP selection by order
| msSNP | Location error by the composite likelihood approach | |||||||
| Causal SNP out | Causal SNP in | |||||||
| SNP density (kb/SNP) | No. SNPs | No. regions | χ21 | Location error | LD | kb | LD | kb |
| Full (11) | 15701 | 126 | 2324 | 153 | 57 | 13 | 55 | 14 |
| 1/2 (22) | 7850 | 125 | 1762 | -2 | 5 | -19 | 5 | -20 |
| 1/3 (33) | 5233 | 118 | 2324 | 153 | 153 | 40 | 6 | 40 |
| 1/4 (44) | 3925 | 106 | 1762 | -2 | -65 | -56 | -57 | -53 |
| 1/5 (54) | 3140 | 94 | 2274 | 42 | -24 | -35 | -15 | -36 |
| 1/6 (65) | 2616 | 82 | 1285 | 20 | 20 | 20 | 10 | 15 |
| 1/8 (87) | 1962 | 64 | 1601 | 65 | -58 | -64 | -47 | -58 |
| 1/10 (109) | 1570 | 52 | 726 | -106 | -55 | -59 | -24 | -46 |
| 1/15 (163) | 1046 | 34 | 486 | -887 | 294 | 26 | 0 | 3 |
| 1/20 (217) | 785 | 26 | 726 | -106 | -97 | -160 | -26 | -43 |
| 1/25 (272) | 628 | 20 | 348 | -9 | 69 | -120 | 60 | -79 |
| 1/30 (326) | 523 | 17 | 229 | 188 | 362 | -25 | 0 | -25 |
Disease variant C (χ21 = 1916) was not present except in the full data set or specified.
Density and accuracy – SNP selection by tagging or equidistancea
| Locus C | Locus D | ||||||
| No. SNPs (kb/SNP) | Tagger | E_LD | E_kb | Tagger | E_LD | E_kb | |
| Full | 1658 (10) | 20 | 20 | 20 | 130 | 130 | 130 |
| 0.8 (0.013,7) | 1080b (15) | 35 | 20 | 30 | 130 | 124 | 130 |
| 0.6 (0.025,10) | 874 (18) | 56 | -16 | 27 | 545 | 124 | 551 |
| 0.4 (0.047,15) | 657 (24) | 106 | 42 | 63 | 124 | 123 | 117 |
| 0.2 (0.099,27) | 421 (38) | 125 | 14 | -23 | 537 | 112 | 231 |
aLocation error for SNPs selected by Tagger or equal LDU (E_LD) or kb (E_kb) distance in a candidate region of 15805.710 kb with rare SNPs included. Regions were fixed at 10 LDUs for Loci C (30997–33398 kb) and D (36784–37792 kb). Disease variant C was not present except in the full data set. Tilted LD map.
b1079 for E_LD.