| Literature DB >> 18466490 |
Abstract
Construction of precise confidence sets of disease gene locations after initial identification of linked regions can improve the efficiency of the ensuing fine mapping effort. We took the confidence set inference, a framework proposed and implemented using the Mean test statistic (CSI-Mean) and improved the efficiency substantially by using a likelihood ratio test statistic (CSI-MLS). The CSI framework requires knowledge of some disease-model-related parameters. In the absence of prior knowledge of these parameters, a two-step procedure may be employed: 1) the parameters are estimated using a coarse map of markers; 2) CSI-Mean or CSI-MLS are applied to construct the confidence sets of the disease gene locations using a finer map of markers, assuming the estimates from Step 1 for the required parameters. In this article we show that the advantages of CSI-MLS over CSI-Mean, previously demonstrated when the required parameters are known, are preserved in this two-step procedure, using both the simulated and real data contributed to Problems 2 and 3 of Genetic Analysis Workshop 15. In addition, our result suggests that microsatellite data, when available, should be used in Step 1. Also explored in detail is the effect of the absence of parental genotypes on the performance of CSI-MLS.Entities:
Year: 2007 PMID: 18466490 PMCID: PMC2367514 DOI: 10.1186/1753-6561-1-s1-s146
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Root mean squared errors of the estimates of the disease location, z0, z1, and z2
| No. ASPs | Strategy | With parental genotypes | Without parental genotypes | ||||||
| Location (cM) | z0 | z1 | z2 | Location (cM) | z0 | z1 | z2 | ||
| MS1 | 4.9 | 0.020 | 0.033 | 0.035 | 5.6 | 0.023 | 0.035 | 0.034 | |
| 250 | MSINT | 3.8 | 0.020 | 0.033 | 0.033 | 4.8 | 0.024 | 0.041 | 0.040 |
| SNP1 | 2.4 | 0.018 | 0.031 | 0.033 | 4.7 | 0.028 | 0.041 | 0.036 | |
| MS1 | 3.7 | 0.016 | 0.026 | 0.027 | 4.0 | 0.017 | 0.027 | 0.025 | |
| 500 | MSINT | 1.9 | 0.016 | 0.026 | 0.024 | 2.3 | 0.017 | 0.037 | 0.035 |
| SNP1 | 1.3 | 0.014 | 0.025 | 0.023 | 3.4 | 0.020 | 0.038 | 0.028 | |
| MS1 | 3.6 | 0.015 | 0.021 | 0.023 | 3.6 | 0.014 | 0.024 | 0.021 | |
| 750 | MSINT | 1.5 | 0.013 | 0.022 | 0.020 | 1.8 | 0.015 | 0.034 | 0.033 |
| SNP1 | 1.3 | 0.012 | 0.020 | 0.019 | 2.9 | 0.019 | 0.036 | 0.026 | |
Figure 1Density of the estimates of . Density of the estimates of z1 and z2, with and without parental genotypes, using 750 ASPs. The vertical lines represent the true values of the parameters.
Properties of 95% confidence sets constructed with CSI-MLS and CSI-Mean
| No. ASPs | Strategy | With parental genotypesa | Without parental genotypesa | ||||
| CSI-MLS | CSI-Mean | RR (%) | CSI-MLS | CSI-Mean | RR (%) | ||
| 250 | MS1 | 28.1 (1.00) | 36.1 (1.00) | 22 | 29.6 (1.00) | 38.7 (1.00) | 24 |
| MSINT | 22.8 (1.00) | 30.4 (1.00) | 25 | 23.2 (0.98) | 33.4 (1.00) | 31 | |
| SNP1 | 24.6 (1.00) | 31.6 (1.00) | 22 | 26.8 (1.00) | 37.3 (1.00) | 28 | |
| TRUE | 21.5 (0.91) | 27.2 (0.95) | 21 | 29.8 (0.96) | 35.5 (0.94) | 16 | |
| 500 | MS1 | 21.8 (1.00) | 28.8 (1.00) | 24 | 21.8 (1.00) | 31.6 (1.00) | 31 |
| MSINT | 15.4 (1.00) | 21.5 (1.00) | 28 | 13.5 (0.97) | 23.4 (1.00) | 42 | |
| SNP1 | 17.0 (1.00) | 23.1 (1.00) | 26 | 18.6 (0.99) | 29.9 (1.00) | 34 | |
| TRUE | 15.2 (0.92) | 20.1 (0.97) | 24 | 24.5 (0.95) | 31.8 (0.98) | 23 | |
| 750 | MS1 | 18.4 (1.00) | 24.1 (1.00) | 24 | 17.6 (1.00) | 27.1 (1.00) | 35 |
| MSINT | 12.2 (1.00) | 16.7 (1.00) | 27 | 9.1 (0.95) | 17.8 (1.00) | 49 | |
| SNP1 | 14.2 (1.00) | 18.6 (1.00) | 24 | 14.4 (0.98) | 24.9 (1.00) | 42 | |
| TRUE | 11.9 (0.94) | 16.3 (0.96) | 27 | 21.3 (0.97) | 28.5 (0.94) | 25 | |
aEmpirical proportions of confidence sets that contain the disease gene locus are given in parentheses. The relative reduction in length obtained by using CSI-MLS in place of CSI-Mean is given by
Figure 2Confidence sets provided by CSI-Mean and CSI-MLS for the NARAC data. The KAC curve is plotted. The two horizontal lines provide the confidence intervals constructed using CSI-MLS (solid) and CSI-Mean (dashed). The disease locus is indicated with the vertical line.