| Literature DB >> 18466507 |
Kelly Cho1, Qiong Yang, Josée Dupuis.
Abstract
The presence of linkage disequilibrium violates the underlying assumption of linkage equilibrium in most traditional multipoint linkage approaches. Studies have shown that such violation leads to bias in qualitative trait linkage analysis when parental genotypes are unavailable. Appropriate handling of marker linkage disequilibrium can avoid such false positive evidence. Using the rheumatoid arthritis simulated data from Genetic Analysis Workshop 15, we examined and compared the following three approaches to handle linkage disequilibrium among dense markers in both qualitative and quantitative trait linkage analyses: a simple algorithm; SNPLINK, methods for marker selection; and MERLIN-LD, a method for modeling linkage disequilibrium by creating marker clusters. In analysis ignoring linkage disequilibrium between markers, we observed LOD score inflation only in the affected sib-pair linkage analysis without parental genotypes; no such inflation was present in the quantitative trait locus linkage analysis with severity as our phenotype with or without parental genotypes. Using methods to model or adjust for linkage disequilibrium, we found a substantial reduction of inflation of LOD score in affected sib-pair linkage analysis. Greater LOD score reduction was observed by decreasing the amount of tolerable linkage disequilibrium among markers selected or marker clusters using MERLIN-LD; the latter approach showed most reduction. SNPLINK performed better with selected markers based on the D' measure of linkage disequilibrium as opposed to the r2 measure and outperformed the simple algorithm. Our findings reiterate the necessity of properly handling dense markers in linkage analysis, especially when parental genotypes are unavailable.Entities:
Year: 2007 PMID: 18466507 PMCID: PMC2367569 DOI: 10.1186/1753-6561-1-s1-s161
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1ASP analysis: average maximum LOD score and average number of markers (D'). Summary results from all 100 replicates with families without parental genotypes (except for the last panel, which shows the results from the complete data with full parental genotypes). Maximum LOD scores are shown in box plots with four panels: A, unadjusted, data with missing parental genotypes (pink); B, simple algorithm, with four D' thresholds (blue); C, SNPLINK, with four D' thresholds (green); and D, complete data unadjusted (red).
Figure 2ASP analysis: average maximum LOD score and average number of markers* (. Summary results from all 100 replicates with families without parental genotypes (except for the last panel, which shows the results from the complete data with full parental genotypes). Maximum LOD scores are shown in box plots with five panels: A, unadjusted, data with missing parental genotypes (pink); B, simple algorithm, with four r2 thresholds (blue); C, SNPLINK, with four r2 thresholds (green); D, MERLIN LD, with four r2 thresholds (purple); and E, complete data unadjusted (red).