| Literature DB >> 18466481 |
Mike Schmidt1, Xuejun Qin, Eden R Martin, Elizabeth R Hauser, Silke Schmidt.
Abstract
The incorporation of disease-associated covariates into studies aiming to identify susceptibility genes for complex human traits is a challenging problem. Accounting for such covariates in genetic linkage and association analyses may help reduce the genetic heterogeneity inherent in these complex phenotypes. For Genetic Analysis Workshop 15 (GAW15) Problem 3 simulated data, our goal was to compare the power of several two-stage study designs to identify rheumatoid arthritis-related genes on chromosome 9 (disease severity), 11 (IgM), and 18 (anti-cyclic citrinullated protein), with knowledge of the answers. Five study designs incorporating an initial linkage step, followed by a case-selection scheme and case-control association analysis by logistic regression, were considered. The linkage step was either qualitative-trait linkage analysis as implemented in MERLIN-nonparametric linkage (NPL), or quantitative-trait locus analysis as implemented in MERLIN-REGRESS. A set of cases representing either one case from each available family, one case per linked family (NPL >/= 0), or one case from each family identified by ordered-subset analysis was chosen for comparison with the full set of 2000 simulated controls. As expected, the performance of these study designs depended on the disease model used to generate the data, especially the simulated allele frequency difference between cases and controls. The quantitative trait loci analysis performed well in identifying these loci, and the power to identify disease-associated alleles was increased by using ordered-subset analysis as a case selection tool.Entities:
Year: 2007 PMID: 18466481 PMCID: PMC2367505 DOI: 10.1186/1753-6561-1-s1-s138
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Definition of study designsa
| Design | Linkage software | Case selection strategy | Notes |
| A | MERLIN | All | |
| B | MERLIN | "LINKED BEST" [5] | |
| C | MERLIN plus OSA | OSA subset if ≥50 families | OSA for localization |
| D | MERLIN-REGRESS | All | |
| E | MERLIN-REGRESS (M-R) plus OSA | OSA subset if ≥50 families and difference in OSA and M-R peak ≤20 cM | M-R for localization |
aLinkage region is centered on maximum LOD score (for all families or OSA subset) and defines SNPs for Stage 2 association analysis.
Characteristics of simulated loci
| Averagea | ||||||
| Chr | Locus (closest SNP) | Distance between true locus and closest SNP (cM) | NPL (SD) at locusb | SNP MAFc in cases (one per family, | SNP MAF in controls ( | |
| 9 | G (186) | 0.08 | 0.01 (0.30) | 0.374 | 0.384 | 0.01 |
| 11 | F (389) | 0.01 | 0.34 (0.49) | 0.275 | 0.501 | 0.94 |
| 18 | E (269) | 0.05 | 0.92 (0.81) | 0.297 | 0.223 | 0.15 |
aAverages calculated across 100 replicates
bNPL, nonparametric multipoint LOD score from MERLIN when analyzing binary RA affection status
cMAF, minor allele frequency
Power estimates for study designs
| Proportion of reps. meeting: | |||||||||
| Linkage threshold | Association threshold (avg. no. SNPs analyzed)c | Avg. no. cases analyzed | Overall power | ||||||
| Stringenta | Looseb | Stringent | Loose | Stringent | Loose | Stringent | Loose | ||
| 9 (186) | A | 0 | 0.14 | 0 (n/a) | 0 (144) | 1500 | 1500 | 0 | 0 |
| B | 0 | 0.14 | 0 (n/a) | 0 (144) | 983 | 987 | 0 | 0 | |
| C | 0.13 | 0.64 | 0.23 (48) | 0.25 (145) | 127 | 136 | 0.03 | 0.16 | |
| D | 0.83 | 0.96 | 0.01 (48) | 0 (147) | 1500 | 1500 | 0.01 | 0 | |
| E | 0.2 | 0.65 | 0.30 (48) | 0.20 (148) | 129 | 121 | 0.06 | 0.13 | |
| 11 (389) | A | 0.06 | 0.24 | 1 (52) | 1 (147) | 1500 | 1500 | 0.06 | 0.24 |
| B | 0.06 | 0.24 | 1 (52) | 1 (147) | 988 | 985 | 0.06 | 0.24 | |
| C | 0.11 | 0.62 | 1 (48) | 0.92 (147) | 258 | 273 | 0.11 | 0.57 | |
| D | 0.98 | 1 | 1 (44) | 1 (146) | 1500 | 1500 | 0.98 | 1 | |
| E | 0.23 | 0.56 | 1 (46) | 1 (147) | 281 | 290 | 0.23 | 0.56 | |
| 18 (269) | A | 0.21 | 0.64 | 1 (28) | 1 (97) | 1500 | 1500 | 0.21 | 0.64 |
| B | 0.21 | 0.64 | 1 (28) | 1 (97) | 998 | 988 | 0.21 | 0.64 | |
| C | 0.53 | 0.91 | 1 (28) | 1 (96) | 372 | 390 | 0.53 | 0.91 | |
| D | 0.73 | 0.99 | 1 (27) | 1 (95) | 1500 | 1500 | 0.73 | 0.99 | |
| E | 0.44 | 0.88 | 1 (27) | 1 (95) | 370 | 382 | 0.44 | 0.88 | |
aStringent linkage thresholds are LOD score ≥ 1.0, OSA p-value ≤ 0.05, 10-cM linkage region.
bLoose thresholds are LOD score ≥ 0.5, OSA p-value ≤ 0.5, 40-cM linkage region.
cThe proportion of replicates meeting the Bonferroni-corrected association threshold, average number of cases and number of SNPs analyzed were calculated across replicates meeting the linkage threshold.