| Literature DB >> 18466477 |
Hugues Aschard1,2, Mickaël Guedj3,4, Florence Demenais1,2.
Abstract
Genome-wide association studies raise study-design and analytical issues that are still being debated. Among them, stands the issue of reducing the number of markers to be genotyped without loss of efficiency in identifying trait loci, which can reduce the cost of studies and minimize the multiple testing problem. With this aim, we proposed a two-step strategy based on two analytical methods suited to examine sets of markers rather than single markers: the local score, which screens the genome to select candidate regions in Step 1, and FBAT-LC, a multiple-marker family-based association test used to obtain significance levels of regions at step 2. The performance of this strategy was evaluated on all replicates of Genetic Analysis Workshop 15 Problem 3 simulated data, using the answers to that problem. Overall, seven of the nine generated trait loci were detected in at least 87% of the replicates using a framework designed to handle either association with the disease or association with the severity of disease. This multiple-marker strategy was compared to the single-marker approach. By considering regions instead of single markers, this strategy minimizes the multiple testing problem and the number of false-positive results.Entities:
Year: 2007 PMID: 18466477 PMCID: PMC2367542 DOI: 10.1186/1753-6561-1-s1-s134
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Results for the first 10 regions in the first replicates
| Step 1 | Step 2 | ||||||
| Region | Chr | IDs of the 2 extreme markers bounding a region | Local Scoresa | FBAT_LC | Bonferroni corrected | Trait loci in the region | |
| 1 | 6 | 128 | 162 | 8235.4874 | 0.000000 | 0.000000 | DR, C, D |
| 2 | 11 | 387 | 396 | 202.3407 | 0.000000 | 0.000000 | F |
| 3 | 18 | 269 | 269 | 44.5575 | 0.000000 | 0.000000 | E |
| 4 | 18 | 10 | 11 | 28.2714 | 0.042545 | 0.425450 | |
| 5 | 10 | 355 | 357 | 16.8203 | 0.476407 | 1.000000 | |
| 6 | 4 | 347 | 348 | 11.7416 | 0.440749 | 1.000000 | |
| 7 | 6 | 355 | 355 | 11.6840 | 0.002913 | 0.029130 | |
| 8 | 3 | 58 | 58 | 11.5742 | 0.137617 | 1.000000 | |
| 9 | 8 | 392 | 392 | 11.5131 | 0.387798 | 1.000000 | |
| 10 | 7 | 94 | 95 | 11.4852 | 0.706698 | 1.000000 | |
aLocal Scores obtained from the first replicate of case-control data.
bp-Values obtained from FBAT-LC applied to regions selected in Step 1 in the first replicate of family data.
Comparison of multiple marker and single marker strategies
| Correction for multiple testing | Sensitivitya | Specificitya | FDRa | % of all TL detectedb | |
| FBAT-LC | |||||
| Multiple markersc | None | 0.97 (0.08) | 0.95 (0.03) | 0.37 (0.17) | 0.56 (0.02) |
| Benjamini & Hochberg | 0.95 (0.10) | 0.99 (0.09) | 0.03 (0.09) | 0.55 (0.02) | |
| Bonferroni | 0.94 (0.11) | 0.99 (0.09) | 0.03 (0.09) | 0.55 (0.03) | |
| Single-marker FBAT | |||||
| Single markerd | None | 0.97 (0.09) | 0.78 (0.03) | 0.83 (0.02) | 0.56 (0.02) |
| Benjamini & Hochberg | 0.95 (0.11) | 0.87 (0.01) | 0.74 (0.02) | 0.55 (0.02) | |
| Bonferroni | 0.95 (0.11) | 0.87 (0.01) | 0.74 (0.03) | 0.55 (0.02) | |
aMean (standard deviation) of sensitivity, specificity, and false-discovery rate of FBAT-LC (multiple markers) and single-marker FBAT computed over 50 replicates of family data.
bMean (standard deviation) of the proportion of all trait loci (TL) selected in Step 1 and detected in Step 2 by FBAT at the 5% level over 50 replicates.
c50 regions were selected in Step 1 using the local score method.
d115 SNPs were selected in Step 1 (see text) using the Pearson chi-square applied to case-control genotypic contingency table.
Performance of the multiple-marker strategy in disease severity analysis
| Correction for multiple testing | FBAT LC | % of severity loci detectedb | ||
| Sensitivitya | Specificitya | FDRa | ||
| Without correction | 1.00 (0.00) | 0.92 (0.03) | 0.75 (0.13) | 0.91 (0.19) |
| Benjamini & Hochberg | 0.96 (0.20) | 0.99 (0.01) | 0.14 (0.24) | 0.87 (0.26) |
| Bonferroni | 0.96 (0.20) | 0.99 (0.01) | 0.14 (0.24) | 0.87 (0.26) |
aMean (standard deviation) of sensitivity, specificity and false-discovery rate of FBAT-LC test computed over 50 replicates of family data when 50 regions were selected in Step 1.
bMean (standard deviation) of the proportion of severity loci selected in Step 1 and detected in Step 2 by FBAT at the 5% level over 50 replicates when 50 regions were selected in Step 1.