| Literature DB >> 19278541 |
Lucy Crooks1, Goutam Sahana, Dirk-Jan de Koning, Mogens Sandø Lund, Orjan Carlborg.
Abstract
As part of the QTLMAS XII workshop, a simulated dataset was distributed and participants were invited to submit analyses of the data based on genome-wide association, fine mapping and genomic selection. We have evaluated the findings from the groups that reported fine mapping and genome-wide association (GWA) efforts to map quantitative trait loci (QTL). Generally the power to detect QTL was high and the Type 1 error was low. Estimates of QTL locations were generally very accurate. Some methods were much better than others at estimating QTL effects, and with some the accuracy depended on simulated effect size or minor allele frequency. There were also indications of bias in the effect estimates. No epistasis was simulated, but the two studies that included searches for epistasis reported several interacting loci, indicating a problem with controlling the Type I error rate in these analyses. Although this study is based on a single dataset, it indicates that there is a need to improve fine mapping and GWA methods with respect to estimation of genetic effects, appropriate choice of significance thresholds and analysis of epistasis.Entities:
Year: 2009 PMID: 19278541 PMCID: PMC2654496 DOI: 10.1186/1753-6561-3-s1-s2
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Simulated QTL.
| M1 | 1 | 20.00 | 0.62 | 0.28 | 0.15 | 3.50 | 0.61 |
| S1 | 1 | 31.87 | 0.01 | 0.44 | 0.00 | 0.00 | 0.06 |
| S2 | 1 | 33.16 | 0.00 | 0.30 | 0.00 | 0.00 | 0.04 |
| M2 | 1 | 40.00 | 0.56 | 0.07 | 0.04 | 0.91 | 0.62 |
| S3 | 1 | 50.37 | 0.06 | 0.46 | 0.00 | 0.04 | 0.08 |
| S4 | 1 | 52.50 | 0.05 | 0.40 | 0.00 | 0.03 | 0.07 |
| S5 | 1 | 62.21 | 0.00 | 0.29 | 0.00 | 0.00 | 0.02 |
| M3 | 1 | 77.23 | 0.37 | 0.29 | 0.06 | 1.29 | 0.42 |
| S6 | 1 | 86.68 | 0.01 | 0.30 | 0.00 | 0.00 | 0.09 |
| S7 | 1 | 93.99 | 0.01 | 0.47 | 0.00 | 0.00 | 0.01 |
| S8 | 2 | 2.25 | 0.01 | 0.39 | 0.00 | 0.00 | 0.06 |
| S9 | 2 | 6.52 | 0.07 | 0.38 | 0.00 | 0.06 | 0.09 |
| M4 | 2 | 27.41 | 0.35 | 0.44 | 0.06 | 1.38 | 0.44 |
| M5 | 2 | 30.00 | 0.33 | 0.21 | 0.04 | 0.82 | 0.25 |
| S10 | 2 | 32.49 | 0.04 | 0.41 | 0.00 | 0.02 | 0.07 |
| S11 | 2 | 45.71 | 0.01 | 0.09 | 0.00 | 0.00 | 0.07 |
| S12 | 2 | 48.22 | 0.04 | 0.08 | 0.00 | 0.01 | 0.06 |
| M6 | 2 | 48.62 | 0.37 | 0.40 | 0.07 | 1.50 | 0.39 |
| M7 | 2 | 74.91 | 0.50 | 0.18 | 0.07 | 1.63 | 0.46 |
| S13 | 2 | 89.04 | 0.12 | 0.22 | 0.01 | 0.12 | 0.15 |
| S14 | 2 | 93.54 | 0.25 | 0.32 | 0.03 | 0.61 | 0.22 |
| S15 | 2 | 95.66 | 0.02 | 0.29 | 0.00 | 0.01 | 0.12 |
| S16 | 2 | 97.83 | 0.13 | 0.41 | 0.01 | 0.19 | 0.14 |
| S17 | 3 | 0.70 | 0.03 | 0.00 | 0.00 | 0.00 | -d |
| S18 | 3 | 7.89 | 0.01 | 0.46 | 0.00 | 0.00 | 0.04 |
| M8 | 3 | 14.91 | 0.30 | 0.40 | 0.04 | 0.98 | 0.27 |
| S19 | 3 | 21.07 | 0.02 | 0.26 | 0.00 | 0.00 | 0.00 |
| S20 | 3 | 29.81 | 0.07 | 0.29 | 0.00 | 0.04 | 0.05 |
| M9 | 3 | 60.00 | 0.68 | 0.07 | 0.06 | 1.29 | 0.70 |
| M10 | 4 | 3.21 | 0.61 | 0.39 | 0.18 | 4.01 | 0.64 |
| S21 | 4 | 3.44 | 0.08 | 0.32 | 0.00 | 0.06 | 0.10 |
| S22 | 4 | 3.88 | 0.02 | 0.23 | 0.00 | 0.00 | 0.02 |
| S23 | 4 | 10.00 | 0.01 | 0.04 | 0.00 | 0.00 | 0.06 |
| S24 | 4 | 16.35 | 0.00 | 0.36 | 0.00 | 0.00 | 0.11 |
| S25 | 4 | 19.84 | 0.07 | 0.47 | 0.00 | 0.05 | 0.10 |
| M11 | 4 | 36.93 | 0.34 | 0.24 | 0.04 | 0.95 | 0.37 |
| S26 | 4 | 69.56 | 0.00 | 0.08 | 0.00 | 0.00 | 0.01 |
| M12 | 4 | 76.06 | 0.58 | 0.41 | 0.16 | 3.70 | 0.58 |
| M13 | 4 | 96.49 | 0.29 | 0.19 | 0.03 | 0.59 | 0.38 |
| M14 | 5 | 5.15 | 0.18 | 0.21 | 0.01 | 0.24 | 0.25 |
| S27 | 5 | 12.98 | 0.09 | 0.44 | 0.00 | 0.10 | 0.09 |
| S28 | 5 | 28.64 | 0.00 | 0.13 | 0.00 | 0.00 | 0.05 |
| S29 | 5 | 68.39 | 0.12 | 0.44 | 0.01 | 0.15 | 0.17 |
| S30 | 5 | 68.48 | 0.00 | 0.43 | 0.00 | 0.00 | 0.02 |
| S31 | 5 | 72.54 | 0.00 | 0.12 | 0.00 | 0.00 | 0.06 |
| S32 | 5 | 77.02 | 0.13 | 0.25 | 0.01 | 0.14 | 0.15 |
| S33 | 5 | 80.00 | 0.08 | 0.11 | 0.00 | 0.03 | 0.05 |
| S34 | 5 | 82.14 | 0.01 | 0.36 | 0.00 | 0.00 | 0.08 |
| M15 | 5 | 93.50 | 0.75 | 0.26 | 0.22 | 4.97 | 0.75 |
| S35 | 5 | 98.32 | 0.01 | 0.45 | 0.00 | 0.00 | 0.02 |
aQTL labelled M are major QTL, known to be detectable in this dataset based on the results from our multiple regression. QTL labelled S are secondary QTL that were not detected, with the significance threshold used, in our multiple regression. bChromosome. cAverage effect of allelic substitution (absolute value). dCould not be estimated because the QTL was fixed in the population.
Figure 1Chromosomal positions of simulated QTL. Each simulated chromosome (Chr) is 100 cM long. QTL are indicated on the right-hand side of each chromosome and their position in cM on the left-hand side. No QTL were simulated on chromosome 6.
Summary of studies.
| LABayes [ | LA | multiple | multi-markerb | fixed, additive | sex + generation | Bayesian | only every tenth marker used |
| LDBayes [ | LD | multiple | single marker | random, additive | - | Bayesian | |
| LDLA1 [ | LD | multiple | single marker | fixed, additive +dominance | polygenic | mixed model, REMLc and FSd | markers selected by allele frequency difference in high/low offspring per sire |
| LDLA | single | haplotype (2) | random, additive | polygenic | variance component, REML | markers selected by allele frequency difference in high/low offspring per sire | |
| LDmulti [ | LD | multiple | single marker | fixed, additive +dominance +epistasis | pre-correction for polygenic + sex + generation | linear regression, least squares and FS | mixed model, REML used for pre-correction |
| LDHap [ | LD | single | haplotype (10e) | fixed, additive | pre-correction for polygenic + sex + generation | phylogeny building, cluster analysisf | mixed model, MLg for pre-correction, maximum in 10 cM interval |
| LD | single | single marker | fixed, additive | polygenic + sex + generation | mixed model, ML | maximum in 5 cM interval, explored epistasis | |
| LDLA2 [ | LA | two | single marker | random, additive | polygenic | variance component, REML | only marker data from last two generations used |
| LDLA | single | haplotype (10) | random, additive | polygenic | variance component, REML | only marker data from last two generations used, only on most significant region per chromosome from LA | |
| LD | single | haplotype (3) | random, additive | polygenic | variance component, REML | only marker data from last two generations used, only on most significant region per chromosome from LA | |
aIn brackets is the number of markers in a haplotype. bProbability of QTL genotype is conditional on all marker data (at reduced 1 cM density) for the chromosome. cRestricted maximum likelihood. dForward selection. eAt least 10 markers used. fUsing Blossoc [16]. gMaximum likelihood.
Threshold criteria used in the studies and description of epistatic analyses.
| LABayes | 2× ln(Bayes factor) ≥ 3 | 0.08 | We equated 2× ln(Bayes factor) with a likelihood |
| LDBayes | - | - | No significance tests were performed. |
| LDLA1 | LD: F > 4a | 0.007 | Tests were only performed on markers that had |
| LDLA: LRTb > 12.8 | 0.0003c | Tests were only performed on markers that had | |
| LDmulti | 8×10-6 | An epistatic analysis was performed. | |
| LDHap | haplotype: HQd ≥ 15 | 2×10-9e | |
| single marker: LRT >32.8 | 10-8 | An epistatic analysis was performed | |
| LDLA2 | LRT > 6 | 0.014c | - |
aLowest F-to-enter value reported. bLikelihood ratio test statistic. cFrom a χ2 approximation with one degree of freedom. dHannan-Quinn criteria, which is similar to 2× ln(Bayes factor). eFrom regression of p-values obtained by permutation test (108 replicates) for chromosome 1 with raw data against HQ score (Ledur, pers. comm.).
Comparison of M-QTL and reported QTL.
| M1 | 1 | 20.0 | 0.62 | 0.28 | 11.8 | 0.61 | 21 | 0.55 | 19.5 | 0.66 | 23.2 | -l | 19.6 | 0.31 | 20.0 | 0.71 | 19.5 | 0.12 |
| M2 | 1 | 40.0 | 0.56 | 0.07 | 3.1 | 0.62 | 41 | 0.67 | 39.3 | 0.59 | 41.5 | 0.41 | 40.2 | 0.12 | 40.2 | 0.78 | ||
| M3 | 1 | 77.2 | 0.37 | 0.29 | 4.4 | 0.42 | 76 | 0.30 | 77.7 | 0.48 | 77.8 | 0.23 | 77.8 | 0.40 | 76.6 | 0.04 | ||
| M4 | 2 | 27.4 | 0.35 | 0.44 | 4.7 | 0.44 | 24.9 | 0.43 | 27.0 | 0.22 | 26.7 | 0.43 | 26.0 | 0.12 | ||||
| M5 | 2 | 30.0 | 0.33 | 0.21 | 2.8 | 0.25 | 29 | 0.58 | 32.6 | 0.22 | ||||||||
| M6 | 2 | 48.6 | 0.37 | 0.40 | 5.1 | 0.39 | 50 | 0.46 | 48.2 | 0.42 | 48.3 | 0.29 | 48.3 | 0.18 | 48.7 | 0.45 | 53.2 | 0.10 |
| M7 | 2 | 74.9 | 0.50 | 0.18 | 5.5 | 0.46 | ||||||||||||
| M8 | 3 | 14.9 | 0.30 | 0.40 | 3.3 | 0.27 | 13.3 | 0.16 | 13.3 | 0.35 | 11.9 | 0.07 | ||||||
| M9 | 3 | 60.0 | 0.68 | 0.07 | 4.4 | 0.70 | 60.1 | -l | ||||||||||
| M10 | 4 | 3.2 | 0.61 | 0.39 | 13.6 | 0.64 | 4 | 0.78 | 3.4 | 0.55 | 3.3 | 0.49 | 3.3 | 0.33 | 3.2 | 0.59 | 3.1 | 0.49 |
| M11 | 4 | 36.9 | 0.34 | 0.24 | 3.2 | 0.37 | 36.3 | 0.40 | ||||||||||
| M12 | 4 | 76.1 | 0.58 | 0.41 | 12.5 | 0.57 | 77 | 0.50 | 76.5 | 0.52 | 76.5 | 0.50 | 76.5 | 0.24 | 76.5 | 0.55 | ||
| M13 | 4 | 96.5 | 0.29 | 0.19 | 2.0 | 0.39 | 98 | 0.41 | 99.2 | 0.4 | 96.5 | 0.32 | 95.2 | -l | ||||
| M14 | 5 | 5.1 | 0.18 | 0.21 | 0.8 | 0.25 | 2 | 0.35 | ||||||||||
| M15 | 5 | 93.5 | 0.75 | 0.26 | 16.8 | 0.75 | 95 | 0.72 | 95.5 | 0.6 | 93.5 | 0.36 | 93.5 | 0.63 | 93.9 | 0.18 | ||
aChromosome. bLocation. cAverage effect of allelic substitution (absolute value). dMinor allele frequency. ePercentage of genetic variance explained by the QTL. fMultiple regression on known QTL genotypes. gOne QTL was falsely identified at 10 cM on chromosome 4. hReported estimate of the additive effect. iOnly the positions with the 16 largest effect estimates were included. Five QTL were falsely identified at 52.6 cM on chromosome 1, 65.4 cM on chromosome 3, and 3.0, 3.7 and 75.8 cM on chromosome 4. jTwo QTL were falsely identified at 3.1 and 4.8 cM on chromosome 4. kHalf the difference between the estimated genotypic value of the 22 and 11 genotypes. lNot estimated.
Figure 2QTL detected by each study. In (A)-(F) the M-QTL identified, and not found, by each study are shown, in relation to the simulated average effect of allelic substitution and minor allele frequency.
Figure 3Accuracy in estimates of QTL location. In (A)-(F) the absolute difference between the estimated and simulated position for the M-QTL that were detected, is shown in relation to the simulated average effect of allelic substitution, for each study. Lines represent significant relationships (p < 0.05 in least squares linear regression) and regression equations and R2 values are given. P-values were: (A) 0.12, (B) 0.7, (C) 0.046, (D) 0.07, (E) 0.005, (F) 0.5.
Figure 4Accuracy and bias in estimates of QTL effects. In (A), a function of the square root of the estimated genetic variance of the QTL divided by the simulated genetic variance is shown. In (B)-(G), the difference between the estimated and simulated effect (average effect of allelic substitution) as a percentage of the simulated effect is shown. Potential relationships between the degree of inaccuracy and the simulated effect, minor allele frequency and percentage phenotypic variance explained by the QTL were tested by least squares linear regression for each study and the one with the lowest p-value (not dependent on outliers) is illustrated in (A)-(F). In (A), a relationship with the square of minor allele frequency and a zero intercept was fitted. Lines show significant relationships (p < 0.05) and regression equations and R2 values are given. Symbols indicate which values were overestimated and which were underestimated. (G) shows the comparable relationship with simulated effect for our multiple regression model. P-values were: (A) 0.003, (B) 0.6, (C) 0.3, (D) 0.006, (E) 0.15, (F) 0.07, (G) 0.0005.