| Literature DB >> 26787347 |
Shi-Bo Wang1,2, Jian-Ying Feng1, Wen-Long Ren1, Bo Huang1, Ling Zhou1, Yang-Jun Wen1, Jin Zhang1, Jim M Dunwell3, Shizhong Xu4, Yuan-Ming Zhang1,2.
Abstract
Genome-wide association studies (GWAS) have been widely used in genetic dissection of complex traits. However, common methods are all based on a fixed-SNP-effect mixed linear model (MLM) and single marker analysis, such as efficient mixed model analysis (EMMA). These methods require Bonferroni correction for multiple tests, which often is too conservative when the number of markers is extremely large. To address this concern, we proposed a random-SNP-effect MLM (RMLM) and a multi-locus RMLM (MRMLM) for GWAS. The RMLM simply treats the SNP-effect as random, but it allows a modified Bonferroni correction to be used to calculate the threshold p value for significance tests. The MRMLM is a multi-locus model including markers selected from the RMLM method with a less stringent selection criterion. Due to the multi-locus nature, no multiple test correction is needed. Simulation studies show that the MRMLM is more powerful in QTN detection and more accurate in QTN effect estimation than the RMLM, which in turn is more powerful and accurate than the EMMA. To demonstrate the new methods, we analyzed six flowering time related traits in Arabidopsis thaliana and detected more genes than previous reported using the EMMA. Therefore, the MRMLM provides an alternative for multi-locus GWAS.Entities:
Mesh:
Year: 2016 PMID: 26787347 PMCID: PMC4726296 DOI: 10.1038/srep19444
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Comparison of statistical powers of six simulated QTN from three different methods of GWAS (MRMLM, RMLM and EMMA).
Panel (a) no polgenic variance was simulated. Panel (b) an additive polygenic variance (explaining 0.092 of the phenotypic variance) was simulated. Panel (c) three epistatic QTNs each explaining 0.05 of the phenotypic variance were simulated. Panel (d) powers of six simulated QTNs with an additive polygenic variance obtained from the MRMLM method under three different sample sizes (199, 149 and 99).
Figure 2Comparison of mean squared errors of six simulated QTNs from three different methods of GWAS (MRMLM, RMLM and EMMA).
Panel (a) no polgenic variance was simulated. Panel (b) an additive polygenic variance was simulated. Panel (c) three epistatic QTNs were simulated. Panel (d) mean squared errors of six simulated QTNs with an additive polygenic variance obtained from the MRMLM method under three different sample sizes (199, 149 and 99).
Figure 3Comparison of empirical Type 1 error rates from three different methods of GWAS (MRMLM, RMLM and EMMA).
Panel (a) no polgenic variance was simulated. Panel (b) an additive polygenic variance was simulated. Panel (c) three epistatic QTNs were simulated. Panel (d) empirical Type 1 error rates obtained from the MRMLM method under three different sample sizes (199, 149 and 99).
Figure 4Statistical powers of six simulated QTNs from the first simulation experiement plotted against Type 1 error (in a log10 scale) for the three GWAS methods (MRMLM, RMLM and EMMA).
Goodness of fit (log likelihood and BIC) for SNPs detected by three methods (MRMLM, RMLM and EMMA), where a lower value indicates a better fit.
| Trait | −Log likelihood value | Bayesian information criterion (BIC) | ||||
|---|---|---|---|---|---|---|
| MRMLM | RMLM | EMMA | MRMLM | RMLM | EMMA | |
| LD | −215.9 | 153.8 | 284.6 | −67.5 | 194.8 | 289.7 |
| SDV | −337.2 | −150.1 | −119.9 | −260.4 | −124.5 | −104.5 |
| SD | −367.9 | 55.3 | 113.1 | −230.5 | 70.5 | 118.2 |
| 0W | −42.4 | 94.5 | 226.2 | 21.5 | 124.0 | 226.2 |
| 2W | −140.6 | 104.4 | 216.4 | −30.1 | 134.6 | 221.4 |
| 4W | −122.4 | 24.5 | 105.3 | −55.5 | 57.9 | 114.9 |
aLD: days to flowering under long days; LDV: days to flowering under long days with vernalization; SD: days to flowering under short days; 0W: days to flowering under long days for no vernalization; 2W: days to flowering under long days for 2 weeks vernalization; 4W: days to flowering under long days for 4 weeks vernalization.
Genes detected for six flowering time traits in Arabidopsis thaliana using three methods (MRMLM, RMLM and EMMA).
| Trait | Gene detected | Chr | SNP (bp) | MRMLM | RMLM | EMMA | Reference | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LOD | Effect | r2(%) | P-value | Effect | r2(%) | P-value | Effect | r2(%) | |||||
| LD | 1 | 8128350 | 4.12 | −0.055 | 0.56 | ||||||||
| 2 | 9588685 | 4.47E-10 | −0.397 | 23.36 | 2.78E-9 | −0.407 | 24.59 | ||||||
| 2 | 9611587 | 8.83 | −0.104 | 1.67 | 7.11E-7 | −0.262 | 10.56 | ||||||
| 4 | 454542 | 4.70 | −0.066 | 0.72 | |||||||||
| 4 | 1260796 | 3.06 | −0.059 | 0.63 | |||||||||
| 4 | 8291057 | 6.18 | −0.060 | 0.66 | |||||||||
| 5 | 3188328 | 9.48 | −0.097 | 1.35 | 5.08E-7 | −0.248 | 8.73 | 8.82E-7 | −0.258 | 9.45 | |||
| LDV | 1 | 164375 | 6.30 | −0.049 | 3.37 | ||||||||
| 2 | 9588685 | 10.30 | −0.049 | 2.98 | 1.61E-6 | −0.111 | 15.43 | ||||||
| 2 | 18446546 | 5.75 | −0.053 | 2.05 | 2.66E-8 | −0.152 | 16.80 | 5.74E-8 | −0.158 | 18.15 | |||
| 5 | 18599929 | 7.25 | −0.058 | 4.77 | 2.04E-7 | −0.083 | 9.82 | 2.71E-7 | −0.087 | 10.87 | |||
| SD | 1 | 19713470 | 6.21 | −0.049 | 1.27 | ||||||||
| 2 | 2916675 | 3.83 | 0.030 | 0.53 | |||||||||
| 4 | 153402 | 6.47 | −0.042 | 1.13 | 2.46E-8 | −0.132 | 11.26 | 6.74E-8 | −0.136 | 12.00 | |||
| 4 | 458226 | 10.70 | −0.070 | 2.03 | 1.20E-6 | −0.132 | 7.18 | ||||||
| 5 | 3051259 | 6.23 | −0.037 | 0.87 | |||||||||
| 5 | 6546055 | 4.66 | −0.068 | 1.36 | |||||||||
| 0W | 3 | 21239134 | 6.24 | −0.111 | 3.13 | ||||||||
| 5 | 18592535 | 1.83E-6 | −0.274 | 12.72 | |||||||||
| 5 | 18595015 | 11.87 | −0.145 | 3.56 | |||||||||
| 2W | 1 | 2899659 | 8.11 | −0.081 | 1.55 | ||||||||
| 4 | 454542 | 7.18 | −0.089 | 1.72 | 4.30E-7 | −0.207 | 9.21 | ||||||
| 5 | 6846957 | 3.36 | −0.082 | 0.99 | |||||||||
| 4W | 1 | 8341601 | 4.65 | −0.070 | 1.34 | ||||||||
| 2 | 9588685 | 3.55E-9 | −0.354 | 28.25 | 1.97E-8 | −0.365 | 30.03 | ||||||
| 2 | 13031229 | 9.65 | −0.107 | 3.00 | |||||||||
| 5 | 6546259 | 5.94 | −0.132 | 2.75 | |||||||||
| 5 | 18264316 | 7.06 | −0.071 | 1.33 | |||||||||
| 5 | 18607728 | 10.09 | −0.164 | 4.69 | 8.13E-7 | −0.263 | 12.09 | ||||||
r2: Proportion of phenotypic variance contributed by the gene.