| Literature DB >> 29234158 |
Wen-Long Ren1,2, Yang-Jun Wen1,2, Jim M Dunwell3, Yuan-Ming Zhang4,5.
Abstract
Although nonparametric methods in genome-wide association studies (GWAS) are robust in quantitative trait nucleotide (QTN) detection, the absence of polygenic background control in single-marker association in genome-wide scans results in a high false positive rate. To overcome this issue, we proposed an integrated nonparametric method for multi-locus GWAS. First, a new model transformation was used to whiten the covariance matrix of polygenic matrix K and environmental noise. Using the transferred model, Kruskal-Wallis test along with least angle regression was then used to select all the markers that were potentially associated with the trait. Finally, all the selected markers were placed into multi-locus model, these effects were estimated by empirical Bayes, and all the nonzero effects were further identified by a likelihood ratio test for true QTN detection. This method, named pKWmEB, was validated by a series of Monte Carlo simulation studies. As a result, pKWmEB effectively controlled false positive rate, although a less stringent significance criterion was adopted. More importantly, pKWmEB retained the high power of Kruskal-Wallis test, and provided QTN effect estimates. To further validate pKWmEB, we re-analyzed four flowering time related traits in Arabidopsis thaliana, and detected some previously reported genes that were not identified by the other methods.Entities:
Mesh:
Year: 2017 PMID: 29234158 PMCID: PMC5836593 DOI: 10.1038/s41437-017-0007-4
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Fig. 1A flow chart of pKWmEB method
Fig. 2Comparison of statistical powers of six simulated QTNs using five GWAS methods (pKWmEB, KWmEB, KWsBC, GEMMA, and mrMLM). a No polygenic background; b an additive polygenic variance (explaining 0.092 of the phenotypic variance); c three epistatic QTNs each explaining 0.05 of the phenotypic variance. Residual error is normal distribution with mean zero and variance 10 in (a) to (c), log-normal distribution with mean zero and standard deviation 1.144 (d), and logistic distribution with mean zero and standard deviation 1.743 (e)
Paired t tests and their P-values for power and mean squared error (MSE) between pKWmEB and each of the other four methods in the first simulation experiment
| Case | KWmEB | KWsBC | GEMMA | mrMLM | |
|---|---|---|---|---|---|
| Power | 2.58 | 0.60 | 3.65 | 1.16 | |
| 0.0495* | 0.5760 | 0.0148* | 0.2972 | ||
| MSE | −3.76 | – | −3.94 | −0.96 | |
| 0.0132* | – | 0.0110* | 0.3824 | ||
* and ** significances at the 0.05 and 0.01 levels, respectively. Results using mrMLM were derived from Wang et al. (2016).
Fig. 3Comparison of mean squared errors of each simulated QTN effect using four GWAS methods (pKWmEB, KWmEB, GEMMA, and mrMLM). The descriptions in (a) to (e) are the same as those in Fig. 2
Fig. 4Comparison of false positive rates using five GWAS methods (pKWmEB, KWmEB, KWsBC, GEMMA, and mrMLM). The descriptions in (a) to (e) are the same as those in Fig. 2
Previously reported genes that were identified only by pKWmEB
| Chr | Position (bp) | LOD | Effect | Gene | Trait | Allele with code 1 | Reference | |
|---|---|---|---|---|---|---|---|---|
| 2 | 2916675 | 4.90 | 0.062 | 0.92 |
| FT GH | A | Zhao et al. ( |
| 2 | 10574932 | 3.23 | 0.098 | 1.38 |
| FT Field | T | Izawa et al. ( |
| 4 | 17392527 | 3.05 | −0.183 | 2.03 |
| FLC | C | Huang et al. ( |
| 5 | 7372523 | 3.96 | 0.122 | 1.86 |
| FT Field | G | Li et al. ( |
| 5 | 7372523 | 3.96 | 0.122 | 1.86 |
| FT Field | G | Holt et al. ( |
The genes in this table were not detected by Atwell et al. (2010)