| Literature DB >> 28295030 |
J Zhang1, J-Y Feng1, Y-L Ni1, Y-J Wen1, Y Niu1, C L Tamba1, C Yue1, Q Song2, Y-M Zhang1,3.
Abstract
Multilocus genome-wide association studies (GWAS) have become the state-of-the-art procedure to identify quantitative trait nucleotides (QTNs) associated with complex traits. However, implementation of multilocus model in GWAS is still difficult. In this study, we integrated least angle regression with empirical Bayes to perform multilocus GWAS under polygenic background control. We used an algorithm of model transformation that whitened the covariance matrix of the polygenic matrix K and environmental noise. Markers on one chromosome were included simultaneously in a multilocus model and least angle regression was used to select the most potentially associated single-nucleotide polymorphisms (SNPs), whereas the markers on the other chromosomes were used to calculate kinship matrix as polygenic background control. The selected SNPs in multilocus model were further detected for their association with the trait by empirical Bayes and likelihood ratio test. We herein refer to this method as the pLARmEB (polygenic-background-control-based least angle regression plus empirical Bayes). Results from simulation studies showed that pLARmEB was more powerful in QTN detection and more accurate in QTN effect estimation, had less false positive rate and required less computing time than Bayesian hierarchical generalized linear model, efficient mixed model association (EMMA) and least angle regression plus empirical Bayes. pLARmEB, multilocus random-SNP-effect mixed linear model and fast multilocus random-SNP-effect EMMA methods had almost equal power of QTN detection in simulation experiments. However, only pLARmEB identified 48 previously reported genes for 7 flowering time-related traits in Arabidopsis thaliana.Entities:
Mesh:
Year: 2017 PMID: 28295030 PMCID: PMC5436030 DOI: 10.1038/hdy.2017.8
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Figure 1Average powers in the detection of QTNs (a) and average of mean squared errors in the estimation of QTN effects (b) across six simulated QTNs using pLARmEB, LARmEB, EMMA, FASTmrEMMA, mrMLM and BhGLM.
Figure 2Statistical powers of six simulated QTNs in the first simulation experiment plotted against false positive rate (in a log10 scale) for pLARmEB, LARmEB, EMMA, FASTmrEMMA, mrMLM and BhGLM.
AIC and BIC values for the regression of significantly associated SNPs on each Arabidopsis flowering time trait using pLARmEB, EMMA, FASTmrEMMA and mrMLM
| LD | 63.53 | 289.74 | 263.56 | 260.60 | −26.90 | 286.62 | 201.20 | 195.12 |
| LDV | −306.01 | −104.50 | −157.79 | −142.31 | −380.99 | −113.87 | −198.40 | −176.67 |
| SD | −118.34 | 118.17 | 48.55 | 31.26 | −251.10 | 115.08 | 2.24 | −42.84 |
| SDV | −155.98 | 90.55 | 124.10 | −96.31 | −269.53 | 75.20 | 78.07 | −148.49 |
| FT10 | −390.40 | 28.18 | −99.08 | −216.17 | −514.58 | 24.92 | −164.44 | −281.52 |
| FT16 | −6.09 | 222.04 | 189.81 | 192.32 | −84.40 | 218.78 | 144.13 | 127.06 |
| FT22 | 182.71 | 332.36 | 283.04 | 235.13 | 120.72 | 329.10 | 230.84 | 160.09 |
Abbreviations: AIC, Akaike information criterion; BIC, Bayesian information criterion; EMMA, efficient mixed model association; FASTmrEMMA, fast multi-locus random-SNP-effect EMMA; FT10, FT16 and FT22, days to flowering at 10, 16 and 22 °C, respectively; LD, days to flowering under long days; LDV, days to flowering under long days with vernalization; mrMLM, multilocus random-SNP-effect mixed linear model; pLARmEB, polygenic-background-control-based least angle regression plus empirical Bayes; SD, days to flowering under short days; SDV, days to flowering under short days with vernalization; SNP, single-nucleotide polymorphism.
The previously reported genes for seven flowering time traits in Arabidopsis that were detected only by pLARmEB
| P | r | P | r | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LD | 3 | 21 079 518 | 1.52E−03 | −0.040 | 2.18 | 0.38 | SDV | 2 | 8 516 520 | 6.65E−05 | 0.030 | 3.45 | 0.43 | ||
| 5 | 3594 757 | 7.54E−05 | −0.028 | 3.40 | 0.23 | 2 | 13 853 405 | 4.60E−08 | −0.066 | 6.49 | 3.29 | ||||
| 5 | 25 783 160 | 7.64E−05 | 0.021 | 3.40 | 0.12 | 4 | 7 586 463 | 3.19E−10 | −0.071 | 8.59 | 2.07 | ||||
| LDV | 1 | 25 525 403 | 8.71E−08 | 0.039 | 6.22 | 3.40 | 5 | 239 433 | 8.39E−07 | 0.036 | 5.27 | 0.90 | |||
| 2 | 8 516 520 | 7.37E−09 | 0.048 | 7.26 | 3.29 | 5 | 5 526 925 | 4.30E−04 | 0.024 | 2.69 | 0.53 | ||||
| 3 | 2 215 112 | 5.69E−06 | 0.029 | 4.47 | 1.99 | 5 | 18 607 728 | 1.23E−03 | −0.014 | 2.27 | 0.13 | ||||
| SD | 1 | 192 020 | 1.88E−06 | 0.029 | 4.93 | 0.52 | FT10 | 1 | 22 619 960 | 9.12E−06 | 0.012 | 4.28 | 0.56 | ||
| 1 | 25 532 914 | 7.90E−13 | 0.036 | 11.14 | 1.18 | 2 | 134 343 | 1.03E−05 | −0.013 | 4.22 | 0.71 | ||||
| 2 | 2 910 430 | 3.63E−10 | −0.036 | 8.53 | 1.71 | 2 | 1 076 833 | 1.30E−05 | 0.006 | 4.13 | 0.15 | ||||
| 2 | 9 588 685 | 1.00E−16 | −0.072 | 16.74 | 5.41 | 2 | 8 124 967 | 1.98E−04 | 0.019 | 3.01 | 0.81 | ||||
| 2 | 11 931 686 | 4.95E−14 | 0.041 | 12.32 | 2.25 | 3 | 17 653 089 | 9.62E−08 | −0.015 | 6.18 | 0.46 | ||||
| 3 | 286 197 | 1.29E−04 | −0.017 | 3.18 | 0.32 | 4 | 518 797 | 1.47E−07 | −0.024 | 6.00 | 2.35 | ||||
| 3 | 10 816 150 | 2.21E−08 | −0.049 | 6.80 | 1.50 | 4 | 16 017 869 | 6.61E−05 | −0.006 | 3.46 | 0.08 | ||||
| 3 | 20 477 225 | 1.49E−03 | 0.011 | 2.19 | 0.16 | FT16 | 2 | 882 256 | 1.48E−03 | −0.022 | 2.19 | 0.29 | |||
| 4 | 268 809 | 2.95E−06 | 0.013 | 4.74 | 0.14 | 3 | 21 079 518 | 3.24E−04 | −0.043 | 2.81 | 1.01 | ||||
| 4 | 1 371 766 | 1.32E−06 | 0.023 | 5.08 | 0.66 | 4 | 500 090 | 5.46E−10 | −0.090 | 8.36 | 3.17 | ||||
| 5 | 18 611 542 | 1.00E−07 | 0.015 | 6.16 | 0.22 | FT22 | 1 | 19 629 918 | 3.00E−06 | −0.087 | 4.74 | 2.24 | |||
| 5 | 24 008 772 | 8.91E−06 | 0.010 | 4.28 | 0.07 | 1 | 26 869 825 | 8.33E−07 | 0.118 | 5.27 | 1.90 | ||||
| 5 | 25 347 883 | 2.62E−04 | 0.002 | 2.89 | 0.002 | 4 | 16 310 486 | 4.26E−04 | 0.064 | 2.70 | 0.76 |
Abbreviations: Chr., chromosome; LOD, logarithm (base 10) of odds; pLARmEB, polygenic-background-control-based least angle regression plus empirical Bayes; SNP, single-nucleotide polymorphism.
Trait abbreviations are the same as those in Table 1.
Figure 3Comparison of least angle regression (a), empirical Bayes (b) and pLARmEB (c) in the estimation of QTN effects in one random sample of the first simulation experiment.