| Literature DB >> 26345334 |
Haiming Xu1, Beibei Jiang1, Yujie Cao1, Yingxin Zhang2, Xiaodeng Zhan2, Xihong Shen2, Shihua Cheng2, Xiangyang Lou3, Liyong Cao2.
Abstract
With development of sequencing technology, dense single nucleotide polymorphisms (SNPs) have been available, enabling uncovering genetic architecture of complex traits by genome-wide association study (GWAS). However, the current GWAS strategy usually ignores epistatic and gene-environment interactions due to absence of appropriate methodology and heavy computational burden. This study proposed a new GWAS strategy by combining the graphics processing unit- (GPU-) based generalized multifactor dimensionality reduction (GMDR) algorithm with mixed linear model approach. The reliability and efficiency of the analytical methods were verified through Monte Carlo simulations, suggesting that a population size of nearly 150 recombinant inbred lines (RILs) had a reasonable resolution for the scenarios considered. Further, a GWAS was conducted with the above two-step strategy to investigate the additive, epistatic, and gene-environment associations between 701,867 SNPs and three important quality traits, gelatinization temperature, amylose content, and gel consistency, in a RIL population with 138 individuals derived from super-hybrid rice Xieyou9308 in two environments. Four significant SNPs were identified with additive, epistatic, and gene-environment interaction effects. Our study showed that the mixed linear model approach combining with the GPU-based GMDR algorithm is a feasible strategy for implementing GWAS to uncover genetic architecture of crop complex traits.Entities:
Mesh:
Year: 2015 PMID: 26345334 PMCID: PMC4539430 DOI: 10.1155/2015/135782
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Estimates of SNP additive effects and interaction effects with the environments under three different population structures (70%).
| Chr | SNP ID | Pop |
|
|
| Power | |||
|---|---|---|---|---|---|---|---|---|---|
| Par. | Est. | Par. | Est. | Par. | Est. | ||||
| 1 | 28 | 100 | −3.24 | −3.32 | 2.65 | 2.67 | −2.65 | −2.46 | 99.00 |
| 150 | −3.39 | 2.81 | −2.51 | 100.00 | |||||
| 200 | −3.42 | 2.81 | −2.45 | 100.00 | |||||
|
| |||||||||
| 2 | 100 | 100 | −2.65 | −2.79 | 4.05 | 4.16 | −4.05 | −3.79 | 100.00 |
| 150 | −2.90 | 4.26 | −3.75 | 100.00 | |||||
| 200 | −2.94 | 4.34 | −3.73 | 100.00 | |||||
|
| |||||||||
| 3 | 93 | 100 | −1.77 | −1.86 | 3.24 | 3.34 | −3.24 | −3.17 | 87.00 |
| 150 | −1.84 | 3.34 | −3.14 | 98.50 | |||||
| 200 | −1.90 | 3.34 | −3.09 | 100.00 | |||||
Chr: the ordinal number for simulated chromosome; Pop: the population size; Power: the percentage of the detected SNP with significant effect at 0.05 levels; Par.: the true value of parameter in simulations; Est.: the estimate of parameter; a: additive effect; ae 1 and ae 2: the interaction effect of additive with environment 1 and 2, respectively.
Estimates of epistasis and interaction effects with the environments under two different heritabilities.
| Chr. | SNPID. | Chr. | SNPID. |
|
|
| Power | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Par. | Est. | Par. | Est. | Par. | Est. | |||||||||
| I | II | I | II | I | II | I | II | |||||||
| 2 | 44 | 3 | 63 | 0.39 | 0.38 | 0.38 | 0.17 | 0.18 | 0.17 | −0.17 | −0.18 | −0.16 | 100.00 | 99.50 |
Power, Par., and Est. have the same definitions as those in Table 1; aa, aae 1, and aae 2 have the same definitions as those in Table 2.
Estimates of epistasis and interaction effects with the environments under three different population structures (70%).
| Chr. | SNPID. | Chr. | SNPID. | Pop |
|
|
| Power | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Par. | Est. | Par. | Est. | Par. | Est. | ||||||
| 2 | 44 | 3 | 63 | 100 | 3.86 | 3.42 | 4.47 | 4.87 | −4.47 | −3.99 | 100.00 |
| 150 | 3.24 | 5.05 | −3.90 | 100.00 | |||||||
| 200 | 3.21 | 5.12 | −3.79 | 99.50 | |||||||
aa: additive-additive epistasis effect; aae 1 and aae 2: the environment-specific additive-additive epistasis effect; Pop, Power, Par., and Est. have same definitions as those in Table 1.
Estimates of SNP additive effects and interaction effects with the environments under two different heritabilities.
| Chr | SNPID |
|
|
| Power | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Par. | Est. | Par. | Est. | Par. | Est. | |||||||
| I | II | I | II | I | II | I | II | |||||
| 1 | 28 | −0.79 | −0.81 | −0.83 | 0.63 | 0.65 | 0.67 | −0.63 | −0.60 | −0.58 | 100.00 | 100.00 |
| 2 | 100 | −0.67 | −0.67 | −0.68 | 0.39 | 0.38 | 0.39 | −0.39 | −0.37 | −0.37 | 100.00 | 100.00 |
| 3 | 93 | −0.40 | −0.41 | −0.41 | 0.32 | 0.32 | 0.32 | −0.32 | −0.31 | −0.31 | 69.00 | 70.00 |
I and II stand for two different heritabilities of 50% and 70%, respectively, which are the proportions of total phenotypic variation ascribed to SNP additive effects and additive-environment interaction effects; Power, Par., Est., a, ae 1, and ae 2 have the same definitions as those in Table 1.
Detected SNPs with significant genetic effects.
| Trait | QTS | Chr. | Allele | Effect | Predict | −log10( |
| Total |
|---|---|---|---|---|---|---|---|---|
| AC | rs1610021 | 6 | G/A |
| −2.20 | 26.94 | 16.89 | 68.67 |
| rs1644460 | 6 | C/T |
| −3.85 | 79.97 | 51.78 | ||
|
| ||||||||
| GC | rs1644460 | 6 | C/T |
| 15.02 | 61.32 | 52.15 | 58.58 |
|
| −3.48 | 2.52 | 2.80 | |||||
|
| 3.47 | 2.11 | 2.80 | |||||
| rs919289 | 7 | C/G |
| −3.96 | 4.97 | 3.63 | ||
|
| ||||||||
| GT | rs1289107 | 6 | G/A |
| −0.54 | 25.06 | 24.00 | 52.01 |
|
| 0.25 | 3.49 | 5.00 | |||||
|
| −0.25 | 2.99 | 5.00 | |||||
| rs1644460 | 6 | C/T |
| −0.35 | 10.73 | 9.81 | ||
|
| −0.30 | 5.01 | 7.40 | |||||
|
| 0.30 | 4.05 | 7.40 | |||||
| rs1289107 and rs1644460 | 6 | G/A |
| 0.21 | 4.33 | 3.60 | ||
|
| −0.16 | 1.76 | 2.20 | |||||
| 6 | C/T |
| 0.17 | 1.61 | 2.20 | |||
AC: amylose content; GC: gel consistency; GT: gelatinization temperature; e 1: environment 1; e 2: environment 2; a: additive effect; ae 1 and ae 2: environment-specific additive effect; aae 1 and aae 2: environment-specific additive-additive epistasis effect. −log10(P) = −log10(P-value). h 2 (%) = heritability (%) due to the genetic component effect.