| Literature DB >> 29265559 |
Bo Wang1, Zhikun Wu1, Zhaohong Li1, Qinghua Zhang1, Jianlin Hu1, Yingjie Xiao1, Dongfang Cai1, Jiangsheng Wu1, Graham J King2, Haitao Li1, Kede Liu1.
Abstract
Genome-wide association studies (GWASs) combining high-throughput genome resequencing and phenotyping can accelerate the dissection of genetic architecture and identification of genes for plant complex traits. In this study, we developed a rapeseed genomic variation map consisting of 4 542 011 SNPs and 628 666 INDELs. GWAS was performed for three seed-quality traits, including erucic acid content (EAC), glucosinolate content (GSC) and seed oil content (SOC) using 3.82 million polymorphisms in an association panel. Six, 49 and 17 loci were detected to be associated with EAC, GSC and SOC in multiple environments, respectively. The mean total contribution of these loci in each environment was 94.1% for EAC and 87.9% for GSC, notably higher than that for SOC (40.1%). A high correlation was observed between phenotypic variance and number of favourable alleles for associated loci, which will contribute to breeding improvement by pyramiding these loci. Furthermore, candidate genes were detected underlying associated loci, based on functional polymorphisms in gene regions where sequence variation was found to correlate with phenotypic variation. Our approach was validated by detection of well-characterized FAE1 genes at each of two major loci for EAC on chromosomes A8 and C3, along with MYB28 genes at each of three major loci for GSC on chromosomes A9, C2 and C9. Four novel candidate genes were detected by correlation between GSC and SOC and observed sequence variation, respectively. This study provides insights into the genetic architecture of three seed-quality traits, which would be useful for genetic improvement of B. napus.Entities:
Keywords: candidate genes; genome-wide association study (GWAS); genomic variation; quantitative trait loci (QTLs); rapeseed (Brassica napus); seed-quality traits
Mesh:
Substances:
Year: 2018 PMID: 29265559 PMCID: PMC5999192 DOI: 10.1111/pbi.12873
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Figure 1Heatmap for distribution of genomic variations in rapeseed. The number of SNPs and INDELs was calculated by 100‐kb sliding window across each chromosome.
Phenotypic variation, variance components and broad‐sense heritability for all traits
| Traits | Environments | Range (Min–Max) | Mean ± SD | ‐Log10( | ANOVA | Heritability ( | |||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
| ||||||
| EAC | 2011 | 0.01–53.91 | 14.08 ± 20.04 | 18.06 | 401.37 | 0.49 | 99.96 | ||
| 2012 | 0.01–55.95 | 14.6 ± 20.65 | 17.53 | 426.02 | 1.14 | 99.91 | |||
| All | 0.04–54.82 | 14.19 ± 20.15 | 18.07 | 406.71 | 0.8 | 0.02 | 0.72 | 99.88 (99.84–99.91) | |
| GSC | 2009 | 12.27–147.17 | 51.43 ± 36.22 | 12.41 | 1321.94 | 27.09 | 99.32 | ||
| 2010 | 14.32–166.17 | 58.35 ± 42.08 | 12.98 | 1746.75 | 63.14 | 98.81 | |||
| 2011 | 13.01–131.27 | 50.30 ± 33.28 | 12.45 | 1111.06 | 14.55 | 99.57 | |||
| All | 13.78–150.19 | 53.5 ± 36.54 | 12.74 | 1354 | 35.44 | 15.26 | 42.9 | 98.67 (98.29–98.96) | |
| SOC | 2009 | 32.69–47.75 | 41.22 ± 2.60 | 1.78 | 6.1 | 1.49 | 92.47 | ||
| 2010 | 29.5–46.25 | 40.12 ± 2.62 | 3.37 | 5.44 | 2.83 | 85.22 | |||
| 2011 | 29.62–45.62 | 39.87 ± 2.88 | 4.57 | 7.51 | 2.78 | 89.02 | |||
| 2012 | 28.34–44.32 | 38.18 ± 2.89 | 2.55 | 6.84 | 3.24 | 86.37 | |||
| All | 31.65–44.99 | 39.76 ± 2.31 | 4.07 | 5.84 | 2.59 | 1.51 | 1.12 | 92.17 (90.41–93.53) | |
The BLUP values of traits across all environments.
P values of the Shapiro–Wilk test.
All variances including δG 2, δE 2 and δGE 2 were significant at p < 0.001.
Broad‐sense heritability of each traits and the 95% confidence intervals for broad‐sense heritability across environments are indicated in parentheses.
Figure 2Genome‐wide association studies of three seed‐quality traits. Manhattan plots for EAC (a), GSC (b) and SOC (c) across multiple environments. ‐log10(p) values are plotted against position on each chromosome. Dashed grey lines indicate the genome‐wide significance threshold.
Figure 3Characterization of PVE and favourable allele of associated loci for three traits. (a) The frequencies of favourable alleles and corresponding PVE. The favourable alleles of each loci for EAC and GSC were that reducing EAC and GSC in seeds, respectively. The favourable alleles of each loci for SOC were that increasing SOC. (b) Total PVE of all significant loci for three traits across each environment. Numbers on top of each bars indicated the number of loci detected in each year. (c‐e) Correlation between favourable allele number and EAC, GSC and SOC, respectively.
Figure 4Associated loci and candidate genes for EAC on chromosome A8 and C3. (a, d) Regional Manhattan plot surrounding the peak signals on chromosome A8 (a) and C3 (d) in 2011. Red dot indicates the peak signals snp1348001 (a) and snp2429000 (d). Dashed line represents the significance threshold. (b, e) Exon structure and functional variations of BnaA08g11130D (b) and BnaC03g65980D (e). Numbers indicate the positions of open reading frame from translation start site. (c, f) Boxplots for EAC based on the genotypes of BnaA08g11130D (c) and haplotypes of BnaC03g65980D (f). Differences between the genotypes or haplotypes were analysed by Wilcoxon rank‐sum test. n.s. represents not significant.
Figure 5Associated loci and candidate genes for GSC on chromosome C9 and A9. (a, d) Regional Manhattan plot surrounding the peak signals on chromosome C9 (a) and A9 (d) in 2011. Red dot indicates the peak signal indel479620 (a) and indel240140 (d). Dashed line represents the significance threshold. (b, e) Exon–intron structure and functional variation of BnaC09g05300D (b) and BnaA09g08410D (e). Numbers indicate the positions of open reading frame from translation start site. (c, f) Boxplots for GSC based on the genotypes of BnaC09g05300D (c) and BnaA09g08410D (f). Differences between the genotypes were analysed by Wilcoxon rank‐sum test.
Figure 6Associated loci and candidate genes for SOC on chromosome A10 and C4. (a, d) Regional Manhattan plot surrounding the peak signals on chromosome A10 in 2010 (a) and C4 in 2012 (d). Red dot indicates the peak signals snp1778098 (a) and snp2649172 (d). Dashed line represents the significance threshold. (b, e) Exon–intron structure and functional variations of BnaA10g23290D (b) and BnaC04g45690D (e). Numbers indicate the positions of open reading frame from translation start site. (c, f) Boxplots for SOC based on the haplotypes of BnaA10g23290D (c) and BnaC04g45690D (f). Differences between the genotypes or haplotypes were analysed by Wilcoxon rank‐sum test. n.s. represents not significant.