| Literature DB >> 35845656 |
Chuanji Zhao1,2, Meili Xie2, Longbing Liang1, Li Yang2,3, Hongshi Han1, Xinrong Qin1, Jixian Zhao1, Yan Hou1, Wendong Dai1, Caifu Du1, Yang Xiang1, Shengyi Liu2, Xianqun Huang1.
Abstract
Rapeseed, an allotetraploid oil crop, provides vegetable oil for human consumption. The growing demand for oilseeds has necessitated the development of rapeseed varieties with improved quality. Therefore, a clear understanding of the genetic basis underlying the seed oil content (SOC) is required. In this study, a natural population comprising 204 diverse accessions and recombinant inbred lines (RILs) derived from Brassica napus and Sinapis alba via distant hybridization were collected for genome-wide association analysis (GWAS) and quantitative trait loci (QTL) mapping of the SOC trait, respectively. The variable coefficient of the RIL and natural populations ranged from 7.43 to 10.43% and 8.40 to 10.91%. Then, a high-density linkage map was constructed based on whole genome re-sequencing (WGS); the map harbored 2,799 bin markers and covered a total distance of 1,835.21 cM, with an average marker interval of 0.66 cM. The QTLs for SOC on chromosome A07 were stably detected in both single and multiple environments. Finally, a novel locus qA07.SOC was identified as the major QTL for SOC based on the GWAS and RIL populations. In addition, the RNA-seq results showed that photosynthesis, lipid biosynthesis proteins, fatty acid metabolism, and unsaturated fatty acid biosynthesis were significantly different between the developed seeds of the two parents of the RIL population. By comparing the variation information and expression levels of the syntenic genes within qA07.SOC and its syntenic genomic regions, as well as through haplotype analysis via GWAS, BnaA07.STR18, BnaA07.NRT1, and BnaA07g12880D were predicted as candidate genes in the qA07.SOC interval. These stable QTLs containing candidate genes and haplotypes can potentially provide a reliable basis for marker-assisted selection in B. napus breeding for SOC.Entities:
Keywords: Brassica napus; GWAS; QTL mapping; distant hybridization; seed oil content; syntenic gene
Year: 2022 PMID: 35845656 PMCID: PMC9283957 DOI: 10.3389/fpls.2022.929197
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Phenotypic variations in the recombinant inbred line (RIL) and natural population.
| Population Type | Environment | Min | Max | Mean |
|
| Var | Kurtosis | Skewness | CV (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| RIL | 2012–2013 GY | 26.4 | 45.83 | 38.78 | 0.35 | 4.06 | 16.52 | 0.895 | −0.883 | 10.47 |
| 2013–2014 QH | 27.93 | 45.42 | 36.49 | 0.27 | 3.31 | 10.95 | −0.210 | 0.022 | 9.07 | |
| 2014–2015 CS | 34.01 | 47.74 | 41.56 | 0.3 | 3.29 | 10.8 | −0.592 | −0.498 | 7.91 | |
| 2015–2016 GY | 25.01 | 42.1 | 36.2 | 0.2 | 2.69 | 7.21 | 1.898 | −0.791 | 7.43 | |
| 2015–2016 CS | 28.72 | 41.71 | 35.53 | 0.24 | 2.72 | 7.4 | −0.317 | −0.245 | 7.66 | |
| 2016–2017 CS | 24.07 | 48.12 | 39.84 | 0.29 | 3.49 | 12.2 | 1.872 | −0.589 | 8.76 | |
| 2017–2018 TT | 31.06 | 49.03 | 38.32 | 0.25 | 3.07 | 9.41 | 0.611 | 0.292 | 8.02 | |
| 2017–2018 JS | 21.67 | 46.74 | 38.87 | 0.27 | 3.29 | 10.84 | 4.842 | −1.237 | 8.46 | |
| 2018–2019 CS | 29.94 | 43.52 | 37 | 0.23 | 2.81 | 7.89 | −0.343 | −0.123 | 7.59 | |
| GWAS | 2014–2015 CS | 32.02 | 50.77 | 42.24 | 0.27 | 3.82 | 14.62 | −0.316 | 0.081 | 9.04 |
| 2015–2016 CS | 33.50 | 49.01 | 41.17 | 0.24 | 3.46 | 11.96 | −0.733 | 0.092 | 8.40 | |
| 2016–2017 CS | 29.24 | 53.39 | 41.53 | 0.32 | 4.53 | 20.53 | −0.169 | 0.207 | 10.91 | |
| BLUP | 33.69 | 49.29 | 40.60 | 0.22 | 3.20 | 10.26 | −0.286 | 0.547 | 7.88 |
Env, environment; Min, minimum value; Max, maximum value; Mean, mean value; SE, standard error; SD, standard deviation; Var, variance; and CV, coefficient of variation.
Figure 1Genome-wide association analysis (GWAS) for the seed oil content (SOC) trait in consecutive 3 years based on Mixed Linear Model (MLM) of Efficient Mixed-Model Association eXpedited (EMMAX). (A) The phenotype distributions of SOC in consecutive 3 years and best linear unbiased prediction (BLUP). (B) GWAS and Q-Q plot for SOC in consecutive 3 years and BLUP.
The statistics of SNP number on each chromosome.
| Chromosome | SNP-number | Chromosome length | SNP/kb |
|---|---|---|---|
| A01 | 107,089 | 23,267,856 | 4.6 |
| A02 | 105,683 | 24,793,737 | 4.26 |
| A03 | 165,992 | 29,767,490 | 5.58 |
| A04 | 113,531 | 19,151,660 | 5.93 |
| A05 | 147,616 | 23,067,598 | 6.4 |
| A06 | 145,427 | 24,396,386 | 5.96 |
| A07 | 142,496 | 24,006,521 | 5.94 |
| A08 | 89,148 | 18,961,941 | 4.7 |
| A09 | 158,319 | 33,865,340 | 4.67 |
| A10 | 107,570 | 17,398,227 | 6.18 |
| C01 | 128,195 | 38,829,317 | 3.3 |
| C02 | 113,683 | 46,221,804 | 2.46 |
| C03 | 185,302 | 60,573,394 | 3.06 |
| C04 | 141,790 | 48,930,237 | 2.9 |
| C05 | 85,975 | 43,185,227 | 1.99 |
| C06 | 91,498 | 37,225,952 | 2.46 |
| C07 | 129,098 | 44,770,477 | 2.88 |
| C08 | 103,876 | 38,477,087 | 2.7 |
| C09 | 78,593 | 48,508,220 | 1.62 |
Figure 2Population structure of Brassica napus accessions. (A) The value of cross−validation error in different K value. (B) Model-based group analysis in K = 4. The y axis quantifies clusters membership, and the x axis lists the different B. napus accessions. (C) Genome-wide and sub-genome average LD decay estimated. (D) Relative kinship of 204 B. napus accessions.
SNP position with significant association for SOC in MLM of EMMAX.
| Chromosome | Physical position | Environment | |
|---|---|---|---|
| A07 | 11,440,139 | 1.21 × 10−6 | 2014_2015 CS |
| A07 | 11,440,137 | 1.74 × 10−5 | 2014_2015 CS |
| A07 | 11,343,133 | 4.21 × 10−5 | 2014_2015 CS |
| A07 | 11,411,162 | 2.68 × 10−7 | 2015_2016 CS |
| A07 | 11,630,281 | 3.85 × 10−7 | 2015_2016 CS |
| A07 | 11,630,282 | 4.16 × 10−7 | 2015_2016 CS |
| A07 | 11,630,292 | 2.82 × 10−7 | 2015_2016 CS |
| A07 | 11,632,013 | 2.52 × 10−7 | 2015_2016 CS |
| A07 | 11,632,726 | 4.04 × 10−7 | 2015_2016 CS |
| A07 | 11,774,647 | 3.67 × 10−7 | 2015_2016 CS |
| A07 | 11,778,813 | 3.71 × 10−7 | 2015_2016 CS |
| A07 | 11,428,788 | 3.57 × 10−7 | 2016_2017 CS |
| A07 | 11,428,860 | 3.07 × 10−7 | 2016_2017 CS |
| A07 | 11,440,137 | 4.26 × 10−7 | 2016_2017 CS |
| A07 | 11,575,763 | 3.16 × 10−7 | 2016_2017 CS |
| C08 | 7,894,416 | 1.35 × 10−7 | 2016_2017 CS |
| A07 | 11,440,137 | 6.02 × 10−7 | BLUP |
| A07 | 11,440,139 | 8.21 × 10−7 | BLUP |
Figure 3High-density genetic map illustration and collinearity analysis between the genetic map and the genome. (A) Distribution of Bin markers on 19 linkage groups. Horizontal black lines on each linkage group represent mapped Bin markers. Vertical lines of each linkage group represent the total genetic distance (cM) of the linkage group. The linkage group ID is shown on the x-axis, and the genetic distance is shown on the y-axis. (B) The x-axis is the genetic distance of each linkage group. The y-axis is the physical distance of each chromosome, with the collinearity of genomic markers and genetic maps represented in scatter. Different colors represent different chromosomes or linkage groups.
Characteristics of the high-density genetic map.
| Linkage group | Total bin marker | Total distance (cM) | Average distance (cM) | Gaps < 5 cM | Max Gap (cM) |
|---|---|---|---|---|---|
| A01 | 128 | 84.23 | 0.66 | 97.64% | 13.61 |
| A02 | 178 | 80.41 | 0.45 | 98.87% | 18.27 |
| A03 | 240 | 114.82 | 0.48 | 99.58% | 8.48 |
| A04 | 231 | 97.09 | 0.42 | 99.57% | 8.49 |
| A05 | 103 | 110.12 | 1.07 | 95.10% | 6.81 |
| A06 | 140 | 100.92 | 0.72 | 96.40% | 17.02 |
| A07 | 137 | 111.1 | 0.81 | 94.85% | 13.28 |
| A08 | 127 | 97.91 | 0.77 | 100.00% | 2.07 |
| A09 | 173 | 90.96 | 0.53 | 98.26% | 19.59 |
| A10 | 120 | 81.33 | 0.68 | 96.64% | 11.57 |
| C01 | 155 | 85.23 | 0.55 | 99.35% | 6.38 |
| C02 | 125 | 100.7 | 0.81 | 97.58% | 17.97 |
| C03 | 122 | 106.29 | 0.87 | 97.52% | 15.23 |
| C04 | 115 | 68.15 | 0.59 | 99.12% | 15.28 |
| C05 | 169 | 110.94 | 0.66 | 97.62% | 21.7 |
| C06 | 167 | 145.79 | 0.87 | 93.98% | 21.34 |
| C07 | 131 | 77.07 | 0.59 | 97.69% | 17.02 |
| C08 | 139 | 79.85 | 0.57 | 97.10% | 14.41 |
| C09 | 99 | 92.3 | 0.93 | 92.86% | 9.62 |
| Total | 2,799 | 1,835.21 | 0.66 | 97.35% |
Figure 4Quantitative trait loci (QTLs) identification for SOC in single and multi-environments. (A) The various colors represented the QTL identified in different single environments. (B) Co-localization analysis of the major QTLs identified in single and multi-environments on linkage group A07. ME represented multi-environments; SE represented single environment.
Major QTLs for SOC identified in single environment.
| QTL | Chr | Position (cM) | LOD | Interval (cM) | Add | PVE (%) | Environment |
|---|---|---|---|---|---|---|---|
|
| A02 | 48.8 | 3.4 | 46.8–51.9 | −1.7007 | 8.49 | 2012–2013 GY |
|
| A09 | 46.2 | 3.8 | 46.1–47.6 | −3.0434 | 9.72 | 2012–2013 GY |
|
| C08 | 38.2 | 4.7 | 37.4–44 | −1.3912 | 11.23 | 2013–2014 QH |
|
| A07 | 2.1 | 3.6 | 0.6–3 | −0.9701 | 15.07 | 2015–2016 GY |
|
| A07 | 16.5 | 6.9 | 15.8–38.8 | 1.404 | 11.72 | 2015–2016 GY |
|
| A03 | 83.8 | 4 | 82.4–86.7 | −0.9741 | 11.07 | 2015–2016 CS |
|
| C08 | 2.3 | 4.4 | 0–9.5 | −1.0376 | 13.79 | 2015–2016 CS |
|
| A05 | 0.3 | 3.7 | 0–1.3 | −0.9162 | 8.11 | 2017–2018 TT |
|
| A06 | 86.5 | 3.4 | 85.3–86.9 | −0.8491 | 7.97 | 2018–2019 CS |
SOC, seed oil content; Chr, Chromosome; LOD, logarithm of the odds; PVE, phenotypic variation explained; and Add, additive effect. QTL was named by the consensus QTL nomenclature, and SE represented single environment.
Figure 5RNA-seq analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis of differentially expressed genes (DEGs). (A) Spearman correlation plot in developed seeds of GRD328 (high seed oil content) and GRG2462 (low seed oil content). (B,C) Heatmap represented that the expression levels of the DEGs based on the RNA-seq was verified by quantitative real-time PCR (qRT-PCR). DEGs 1–9 were upregulated in GRD328, and DEGs 11–19 were downregulated in GRD328. The bars represented the normalized transformed counts of qRT-PCR values (B) and FPKM (C), respectively. (D) The statistical analysis of DEGs in developed seeds between two parents. (E) KEGG enrichment analysis of DEGs.
Figure 6The candidate genes prediction by comparative analysis. (A) The variations were identified in qA07.SOC and the syntenic regions. (B) The syntenic analysis for qA07.SOC. (C) Heatmap displays the expression levels of the candidate genes and the syntenic genes. The bars represented FPKM normalized transformed counts. (D) Haplotype analysis based on GWAS in BLUP.