| Literature DB >> 24510440 |
Feng Li1, Biyun Chen1, Kun Xu1, Jinfeng Wu1, Weilin Song1, Ian Bancroft2, Andrea L Harper2, Martin Trick3, Shengyi Liu1, Guizhen Gao1, Nian Wang1, Guixin Yan1, Jiangwei Qiao1, Jun Li1, Hao Li1, Xin Xiao1, Tianyao Zhang1, Xiaoming Wu4.
Abstract
Association mapping can quickly and efficiently dissect complex agronomic traits. Rapeseed is one of the most economically important polyploid oil crops, although its genome sequence is not yet published. In this study, a recently developed 60K Brassica Infinium(®) SNP array was used to analyse an association panel with 472 accessions. The single-nucleotide polymorphisms (SNPs) of the array were in silico mapped using 'pseudomolecules' representative of the genome of rapeseed to establish their hypothetical order and to perform association mapping of seed weight and seed quality. As a result, two significant associations on A8 and C3 of Brassica napus were detected for erucic acid content, and the peak SNPs were found to be only 233 and 128 kb away from the key genes BnaA.FAE1 and BnaC.FAE1. BnaA.FAE1 was also identified to be significantly associated with the oil content. Orthologues of Arabidopsis thaliana HAG1 were identified close to four clusters of SNPs associated with glucosinolate content on A9, C2, C7 and C9. For seed weight, we detected two association signals on A7 and A9, which were consistent with previous studies of quantitative trait loci mapping. The results indicate that our association mapping approach is suitable for fine mapping of the complex traits in rapeseed.Entities:
Keywords: Brassica napus; SNP; association mapping; seed quality; seed weight
Mesh:
Substances:
Year: 2014 PMID: 24510440 PMCID: PMC4131830 DOI: 10.1093/dnares/dsu002
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Phenotypic variations for seed weight and seed quality in this B. napus panel
| Traits | Min ± SDa | Max ± SD | Mean ± SD | CV (%)b |
|---|---|---|---|---|
| Glucosinolate content (µmol/g)—Wuhan 2013 | 28.5 ± 2.0 | 142.8 ± 0.7 | 91.5 ± 0.3 | 29.4 |
| Glucosinolate content (µmol/g)—Nanchan 2013 | 30.1 ± 0.9 | 138.0 ± 3.6 | 90.1 ± 0.9 | 27.4 |
| Oil content (%)—Wuhan 2013 | 34.2 ± 0.8 | 51.4 ± 0.4 | 42.6 ± 0.3 | 5.9 |
| Oil content (%)—Nanchan 2013 | 33.4 ± 1.9 | 52.6 ± 1.5 | 43.6 ± 0.7 | 5.4 |
| Erucic acid content (%)—Wuhan 2012 | 0 | 53.7 | 22.7 | 73.4 |
| Seed weight (g)—Wuhan 2013 | 2.3 ± 0.1 | 5.9 ± 0.2 | 3.5 ± 0.0 | 14.2 |
| Seed weight(g)—Nanchan 2013 | 2.3 ± 0.1 | 5.2 ± 0.2 | 3.6 ± 0.1 | 13.8 |
aSD is an abbreviation of standard deviation, which was calculated based on the measured values of seeds from three replicated experimental blocks.
bCV is an abbreviation of coefficient of variation, which was estimated as the ratio of the standard deviation to the mean of all accessions.
Summary of the PIC values in different linkage groups of the B. napus
| Linkgae group | Number of SNPs | PICa value | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 0.05–0.1 | 0.1–0.15 | 0.15–0.2 | 0.2–0.25 | 0.25–0.3 | 0.3–0.35 | 0.35–0.4 | Average PICb | ||
| A1 | 1072 | 9 (0.8%) | 34 (3.2%) | 83 (7.7%) | 117 (10.9%) | 102 (9.5%) | 248 (23.1%) | 479 (44.7%) | 0.31 g |
| A2 | 506 | 2 (0.4%) | 34 (6.7%) | 25 (4.9%) | 46 (9.1%) | 115 (22.7%) | 130 (25.7%) | 154 (30.4%) | 0.294 cd |
| A3 | 1469 | 6 (0.4%) | 45 (3.1%) | 93 (6.3%) | 130 (8.8%) | 210 (14.3%) | 383 (26.1%) | 602 (41%) | 0.311 g |
| A4 | 1035 | 6 (0.6%) | 61 (5.9%) | 56 (5.4%) | 131 (12.7%) | 209 (20.2%) | 272 (26.3%) | 300 (29%) | 0.293 bcd |
| A5 | 1123 | 1 (0.1%) | 35 (3.1%) | 69 (6.1%) | 115 (10.2%) | 179 (15.9%) | 376 (33.5%) | 348 (31%) | 0.307 fg |
| A6 | 1099 | 3 (0.3%) | 36 (3.3%) | 58 (5.3%) | 91 (8.3%) | 168 (15.3%) | 317 (28.8%) | 426 (38.8%) | 0.312 g |
| A7 | 1427 | 11 (0.8%) | 50 (3.5%) | 70 (4.9%) | 126 (8.8%) | 217 (15.2%) | 382 (26.8%) | 571 (40%) | 0.31 g |
| A8 | 691 | 7 (1%) | 35 (5.1%) | 38 (5.5%) | 40 (5.8%) | 53 (7.7%) | 143 (20.7%) | 375 (54.3%) | 0.319 h |
| A9 | 1225 | 9 (0.7%) | 45 (3.7%) | 39 (3.2%) | 68 (5.6%) | 190 (15.5%) | 261 (21.3%) | 613 (50%) | 0.32 h |
| A10 | 805 | 8 (1%) | 33 (4.1%) | 38 (4.7%) | 72 (8.9%) | 104 (12.9%) | 193 (24%) | 357 (44.3%) | 0.312 g |
| C1 | 2012 | 1 (0%) | 73 (3.6%) | 124 (6.2%) | 238 (11.8%) | 298 (14.8%) | 1074 (53.4%) | 204 (10.1%) | 0.298 cde |
| C2 | 1292 | 2 (0.2%) | 23 (1.8%) | 54 (4.2%) | 116 (9%) | 397 (30.7%) | 321 (24.8%) | 379 (29.3%) | 0.3 de |
| C3 | 2201 | 29 (1.3%) | 77 (3.5%) | 159 (7.2%) | 307 (13.9%) | 352 (16%) | 516 (23.4%) | 761 (34.6%) | 0.298 cde |
| C4 | 2104 | 32 (1.5%) | 94 (4.5%) | 163 (7.7%) | 181 (8.6%) | 351 (16.7%) | 560 (26.6%) | 723 (34.4%) | 0.301 ef |
| C5 | 719 | 2 (0.3%) | 32 (4.5%) | 59 (8.2%) | 105 (14.6%) | 110 (15.3%) | 121 (16.8%) | 290 (40.3%) | 0.298 cde |
| C6 | 1539 | 28 (1.8%) | 52 (3.4%) | 101 (6.6%) | 136 (8.8%) | 403 (26.2%) | 417 (27.1%) | 402 (26.1%) | 0.296 cde |
| C7 | 2006 | 44 (2.2%) | 591 (29.5%) | 52 (2.6%) | 185 (9.2%) | 156 (7.8%) | 587 (29.3%) | 391 (19.5%) | 0.254 a |
| C8 | 1096 | 5 (0.5%) | 23 (2.1%) | 40 (3.6%) | 362 (33%) | 90 (8.2%) | 229 (20.9%) | 347 (31.7%) | 0.292 bc |
| C9 | 835 | 14 (1.7%) | 18 (2.2%) | 78 (9.3%) | 97 (11.6%) | 227 (27.2%) | 192 (23%) | 209 (25%) | 0.287 b |
aPIC is an abbreviation of polymorphism information content.
bValues followed by different letters in this column are significantly different at the level of 0.05.
Figure 1.Analysis of the population structure of 472 rapeseed accessions by STRUCTURE. (a) Estimated LnP(D) of possible clusters (k) from 1 to 10; (b) Δk based on the rate of change of LnP(D) between successive k; (c) population structure based on k = 3. Each individual is represented by a vertical bar, partitioned into coloured segments with the length of each segment representing the proportion of the individual's genome. A given group is represented: Red, Group 1; Green, Group 2; Purple, Group 3.
Figure 2.Quantile–quantile plots of estimated −log10(p) from association analysis using six methods for four traits: (a) Erucic acid content; (b) glucosinolate content; (c) oil content and (d) seed weight. The black line is the expected line under the null distribution, and the deviations from expectation indicate that the statistical analysis may cause spurious associations. The horizontal dashed green lines indicates genome-wide significance threshold −log10(p) = 5.7.
Figure 3.Manhattan plots of association analysis using the Q + K model for four traits: (a) erucic acid content; (b) glucosinolate content; (c) oil content and (d) seed weight. Each dot represents a SNP. The horizontal dashed green line represents the Bonferroni-corrected significance threshold −log10(p) = 5.7.
Genome-wide significant association signals of seed quality and seed weight
| Trait | SNP | Linkage group | Position in ‘pseudomolecules’ (kb) | Major allele | Minor allele | Minor allele frequency | −log10 ( | Contributiona (%) | Candidate gene | Position of candidate genes in ‘pseudomolecules' (kb) |
|---|---|---|---|---|---|---|---|---|---|---|
| Erucic acid content | Bn-A08-p12599446 | A8 | 9514 | T | C | 0.432 | 12.0 | 12.27 | A8: 9281 | |
| Bn-scaff_15794_3-p29807 | C3 | 63 672 | A | C | 0.446 | 8.0 | 7.9 | C1: 63 800 | ||
| Glucosinolate content | Bn-A01-p9004629 | A9 | 3210 | T | C | 0.314 | 23.6 | 25.17 | A9: 3426 | |
| Bn-scaff_26086_1-p11779 | C2 | 50 098 | G | A | 0.292 | 12.7 | 13.19 | C2: 49 620 | ||
| Bn-scaff_16534_1-p2070156b | C4 | 6327 | T | C | 0.084 | 10.1 | 5 | |||
| Bn-scaff_15705_1-p1628802 | C7 | 39 871 | G | A | 0.284 | 6.2 | 6.06 | C7: 40 676 | ||
| Bn-scaff_17526_1-p1140588 | C9 | 2823 | C | T | 0.335 | 10.5 | 12.26 | C9: 4725 | ||
| Oil content | Bn-A08-p12599446 | A8 | 9514 | T | C | 0.432 | 6.1 | 6.22 | A8: 9281 | |
| Seed weight | Bn-A10-p12639538b | A7 | 2674 | T | G | 0.265 | 6.2 | 4.9 | ||
| Bn-A09-p30654305 | A9 | 34 653 | A | C | 0.275 | 11.4 | 13.87 |
aOnly the values estimated by the Q + K model were shown, if the SNPs were detected by both Q + K and PCA models.
bAssociation signals identified only by the PCA model.