| Literature DB >> 35915786 |
Jon White1,2, Rajiv Sharma3, David Balding2,4, James Cockram1, Ian J Mackay3.
Abstract
Association mapping using crop cultivars allows identification of genetic loci of direct relevance to breeding. Here, 150 U.K. wheat (Triticum aestivum L.) cultivars genotyped with 23,288 single nucleotide polymorphisms (SNPs) were used for genome-wide association studies (GWAS) using historical phenotypic data for grain protein content, Hagberg falling number (HFN), test weight, and grain yield. Power calculations indicated experimental design would enable detection of quantitative trait loci (QTL) explaining ≥20% of the variation (PVE) at a relatively high power of >80%, falling to 40% for detection of a SNP with an R2 ≥ .5 with the same QTL. Genome-wide association studies identified marker-trait associations for all four traits. For HFN (h 2 = .89), six QTL were identified, including a major locus on chromosome 7B explaining 49% PVE and reducing HFN by 44 s. For protein content (h 2 = 0.86), 10 QTL were found on chromosomes 1A, 2A, 2B, 3A, 3B, and 6B, together explaining 48.9% PVE. For test weight, five QTL were identified (one on 1B and four on 3B; 26.3% PVE). Finally, 14 loci were identified for grain yield (h 2 = 0.95) on eight chromosomes (1A, 2A, 2B, 2D, 3A, 5B, 6A, 6B; 68.1% PVE), of which five were located within 16 Mbp of genetic regions previously identified as under breeder selection in European wheat. Our study demonstrates the utility of exploiting historical crop datasets, identifying genomic targets for independent validation, and ultimately for wheat genetic improvement.Entities:
Year: 2022 PMID: 35915786 PMCID: PMC9314726 DOI: 10.1002/csc2.20692
Source DB: PubMed Journal: Crop Sci ISSN: 0011-183X Impact factor: 2.763
The volume and structure of historical winter wheat trials data
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| Yield | 74,332 | 32 | 1,219 | 1,472 | 50.5 | 46.0 | 61.0 | 2.2 | 22.0 |
| HFN | 21,458 | 31 | 1,324 | 586 | 36.6 | 18.9 | 16.2 | 2.0 | 8.1 |
| Protein | 17,097 | 31 | 1,145 | 691 | 24.7 | 22.3 | 14.9 | 1.8 | 8.4 |
| TW | 18,452 | 31 | 1,211 | 546 | 33.8 | 17.6 | 15.2 | 1.7 | 9.1 |
Note. HFN, Hagberg falling number; Protein, protein content; TW, test weight; Var., variety.
The average number of estimations of the trait for a cultivar in the database.
Variance components and trait heritability
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
| |
| Yield | 0.393 | 10.265 | 0.400 | 24.89 | 0.393 | 0.020 | 0.950 |
| Protein | 0.226 | 2.017 | 0.287 | 7.67 | 0.226 | 0.037 | 0.858 |
| HFN | 1591 | 13,287 | 1,437 | 7.45 | 1591 | 192.89 | 0.892 |
| TW | 2.580 | 20.150 | 2.020 | 7.03 | 2.580 | 0.287 | 0.899 |
Note. h 2, heritability; HFN, Hagberg falling number; nv, average replication per cultivar; Protein, protein content; ResMS, residual mean square; TW, test weight VarMS, varieties mean square; Vg, genetic variance; Ve, environmental variance.
FIGURE 1Pearson correlation between grain yield (Yield), the presence or absence of the wheat/rye chromosome 1B/1R translocation, grain protein content (PRT), Hagberg falling number (HFN), and test weight (TW)
FIGURE 2Population subsubstructure in the association mapping panel of 150 accessions, based on (a) all markers (n = 16,801), and (b) skimmed markers (n = 463; linkage disequilibrium threshold R = .2). Markers with a minor allele frequency >0.05 were used for both analyses. The presence or absence of the chromosome 1B/1R wheat/rye chromosomal substitution is indicated by red circles and blue triangles, respectively. PC, principal component
FIGURE 3Manhattan plots of genome‐wide association studies for Hagberg falling number (HFN; a), protein content (PRT; b), test weight (TW; c), and grain yield (Yield; d). Markers are shown in genetic map order, according to the genetic map published by Wang et al. (2014). Unmapped markers are not shown here but are listed in Supplemental Table S5. The significance threshold (−log10 P = 3) is indicated by the horizontal red line
FIGURE 4Histogram of Hagberg falling number (HFN; a), protein content (PRT; b), test weight (TW; c), grain yield (YLD; d) in the association mapping panel
Summary of the quantitative trait loci (QTL) identified by genome wide association studies (GWAS) for Hagberg falling number (HFN), protein content (PRT), test weight (TW), and grain yield (YLD)
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| HFN |
| 1B | Jagger_c5878_119 | 0 | 1,254,498 | 3.10 | 0.079 | 28.42 | 0.58 |
| HFN |
| 3A | RAC875_c99055_69 | 283.735979 | 728,322,627 | 4.08 | 0.114 | −30.65 | 0.47 |
| HFN |
| 3B | BS00099738_51 | 227.3745367 | 759,168,681 | 3.04 | 0.077 | −31.32 | 0.73 |
| HFN |
| 6A | Kukri_c29110_360 | 88.77758706 | Un | 3.83 | 0.103 | −31.56 | 0.61 |
| HFN |
| 6A | BS00082104_51 | 192.8384162 | 581,841,966 | 3.69 | 0.098 | 32.95 | 0.38 |
| HFN |
| 7B | RAC875_c525_202 | 261.6310052 | 750,082,927 | 4.78 | 0.134 | −44.19 | 0.19 |
| PRT |
| 1A | BS00086680_51 | 71.0976257 | 281,657,026 | 3.26 | 0.087 | 0.39 | 0.52 |
| PRT |
| 1A | Excalibur_c13489_867 | 96.6018344 | 452,020,371 | 3.03 | 0.079 | −0.36 | 0.38 |
| PRT |
| 1A | RAC875_c5882_307 | 231.6429733 | 589,054,742 | 3.01 | 0.079 | −0.33 | 0.34 |
| PRT |
| 2A | RAC875_rep_c69619_78 | 207.5522799 | 734,352,032 | 3.15 | 0.083 | −0.33 | 0.56 |
| PRT |
| 2B | BS00046165_51 | 260.7730538 | 697,510,384 | 3.04 | 0.080 | 0.44 | 0.19 |
| PRT |
| 3A | wsnp_Ku_c30545_40369365 | 107.2210875 | 363,458,708 | 3.34 | 0.090 | −0.35 | 0.52 |
| PRT |
| 3B | wsnp_Ex_c20652_29734133 | 112.6476302 | 292,024,034 | 3.39 | 0.091 | −0.44 | 0.79 |
| PRT |
| 3B | RAC875_c58159_989 | 155.9566423 | 564,248,743 | 3.10 | 0.081 | −0.46 | 0.83 |
| PRT |
| 3B | GENE_1618_780 | 279.0622821 | 820,894,420 | 3.02 | 0.079 | −0.33 | 0.52 |
| PRT |
| 6B | BS00009795_51 | 12.20934913 | 4,876,473 | 3.36 | 0.090 | ‐0.35 | 0.44 |
| TW |
| 1B | RAC875_rep_c95069_54 | 81.49097047 | 336,647,904 | 3.12 | 0.080 | −1.24 | 0.21 |
| TW |
| 3B | GENE_1771_541 | 57.06175117 | 32,458,901 | 3.60 | 0.095 | 1.20 | 0.36 |
| TW |
| 3B | RAC875_c58159_989 | 155.9566423 | 571,753,368 | 3.20 | 0.083 | −1.43 | 0.86 |
| TW |
| 3B | Excalibur_c33274_498 | 207.7302565 | 738,752,902 | 3.21 | 0.083 | 2.04 | 0.05 |
| TW |
| 3B | BS00073480_51 | 269.7509427 | 812,721,677 | 3.41 | 0.089 | −1.91 | 0.91 |
| YLD |
| 1A | RAC875_rep_c105092_114 | 72.6102779 | 304,040,711 | 4.03 | 0.109 | 0.62 | 0.50 |
| YLD |
| 1A | wsnp_CAP8_c4785_2322876 | 166.3032581 | 544,054,745 | 3.28 | 0.085 | 0.56 | 0.79 |
| YLD |
| 2A | RAC875_c48625_182 | 22.63785838 | 18,636,671 | 4.30 | 0.118 | 0.57 | 0.53 |
| YLD |
| 2A | RAC875_c16993_839 | 245.2533379 | 774,815,015 | 4.57 | 0.127 | −0.73 | 0.38 |
| YLD |
| 2B | BS00002660_51 | 38.40853424 | 16,025,160 | 3.09 | 0.079 | 0.50 | 0.57 |
| YLD |
| 2B | BS00091099_51 | 221.2304675 | 578,601,402 | 3.97 | 0.107 | −0.55 | 0.54 |
| YLD |
| 2B | BS00046165_51 | 260.7730538 | 697,510,384 | 3.45 | 0.090 | −0.70 | 0.19 |
| YLD |
| 2B | Kukri_c34553_188 | 319.3174469 | 766,234,466 | 5.56 | 0.161 | −1.00 | 0.17 |
| YLD |
| 2D | wsnp_Ex_c1668_3169623 | 35.64284331 | Un | 3.94 | 0.106 | 0.59 | 0.70 |
| YLD |
| 3A | wsnp_Ex_c8884_14841846 | 179.8202073 | 625,239,797 | 3.13 | 0.08027 | −0.51 | 0.73 |
| YLD |
| 5B | CAP8_rep_c5825_165 | 3.286161131 | 15,054,913 | 3.27 | 0.0847 | 0.5 | 0.4 |
| YLD |
| 6A | BS00082812_51 | 0.502519793 | 639,383 | 4.22 | 0.11534 | 0.73 | 0.78 |
| YLD |
| 6A | wsnp_Ku_c3354_6228393 | 132.9525631 | 430,933,984 | 3.43 | 0.09203 | −0.71 | 0.24 |
| YLD |
| 6B | Kukri_c21405_2131 | 15.73078008 | 166,814 | 3.8 | 0.10161 | −0.73 | 0.19 |
Note. The most significant marker at each QTL is listed. Genetic (Gardner et al., 2016) and physical (IWGSC, 2018) map positions for peak single nucleotide polymorphisms (SNPs) at each QTL are indicated. Where SNPs were anchored to physical map regions currently not allocated to a chromosome designation in the wheat reference genome assembly, the bp position is recorded as unknown (Un). SNP effect at each QTL were determined via two methods (defined below).
SNP effects determined via the outputs of the software TASSELL.
SNP effects determined via modeling the effects considering just those QTL identified for a specific trait.
GWAS hits identified using both kinship and kinship+PCA to correct for population structure. The remaining hits were identified using kinship correction only.