Literature DB >> 35701755

Genetic architecture of end-use quality traits in soft white winter wheat.

Meriem Aoun1,2, Arron H Carter1, Craig F Morris3, Alecia M Kiszonas4.   

Abstract

BACKGROUND: Genetic improvement of end-use quality is an important objective in wheat breeding programs to meet the requirements of grain markets, millers, and bakers. However, end-use quality phenotyping is expensive and laborious thus, testing is often delayed until advanced generations. To better understand the underlying genetic architecture of end-use quality traits, we investigated the phenotypic and genotypic structure of 14 end-use quality traits in 672 advanced soft white winter wheat breeding lines and cultivars adapted to the Pacific Northwest region of the United States.
RESULTS: This collection of germplasm had continuous distributions for the 14 end-use quality traits with industrially significant differences for all traits. The breeding lines and cultivars were genotyped using genotyping-by-sequencing and 40,518 SNP markers were used for association mapping (GWAS). The GWAS identified 178 marker-trait associations (MTAs) distributed across all wheat chromosomes. A total of 40 MTAs were positioned within genomic regions of previously discovered end-use quality genes/QTL. Among the identified MTAs, 12 markers had large effects and thus could be considered in the larger scheme of selecting and fixing favorable alleles in breeding for end-use quality in soft white wheat germplasm. We also identified 15 loci (two of them with large effects) that can be used for simultaneous breeding of more than a single end-use quality trait. The results highlight the complex nature of the genetic architecture of end-use quality, and the challenges of simultaneously selecting favorable genotypes for a large number of traits. This study also illustrates that some end-use quality traits were mainly controlled by a larger number of small-effect loci and may be more amenable to alternate selection strategies such as genomic selection.
CONCLUSIONS: In conclusion, a breeder may be faced with the dilemma of balancing genotypic selection in early generation(s) versus costly phenotyping later on.
© 2022. The Author(s).

Entities:  

Keywords:  Association mapping; End-use quality; Molecular markers; Soft white winter wheat

Mesh:

Year:  2022        PMID: 35701755      PMCID: PMC9195237          DOI: 10.1186/s12864-022-08676-5

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   4.547


Introduction

End-use quality improvement in soft white wheat (Triticum aestivum L.) is one of the primary objectives of wheat breeding programs. End-use quality is complex and involves multiple traits. The key end-use quality parameters in soft white wheat include softer kernels, lower grain protein content and gluten strength, less damaged starch, lower non-starch polysaccharides that lead to decreased water absorption capacity, larger cookies diameter and cake volume. For some soft wheat products, starch paste viscosity is a key quality trait. In breeding programs, end-use quality phenotyping is laborious, expensive, time consuming and requires a large amount of grain. Consequently, selection for end-use quality is often delayed until later breeding stages [1, 2]. Since most end-use quality traits are predominantly controlled by genetic factors [3-5], a better understanding of the underlying genetic architecture of the various traits can support strategies for both phenotypic and genotypic selection, including an assessment of the potential effectiveness of marker-assisted selection. Analysis of marker trait associations have identified numerous quantitative trait loci (QTL) for different end-use quality traits distributed across all 21 wheat chromosomes [2, 4, 6–23]. However, most of these studies were performed in hard wheat (bread wheat) and these investigations [2, 4, 17, 22, 23] were performed in soft wheat. Soft white wheat has unique milling and baking parameters which are aimed at making food products such as cookies and cakes [24, 25]. A number of end-use quality traits are influenced by the effect(s) of major genes. For example, the genetic architecture of grain hardness is primarily controlled by the puroindolines, of gluten strength by the high molecular weight glutenins, and of starch paste viscosity by the granule bound starch synthase (‘waxy’) genes [1, 26]. However, these major genes are often fixed in elite breeding populations due to parent selection, or early generation phenotypic and/or genotypic selection and do not sufficiently account for the levels of end-use quality required for cultivar release nor for the range of variation observed among breeding populations [25]. A number of mapping studies for end-use quality were performed in bi-parental populations and in some cases one or both parents were either poorly adapted or would not constitute ‘elite’ germplasm for applied plant breeding [9, 11, 13]. Additionally, the bi-parental genetic structure limits QTL mapping resolution. Genome-wide association mapping (GWAS) can overcome these limitations by using historical recombination events that occur throughout the germplasm evolution and using elite breeding germplasm from the breeding program of interest.. In this study we implemented GWAS using recent breeding lines and cultivars from the Washington State University (WSU) soft white winter wheat breeding program to investigate the underlying genetic architecture of phenotypic variation of 14 end-use quality traits in 672 soft white winter wheat genotypes. We identified end-use quality associated single nucleotide polymorphism (SNP) markers using GWAS and identified large effect QTL. These QTL contribute to better understanding of the underlying genetic architecture of end-use quality in soft white wheat and provide an objective assessment as to the potential for marker assisted selection (MAS) versus other genotypic and phenotypic selection strategies.

Materials and methods

Plant materials

A total of 672 soft white winter wheat breeding lines and cultivars were used in this study. The breeding lines were F4:5 lines and double haploid lines selected from different crosses to represent the diversity present in the WSU winter wheat breeding program. The genotypes and the environments in which the lines were grown were described in Aoun et al. [27]. In brief, this germplasm was evaluated in 29 environments (year-location combinations). Genotypes were grown from 2015 to 2019 in seven locations in Washington State (WA), USA including Pullman, Lind, Davenport, Ritzville, Waterville, Walla Walla, and Dayton. In this dataset, there were 1–7 nurseries per environment with a total of 76 nurseries. From each nursery, a single sample from one replicate per genotype was evaluated for end-use quality traits. The dataset was unbalanced with some shared lines between environments. The connectivity between environments in terms of genotypes was described in Aoun et al. [27]. There were 43 genotypes (out of the 672) evaluated for end-use quality in more than one-quarter of the environments [27].

Phenotypic data

The wheat genotypes were evaluated for 14 end-use quality traits that are classified into four categories which are grain characteristics, milling traits, flour characteristics, and baking parameters. The phenotypic and genotypic data were retrieved from Aoun et al. [27] which investigated genotype × environment interactions and tested the performance of genomic prediction for the 14 end‐use quality traits. Traits associated with grain characteristics included Single Kernel Characterization System (SKCS) hardness, SKCS size, SKCS weight, test weight, and grain protein content. SKCS hardness is a key determinant of end-use quality where hard wheat is mainly used for making bread, and soft wheat is primarily used for making cookies, cakes, and confectionery products [1, 28, 29]. In the grain market, test weight and grain protein content are the two main parameters. High test weight, which is correlated with kernel weight and size [30, 31], usually leads to higher milling performance [32]. Milling traits included break flour yield, flour yield, flour ash content, and milling score. Break flour yield was calculated as the percent of flour recovered from the break rolls, whereas flour yield (‘straight grade’) was determined as the proportion of grain recovered as flour (break plus reduction flour). Flour ash content is the minerals remaining after flour combustion. Milling score was a function of both flour yield and flour ash content [33]. Higher break flour yield, flour yield, and milling score are desirable in soft wheat. Higher inclusion of bran reduces the functionality of most doughs and batters [34]. As such, mineral content of flours (ash) serves as a proxy for bran contamination and lower flour ash is preferred. Flour functionality plays an important role in baking performance. Flour parameters included flour protein content, flour sodium dodecyl sulfate sedimentation volume (SDS sedimentation), water solvent retention capacity (water SRC), and flour swelling volume (FSV). Unlike bread, soft white wheat products require lower grain/flour protein content, weaker gluten strength (lower SDS sedimentation volume), and low water SRC. FSV is an end-use quality parameter associated with the amount of amylose and amylopectin components in endosperm starch [35] and needs to be high for making some Asian-style noodles [1, 36]. The baking parameter cookie diameter is considered an important indicator of the overall quality of soft wheat [28, 37] and has been a key selection trait in soft wheat breeding programs. These end-use quality traits were measured following the procedures from the American Association of Cereal Chemists International [38] and as described by Aoun et al. [27]. The data set was analyzed using mixed linear model in the R package lme4 [39, 40]. The environments were considered random, while genotypes were fitted as fixed in the model. For each trait, best linear unbiased estimators (BLUEs) of the genotypes were extracted from the mixed linear model and used for further statistical analysis. Broad sense heritability (HCullis) and correlations between traits were previously described by Aoun et al. [27].

Genotyping

Genotyping-by-sequencing (GBS) [41] was used to genotype the 672 soft white wheat breeding lines and cultivars. The genotypic data for the 672 genotypes were previously provided by Aoun et al. [27]. The GBS was performed at the North Carolina State University Genomic Sciences Laboratory in Raleigh, NC, USA. The sequence reads were aligned to the T. aestivum RefSeq v1.0 reference genome [42] and SNP data were filtered for minor allele frequency (MAF) ≥ 5%, missing data ≤ 30%, and heterozygous frequency ≤ 15%. From this, 40,518 SNPs were used for further analysis. Missing datapoints in the SNP data were imputed using the expectation–maximization algorithm implemented in the package rrBLUP [43] in R version 4.0.2 [44].

Population structure and linkage disequilibrium

To visualize the population structure in the 672 genotypes, principal component analysis (PCA) was performed using the ‘prcomp’ function in R based on 40,518 SNPs. The population structure was visualized using the first two principal components (PCs) that explained the highest percentage of variation. Pairwise linkage disequilibrium (LD) between SNPs (r2) was estimated using TASSEL v5 [45] by applying a sliding window of 50 markers. The r2 values of marker pairs were plotted against the physical distances in Mega base pairs (Mb) after randomly selecting 10% of the total SNP pairs. To visualize the LD decay across the genome and for each of the 21 chromosomes, a locally estimated scatterplot smoothing (LOESS) curve was fitted using the function ‘geom_smooth’ in R package ggplot2 [46]. The r threshold was derived from the 95th percentile of the distribution of unlinked r2 (for markers on different chromosomes) [47] that were significant at the 99.99% level of confidence. The r threshold is the value beyond which LD was likely to be caused by genetic linkage. The intersection of the horizontal line at the r threshold value with the LOESS curve on the LD scatter plot was considered as the estimate of the extent of LD across the genome (genome-wise LD decay plot) and across each chromosome (chromosome-wise LD decay plot).

Genome-wide association mapping

The BLUEs for each trait were considered as the phenotype in the GWAS. Association mapping was performed using three models 1) mixed linear model (MLM), 2) Fixed and random model Circulating Probability Unification (FarmCPU) [48], and 3) Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK) [49] implemented in the GAPIT R package [50]. The single-locus MLM is the most widely used in association mapping studies. However, it tests one marker at a time and therefore is likely to increase the number of false negatives for complex traits [22, 51]. Multi-locus models such as FarmCPU were proposed to overcome this problem. FarmCPU iteratively uses fixed and random models in which the identified significant SNPs from the iterations are fitted as cofactors [48]. FarmCPU was reported to control for false negatives and false positives without causing model overfitting. BLINK was derived from the FarmCPU method with a few modifications. BLINK does not assume that causal genes are evenly distributed across the genome. It also works directly on markers instead of bins and excludes markers in LD with the most significant markers. BLINK uses Bayesian Information Content (BIC) of a fixed effect model to approximate the maximum likelihood of a random effect model to select marker trait associations (MTAs). The GWAS models considered family relatedness (Kinship matrix or K matrix) [52] and population structure (Q matrix). K matrix was included in all GWAS models, whereas the optimal number of principal components (PCs) in the Q matrix were determined based on quantile–quantile (Q-Q) plots that visualize the expected -log10 (P) versus the observed -log10 (P). The number of PCs included in the GWAS models was limited to the first four PCs. Manhattan plots for MTAs were visualized using the R package ‘qqman’ [53]. MTAs were considered significant at a false discovery rate (FDR) [54] of ≤ 0.05. Based on the T. aestivum RefSeq v1.0 reference genome assembly (https://wheat.pw.usda.gov/jb/?data=/ggds/whe-iwgsc2018), we identified annotated genes within the genomic regions of significant SNPs that exhibit large effects (significantly impacted trait values based on Tukey’s HSD test, had large impact on the phenotype, and were unlikely to be false positives, i.e., MAF ≥ 7%). In this search, we considered only high-confidence annotated genes located within a few kilobase pairs before and after the associated SNP physical positions. The putative biological functions of the candidate genes were retrieved from this website https://wheat-urgi.versailles.inra.fr/Seq-Repository/Annotations. In addition, we extracted MTAs available at the International Wheat Genome Sequencing Consortium (IWGSC) sequence repository that were within the genomic regions of large effect MTAs. Furthermore, markers (SNP, simple sequence repeat: SSR, Diversity Arrays Technology: DArT) associated with end-use quality in previous genetic studies [2, 4, 6–17, 19–22] and physically close (based on T. aestivum RefSeq v1.0 reference genome) to all MTAs in this study were determined.

Results

The distributions of BLUEs for each of the 14 end-use quality traits are illustrated in Fig. 1. There were continuous phenotypic distributions for all end-use quality traits. For grain characteristics, BLUEs ranged from -0.8 to 54.1 for SKCS hardness, 2.1 to 3.2 mm for SKCS size, 28.5 to 48.3 mg for SKCS weight, 55.6 to 65.4 kg/hL for test weight, and 8.3 to 15.7% for grain protein. For milling traits, BLUEs ranged from 36.5 to 54.9% for break flour yield, 57.1 to 74.3% for flour yield, 0.20 to 0.51% for flour ash, and 67.0 to 98.6 for milling score. For flour parameters, BLUEs ranged from 6.4 to 12.4% for flour protein, 3.4 to 18.2 g/mL for SDS sedimentation, 44.5 to 73.1% for water SRC, and 11.6 to 26.2 mL/g for FSV. For cookie diameter, BLUEs ranged from 7.8 to 9.7 cm. For all of these traits, differences in phenotypes would be considered to be industrially significant, with many values below minimum targets [25]. Moderate to high broad sense heritability (H = 0.46–0.70) was observed for all traits except for grain and flour protein content (H = 0.18 to 0.19) [27].
Fig. 1

Distributions of best linear unbiased estimators (BLUEs) for 14 end-use quality traits in 672 soft white winter wheat genotypes

Distributions of best linear unbiased estimators (BLUEs) for 14 end-use quality traits in 672 soft white winter wheat genotypes Of the 40,518 SNPs, there were 14,102 (34.8%) SNPs on the A genome, 16,626 (41.0%) SNPs on the B genome, 8,656 (21.4%) SNPs on the D genome, and 1,134 (2.8%) SNPs on unaligned (UN) chromosome(s). PCA based on the first two PCs showed minimal clustering in wheat genotypes, which was expected since the plant materials in this study were from the same wheat breeding program (Supplementary Fig. S1). The first 10 PCs accounted cumulatively for 26.3% of the variation. The first four PCs explained 5.3%, 4.0%, 3.3% and 3.0% of variation, respectively. The genome-wise LD dropped to an r threshold of 0.1 within 6.5 Mb on average (Supplementary Fig. S2). LD decayed to 0.1 at ~ 2.5–5.0 Mb for chromosomes on the A genome, to 5.0–10 Mb for chromosomes on the B and the D genome (Supplementary Fig. S3).

GWAS model selection

The best models within each method were selected based on examination of Q-Q plots. For MLM, we selected MTAs from the K (Kinship) model. Using FarmCPU, K + 2PCs (Kinship and Q based on the first two PCs) was selected to model SKCS hardness, SKCS weight, grain protein content, flour yield, flour ash, milling score, SDS sedimentation, water SRC, and cookie diameter, whereas K + 3PCs (Kinship and Q based on the first three PCs) was selected to model the remaining traits, SKCS size, test weight, break flour yield, flour protein content, and FSV. For BLINK, we selected K + 4PCs (Kinship and Q based on the first four PCs) for all traits. In contrast to the Q-Q plots generated from FarmCPU models, the Q-Q plots from MLM and BLINK did not show a sharp deviation of the observed P-value distribution from the expected P-value distribution (Supplementary Fig. S4, S5, S6). These results suggest that FarmCPU provided a better control of false negatives and false positives compared to MLM and BLINK. Thus, only association mapping results from FarmCPU will be discussed in this study (Tables 1,2,3,4 , Supplementary Fig. S7). MTAs generated from MLM and BLINK are provided in Supplementary Table S1, S2.
Table 1

Summary of SNP markers associated with grain characteristics in 672 soft white winter wheat genotypes

TraitaSNPbChromosomePosition (bp)cAllelesdMAFeP valuefFDRgEffect
SKCS hardnessS1A_5838946891A583,894,689T/C0.281.75E-054.44E-021.04
SKCS hardnessS1B_6358691001B635,869,100A/G0.243.84E-085.19E-04-1.46
SKCS hardnessS3A_6907863873A690,786,387A/G0.234.73E-061.60E-021.72
SKCS hardnessS3B_7188509703B718,850,970C/G0.467.04E-091.43E-04-1.34
SKCS hardnessS4D_5097982524D509,798,252A/G0.381.32E-053.55E-02-0.97
SKCS hardnessS5A_4805152215A480,515,221C/T0.142.91E-191.18E-14-3.34
SKCS hardnessS5B_5495564745B549,556,474G/T0.061.45E-071.18E-03-2.49
SKCS hardnessS6B_115378716B11,537,871C/T0.055.51E-073.19E-03-3.51
SKCS hardnessS6B_1301638596B130,163,859A/G0.184.98E-073.19E-03-1.47
SKCS hardnessS6B_2331220366B233,122,036T/C0.077.69E-073.90E-032.10
SKCS hardnessS6B_7056137776B705,613,777G/A0.081.36E-066.15E-032.14
SKCS hardnessS7A_120110697A12,011,069T/A0.059.44E-062.73E-022.85
SKCS hardnessS7A_5148929017A514,892,901A/T0.121.56E-066.32E-03-1.78
SKCS hardnessS7B_6435014077B643,501,407A/G0.465.79E-085.86E-041.37
SKCS hardnessS7D_560658547D56,065,854C/G0.179.33E-062.73E-021.32
SKCS hardnessS7D_6173775397D617,377,539C/T0.063.91E-061.44E-022.27
SKCS sizeS1B_5695079321B569,507,932T/C0.471.19E-065.36E-030.02
SKCS sizeS2B_1548463502B154,846,350T/G0.401.72E-066.96E-030.02
SKCS sizeS2D_5637991662D563,799,166G/T0.078.65E-062.50E-02-0.04
SKCS sizeS2D_5869616402D58,6961,640A/G0.203.38E-061.05E-020.03
SKCS sizeS3A_606895913A60,689,591T/C0.323.84E-072.25E-03-0.02
SKCS sizeS3A_7210926853A721,092,685C/G0.172.26E-071.96E-030.03
SKCS sizeS4B_5785211404B578,521,140T/C0.232.91E-083.93E-040.03
SKCS sizeS5A_5780747315A578,074,731A/G0.214.99E-072.53E-03-0.03
SKCS sizeS5A_5830313415A583,031,341C/T0.422.28E-067.80E-03-0.02
SKCS sizeS5A_6369597835A636,959,783T/C0.172.31E-067.80E-03-0.02
SKCS sizeS5B_6159695325B615,969,532G/A0.083.89E-072.25E-030.04
SKCS sizeS6B_1266326936B126,632,693T/G0.051.31E-082.65E-040.04
SKCS sizeS6B_3366061086B336,606,108T/G0.172.42E-071.96E-03-0.02
SKCS sizeS6B_5832817106B583,281,710G/A0.112.86E-101.16E-050.04
SKCS weightS1B_4275308231B427,530,823T/C0.155.42E-074.39E-03-0.87
SKCS weightS2A_1658707822A165,870,782A/T0.388.34E-075.63E-030.64
SKCS weightS2B_183262072B18,326,207C/T0.234.55E-074.39E-030.63
SKCS weightS2B_5331781652B533,178,165C/G0.094.42E-074.39E-03-1.11
SKCS weightS2D_5637991662D563,799,166G/T0.071.16E-055.22E-02-1.04
SKCS weightS2D_6134426362D613,442,636G/T0.051.77E-068.96E-031.36
SKCS weightS6B_5832817106B583,281,710G/A0.111.77E-074.39E-030.75
SKCS weightS7B_6860042317B686,004,231G/A0.093.31E-074.39E-03-1.15
SKCS weightS7D_74955677D7,495,567G/T0.051.14E-066.57E-031.42
Test weightS2A_7622926622A762,292,662T/G0.238.08E-062.89E-02-0.31
Test weightS4B_4134979494B413,497,949C/T0.094.02E-085.43E-04-0.39
Test weightS5A_6115688875A611,568,887A/G0.251.79E-083.62E-040.24
Test weightS5B_5069533325B506,953,332T/A0.211.76E-061.42E-02-0.21
Test weightS5B_5509105135B550,910,513T/C0.433.72E-062.15E-020.19
Test weightS5B_6051716045B605,171,604T/C0.094.76E-062.15E-020.27
Test weightS6A_466750246A46,675,024C/G0.065.32E-062.15E-02-0.36
Test weightS6A_5594289776A559,428,977G/A0.262.45E-099.93E-050.27
Test weightS6B_5106084406B510,608,440G/A0.084.91E-062.15E-020.40
Test weightS7A_6697159417A669,715,941T/C0.143.66E-062.15E-02-0.29
Test weightS7B_401948787B40,194,878G/C0.182.26E-072.29E-030.25
Test weightS7B_6367443137B636,744,313T/A0.178.54E-062.89E-020.28
Grain proteinS1A_5851751451A58,517,5145G/C0.101.33E-054.48E-020.28
Grain proteinS1B_468838681B46,883,868C/A0.091.04E-104.22E-060.38
Grain proteinS1B_6339749581B633,974,958A/G0.459.35E-087.58E-04-0.18
Grain proteinS2A_6128360912A612,836,091C/T0.055.91E-062.18E-02-0.31
Grain proteinS3B_4051613123B405,161,312T/C0.272.74E-061.20E-02-0.19
Grain proteinS3B_6908780203B690,878,020T/C0.441.72E-069.94E-03-0.14
Grain proteinS4A_3524952004A352,495,200G/A0.101.61E-069.94E-03-0.23
Grain proteinS4B_631213164B63,121,316C/T0.103.36E-083.41E-040.31
Grain proteinS5D_5432536025D543,253,602G/A0.081.85E-092.50E-05-0.38
Grain proteinS6D_4711797336D471,179,733A/C0.052.95E-061.20E-020.52
Grain proteinS7A_7310260677A731,026,067A/T0.261.43E-092.50E-050.27
Grain proteinSUN_86449317UN86,449,317T/G0.191.45E-054.53E-020.22
Grain proteinSUN_351152321UN351,152,321A/G0.312.75E-061.20E-02-0.19

aSKCS, Single Kernel Characterization System

bSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects

cPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0

dSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher phenotypic values for SKCS size, SKCS weight, and test weight and lower phenotypic values for SKCS hardness and grain protein content)

eMinor allele frequency of the SNP

fP value of the significant SNP

gFalse discovery rate of the significant SNP

Table 2

Summary of SNP markers associated with milling traits in 672 soft white winter wheat genotypes

TraitSNPaChromosomePosition (bp)bAllelescMAFdP valueeFDRfEffect
Break flour yieldS1B_1000550261B100,055,026T/C0.082.96E-072.40E-031.00
Break flour yieldS1B_6536817521B65,368,1752G/T0.141.32E-133.52E-09-1.05
Break flour yieldS2A_6780056502A678,005,650C/T0.311.74E-133.52E-090.55
Break flour yieldS3B_6303944563B630,394,456G/T0.066.94E-074.69E-030.79
Break flour yieldS4A_5845388274A584,538,827A/G0.117.69E-111.04E-06-0.87
Break flour yieldS4B_6118529434B611,852,943G/C0.086.37E-096.45E-050.70
Break flour yieldS5B_5086657775B508,665,777T/C0.073.67E-061.65E-02-0.62
Break flour yieldS7A_345273137A34,527,313G/C0.339.84E-063.63E-020.27
Break flour yieldS7A_6428993317A642,899,331C/G0.141.23E-067.13E-03-0.44
Break flour yieldS7A_7325940087A732,594,008A/G0.231.60E-068.11E-03-0.41
Break flour yieldS7D_560658547D56,065,854C/G0.174.55E-061.84E-02-0.44
Flour yieldS1A_91283131A9,128,313G/T0.391.09E-064.43E-030.38
Flour yieldS1A_156863461A15,686,346A/G0.342.28E-068.40E-03-0.31
Flour yieldS1A_5875811291A587,581,129A/G0.061.20E-053.05E-02-0.48
Flour yieldS1B_5552941341B555,294,134C/T0.103.62E-061.20E-02-0.48
Flour yieldS1B_5853260311B585,326,031C/G0.445.55E-073.22E-030.28
Flour yieldS1B_6536817521B653,681,752G/T0.145.38E-172.18E-12-1.05
Flour yieldS2A_579241832A57,924,183A/G0.429.10E-062.46E-02-0.25
Flour yieldS3A_6215928643A621,592,864C/T0.317.74E-062.24E-02-0.31
Flour yieldS5A_3822941235A382,294,123G/A0.241.37E-111.84E-070.55
Flour yieldS5B_5086657775B508,665,777T/C0.072.14E-071.45E-03-0.64
Flour yieldS6B_193359966B19,335,996G/A0.072.13E-134.32E-09-0.89
Flour yieldS6D_4716149816D471,614,981T/C0.351.88E-071.45E-03-0.32
Flour yieldS7A_662836127A66,283,612A/C0.338.24E-074.17E-03-0.34
Flour yieldS7A_7358189917A735,818,991C/G0.229.60E-074.32E-030.36
Flour yieldS7B_1236565167B123,656,516G/C0.157.25E-107.34E-06-0.60
Flour ashS1D_207021501D20,702,150G/T0.232.54E-061.03E-020.01
Flour ashS1D_4888052721D488,805,272G/A0.173.53E-061.30E-020.01
Flour ashS2A_777442542A77,744,254G/A0.091.18E-053.99E-02-0.01
Flour ashS2B_7944292902B794,429,290C/T0.305.61E-073.79E-03-0.01
Flour ashS2D_5518477962D551,847,796G/C0.102.19E-069.85E-03-0.01
Flour ashS3D_5402324583D540,232,458C/A0.081.60E-054.99E-02-0.01
Flour ashS4A_1201444124A120,144,412T/C0.104.63E-084.85E-04-0.02
Flour ashS5A_36495345A3,649,534C/A0.333.13E-072.53E-030.01
Flour ashS5B_646332235B64,633,223G/A0.057.56E-073.83E-03-0.02
Flour ashS5B_680524785B68,052,478A/T0.361.49E-093.02E-05-0.01
Flour ashS5D_4165968735D416,596,873T/G0.101.33E-105.38E-060.02
Flour ashS6D_4695838076D469,583,807G/C0.107.00E-073.83E-030.01
Flour ashS7B_6249471997B624,947,199C/A0.054.79E-084.85E-040.02
Milling scoreS1D_147077391D14,707,739G/T0.084.78E-083.88E-041.49
Milling scoreS2B_472002832B47,200,283C/G0.225.52E-062.24E-021.00
Milling scoreS2B_1300289012B130,028,901G/C0.276.33E-062.33E-02-0.59
Milling scoreS2B_7597354902B759,735,490T/C0.273.51E-111.42E-061.01
Milling scoreS3B_824956343B82,495,634G/C0.071.64E-069.48E-031.17
Milling scoreS4A_7243164094A724,316,409G/A0.271.67E-055.19E-020.58
Milling scoreS5A_206405665A20,640,566C/T0.091.41E-082.29E-04-1.24
Milling scoreS5B_680524785B68,052,478A/T0.362.19E-061.11E-020.71
Milling scoreS5B_5033262065B503,326,206C/T0.192.49E-061.12E-02-0.78
Milling scoreS6B_279182216B27,918,221C/T0.245.81E-083.93E-040.90
Milling scoreS6D_4716149816D47,161,4981T/C0.371.70E-082.29E-04-0.83
Milling scoreS7D_3198493577D319,849,357G/A0.144.16E-083.88E-041.25
Milling scoreSUN_31626104UN31,626,104G/T0.251.08E-053.63E-020.65

aSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects

bPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0

cSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher phenotypic values for break flour yield, flour yield, and milling score and lower phenotypic values for flour ash)

dMinor allele frequency of the SNP

eP value of the significant SNP

fFalse discovery rate of the significant SNP

Table 3

Summary of SNP markers associated with flour functionality in 672 soft white winter wheat genotypes

TraitaSNPbChromosomePosition (bp)cAllelesdMAFeP valuefFDRgEffect
Flour proteinS1B_468838681B46,883,868C/A0.091.67E-093.37E-050.29
Flour proteinS2A_1598820692A159,882,069C/T0.094.24E-072.46E-030.24
Flour proteinS2D_287632992D28,763,299G/A0.341.17E-081.19E-04-0.18
Flour proteinS3B_6908780203B690,878,020T/C0.448.56E-091.16E-04-0.19
Flour proteinS4A_5144308984A514,430,898C/T0.463.11E-121.26E-070.19
Flour proteinS4B_631213164B63,121,316C/T0.102.21E-069.95E-030.22
Flour proteinS5D_1356454555D135,645,455G/A0.352.28E-081.85E-04-0.17
Flour proteinS7A_328057347A32,805,734G/A0.271.27E-066.46E-03-0.16
Flour proteinS7A_7304164267A730,416,426G/A0.211.06E-054.00E-020.13
Flour proteinS7D_5072278657D507,227,865G/T0.051.09E-054.00E-02-0.32
Flour proteinSUN_351152321UN351,152,321A/G0.311.35E-079.13E-04-0.19
SDS sedimentationS1A_53265551A5326,555G/A0.301.59E-068.65E-03-0.37
SDS sedimentationS1B_5617125201B561,712,520G/A0.408.53E-143.45E-090.48
SDS sedimentationS1D_1219906801D121,990,680T/G0.071.08E-067.26E-03-0.67
SDS sedimentationS1D_2237972301D223,797,230C/A0.051.78E-068.65E-03-0.75
SDS sedimentationS1D_4110630681D411,063,068T/A0.141.95E-103.95E-060.65
SDS sedimentationS2B_1551104382B155,110,438A/G0.422.14E-068.65E-03-0.33
SDS sedimentationS3A_6972022793A697,202,279T/C0.097.40E-062.50E-020.47
SDS sedimentationS3B_7235053343B723,505,334T/G0.431.18E-091.59E-05-0.42
SDS sedimentationS5A_5850180415A585,018,041A/G0.441.97E-068.65E-030.33
SDS sedimentationS5A_6331058375A633,105,837C/A0.063.25E-061.20E-02-0.75
SDS sedimentationS5B_5390754835B539,075,483A/G0.117.34E-087.44E-040.61
SDS sedimentationS7A_1087853657A108,785,365G/A0.071.06E-053.32E-020.93
SDS sedimentationS7A_6751987287A675,198,728C/T0.111.42E-071.15E-03-0.57
Water SRCS1B_5479731541B547,973,154C/T0.105.59E-083.78E-040.60
Water SRCS1B_6536817521B653,681,752G/T0.141.25E-105.08E-060.92
Water SRCS2A_6137207682A613,720,768T/C0.062.02E-071.02E-03-0.74
Water SRCS2B_665595342B66,559,534C/A0.085.49E-072.22E-030.69
Water SRCS3A_249938763A24,993,876C/T0.081.49E-078.61E-040.74
Water SRCS5A_3822941235A382,294,123G/A0.242.88E-082.33E-04-0.46
Water SRCS5A_6735503055A673,550,305C/G0.371.18E-091.59E-05-0.45
Water SRCS6B_297718216B29,771,821G/C0.352.97E-071.34E-03-0.36
Water SRCS7A_7097651487A709,765,148G/T0.129.56E-101.59E-050.69
Water SRCS7B_5391962887B539,196,288A/G0.071.35E-081.37E-040.72
Water SRCS7D_3276905807D327,690,580G/A0.268.76E-063.23E-02-0.38
FSVS1A_5340556531A534,055,653G/T0.051.70E-068.40E-030.74
FSVS1B_66787321B6,678,732A/C0.288.90E-093.60E-04-0.38
FSVS2A_7055838922A705,583,892T/C0.102.86E-069.65E-030.44
FSVS2B_76539642B7,653,964C/T0.192.28E-068.40E-030.43
FSVS2B_313820502B31,382,050G/C0.481.74E-068.40E-03-0.27
FSVS3D_2668392643D266,839,264A/G0.292.21E-068.40E-030.31
FSVS3D_6010136373D601,013,637A/G0.134.77E-073.22E-030.49
FSVS4A_121683934A12,168,393C/T0.333.61E-073.22E-03-0.33
FSVS4B_5412527594B541,252,759A/G0.152.04E-068.40E-03-0.44
FSVS5D_863088785D86,308,878C/A0.054.10E-088.31E-040.80
FSVS7A_2006081147A20,060,8114G/T0.181.28E-053.71E-020.45
FSVS7A_5833572147A583,357,214G/A0.054.20E-073.22E-030.73
FSVS7B_5400568507B540,056,850C/T0.066.71E-089.06E-040.66
FSVS7D_380000377D38,000,037C/T0.065.62E-061.75E-02-0.63

aWater SRC, water solvent retention capacity; FSV, flour swelling volume

bSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects

cPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0

dSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher phenotypic values FSV and lower phenotypic values for flour protein, SDS sedimentation, and water SRC)

eMinor allele frequency of the SNP

fP value of the significant SNP

gFalse discovery rate of the significant SNP

Table 4

Summary of SNP markers associated with cookie diameter in 672 soft white winter wheat genotypes

TraitSNPaChromosomePosition (bp)bAllelescMAFdP valueeFDRfEffect
Cookie diameterS1A_5867063971A586,706,397G/T0.061.89E-093.83E-05-0.08
Cookie diameterS1B_5733235461B573,323,546G/A0.126.74E-062.62E-02-0.06
Cookie diameterS2D_288320582D28,832,058T/C0.402.25E-061.01E-020.04
Cookie diameterS4A_5839688234A583,968,823G/A0.108.52E-085.77E-040.06
Cookie diameterS4A_6884075114A688,407,511G/A0.175.03E-085.09E-04-0.05
Cookie diameterS4B_6678333524B667,833,352C/T0.181.33E-066.73E-03-0.06
Cookie diameterS5A_5553348645A555,334,864C/T0.431.27E-093.83E-050.05
Cookie diameterS5B_4184636805B418,463,680A/T0.189.94E-085.77E-04-0.06
Cookie diameterS5B_5717251915B571,725,191T/A0.367.11E-062.62E-020.03
Cookie diameterS5B_5809766325B580,976,632A/C0.181.25E-053.89E-020.04
Cookie diameterS7B_6487845737B648,784,573G/A0.104.48E-085.09E-040.07
Cookie diameterS7D_560658547D56,065,854C/G0.189.97E-085.77E-04-0.05
Cookie diameterSUN_82579190UN82,579,190G/C0.149.93E-063.35E-02-0.05

aSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects

bPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0

cSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher cookie diameter)

dMinor allele frequency of the SNP

eP value of the significant SNP

fFalse discovery rate of the significant SNP

Summary of SNP markers associated with grain characteristics in 672 soft white winter wheat genotypes aSKCS, Single Kernel Characterization System bSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects cPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0 dSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher phenotypic values for SKCS size, SKCS weight, and test weight and lower phenotypic values for SKCS hardness and grain protein content) eMinor allele frequency of the SNP fP value of the significant SNP gFalse discovery rate of the significant SNP Summary of SNP markers associated with milling traits in 672 soft white winter wheat genotypes aSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects bPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0 cSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher phenotypic values for break flour yield, flour yield, and milling score and lower phenotypic values for flour ash) dMinor allele frequency of the SNP eP value of the significant SNP fFalse discovery rate of the significant SNP Summary of SNP markers associated with flour functionality in 672 soft white winter wheat genotypes aWater SRC, water solvent retention capacity; FSV, flour swelling volume bSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects cPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0 dSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher phenotypic values FSV and lower phenotypic values for flour protein, SDS sedimentation, and water SRC) eMinor allele frequency of the SNP fP value of the significant SNP gFalse discovery rate of the significant SNP Summary of SNP markers associated with cookie diameter in 672 soft white winter wheat genotypes aSignificant single-nucleotide polymorphism (SNP). Markers in bold are markers with large effects bPhysical position of SNP sequence based on Wheat Chinese Spring IWGSC RefSeq v1.0 cSNP major allele/minor allele, the allele in bold is the favorable allele in soft white wheat (associated with higher cookie diameter) dMinor allele frequency of the SNP eP value of the significant SNP fFalse discovery rate of the significant SNP

Marker-trait associations

Based on the LD between markers, each MTA identified from the FarmCPU models represent a distinct locus or QTL. Considering all traits together, a total of 178 significant MTAs were identified across all wheat chromosomes (Tables 1,2,3,4). Sixty-two MTAs were detected on the A genome, 77 MTAs on the B genome, 34 MTAs on the D genome, and five MTAs on unaligned (UN) chromosomes. Chromosome 1B and 7A carried the highest number of MTAs (n = 16), whereas chromosome 4D had only a single MTA. The favorable alleles and their corresponding frequencies are described in Tables 1,2,3,4. There were 12 large-effect markers associated with 11 traits (1–2 markers per trait) (Table 5). For SKCS size, FSV, and cookie diameter, all significant markers had small effects.
Table 5

Allelic effect of large-effect markers on end-use quality traits in soft white wheat

TraitMarkersMAFaAllelesbNcMean BLUEdTukey's teste
SKCS hardnessS5A_4805152210.14CC57225.6A
CT1820.6B
TT8218.9B
S6B_7056137770.08AA1332.8A
AG7524.9B
GG58424.5B
SKCS weightS2B_5331781650.09CC56939.3A
CG8937.9B
GG1435.8B
Test weightS4B_4134979490.09CC60262.1A
CT1561.3AB
TT5560.9B
Grain proteinS1B_468838680.09AA4611.9A
AC1711.8A
CC60911.1B
Break flour yieldS1B_6536817520.14GG49448.3A
GT17147.3B
TT745.5C
Flour yieldS1B_6536817520.14GG49469.6A
GT17168.7B
TT766.9B
S6B_193359960.07AA3867.0B
AG1668.1B
GG61869.5A
Flour ashS4A_1201444120.10CC490.36B
CT290.36B
TT5940.39A
Milling scoreS1D_147077390.08GG55684.7B
GT8486.8A
TT1088.5A
S5A_206405660.09CC57685.2A
CT3185.7A
TT4382.8B
Flour proteinS1B_468838680.09CC6099.2B
CA179.7A
AA469.8A
SDS SedimentationS1D_1219906800.07GG169.4B
GT5810.2AB
TT59810.7A
S1D_4110630680.14AA7611.9A
AT2510.8AB
TT57110.5B
Water SRCS1B_6536817520.14GG49454.0C
GT17155.1B
TT758.9A

aMinor allele frequency

bAllele in bold is the favorable allele in soft white wheat

cNumber of genotypes per marker allele

dBLUE, best linear unbiased estimators

eThe letters represent results of Tukey’s HSD test (treatments with different letters are significantly different at 95% level of confidence)

Allelic effect of large-effect markers on end-use quality traits in soft white wheat aMinor allele frequency bAllele in bold is the favorable allele in soft white wheat cNumber of genotypes per marker allele dBLUE, best linear unbiased estimators eThe letters represent results of Tukey’s HSD test (treatments with different letters are significantly different at 95% level of confidence) For grain characteristics, five MTAs with large effects were detected on chromosome 1B, 2B, 4B, 5A, and 6B (Table 5). Markers S5A_480515221 and S6B_705613777 were associated with SKCS hardness and impacted the hardness index by 6.7 and 7.9 units on average, respectively. Marker S2B_533178165 was associated with SKCS weight and influenced the phenotype by 3.55 mg. For test weight, S4B_413497949 had the largest effect and resulted in 1.2 kg/hL difference in the phenotype on average, whereas for grain protein content, S1B_46883868 had the largest effect with 0.85% increase/decrease in the phenotype. S1B_46883868 was also associated with flour protein content and affected the trait value by 0.63% on average. For milling traits, five markers had large effects and were detected on chromosomes 1B, 1D, 5A, and 6B (Table 5). Marker S1B_653681752 was associated with break flour yield and flour yield and influenced trait values by 2.9% and 2.7% on average, respectively. An additional large effect marker was associated with flour yield: S6B_19335996 affected the trait value by 2.5%. Marker S4A_120144412 was associated with flour ash and influenced the phenotype by 0.03% on average. For milling score, S1D_14707739 and S5A_20640566 affected the phenotype by 3.8 and 2.4 units, respectively. For flour parameters, there were four markers with large effects located on chromosomes 1B, 1D, and 4A (Table 5). In addition to break flour yield, S1B_653681752 also influenced water SRC by 4.9% on average. For SDS sedimentation, two large effect markers were identified including S1D_121990680 and S1D_411063068, which affected the trait values by 1.3 and 1.4 g/mL, respectively. Except for S5A_480515221, S4A_120144412, and S1D_121990680, which were associated with SKCS hardness, flour ash, and SDS sedimentation, the favorable alleles for the remaining nine large effect markers were present in high frequencies (86–93%) in this soft white wheat germplasm.

Loci associated with at least two end-use quality traits

Among the 178 MTAs identified in this study, there were 17 loci associated with more than a single end-use quality trait (Table 6). Among these loci, there were two large-effect markers (S1B_46883868 and S1B_653681752) and ten small-effect markers that were associated with at least two end-use quality traits. For each of these 12 markers, there was desirable linkage between the favorable alleles. This suggests that these markers are having desirable pleiotropic effects and could be useful to simultaneously breed for more than a single end-use quality trait. Based on pairwise LD estimates between physically close markers, there were additionally five loci on chromosomes 1A, 1B, 6B, 7A, and 7B that were associated with more than a single end-use quality trait (Table 6). For each of these loci, LD between significant markers that were associated with different traits was higher than the r threshold of 0.1. For three of these loci, S1A_586706397/S1A_587581129, S6B_27918221/ S6B_29771821 and S7A_730416426/S7A_731026067, there was desirable linkage between marker alleles in each locus, whereas for the other two loci S1B_561712520/S1B_569507932 and S7B_624947199/S7B_636744313, there was unfavorable linkage. Therefore, for the latter two loci, selecting for one trait could negatively affect the other trait.
Table 6

Loci associated with two or more end-use quality traits in 672 soft white winter wheat genotypes

MarkersaSKCS hardnessSKCS sizeSKCS weightTest weightGrain proteinBreak flour yieldFlour yieldFlour ashMilling ScoreFlour proteinSDS sedimentationWater SRCCookie diameter
S1A_586706397/S1A_587581129 ×  × 
S1B_46883868 ×  × 
S1B_561712520/S1B_569507932 ×  × 
S1B_653681752 ×  ×  × 
S2D_563799166 ×  × 
S3B_690878020 ×  × 
S4B_63121316 ×  × 
S5A_382294123 ×  × 
S5B_68052478 ×  × 
S5B_508665777 ×  × 
S6B_27918221/ S6B_29771821 ×  × 
S6B_583281710 ×  × 
S6D_471614981 ×  × 
S7A_730416426/S7A_731026067 ×  × 
S7B_624947199/S7B_636744313 ×  × 
S7D_56065854 ×  ×  × 
SUN_351152321 ×  × 

aMarkers in bold had large effects. Markers before and after the forward slash (/) were in linkage disequilibrium (r ≥ 0.1). S1A_586706397 and S1A_587581129 were associated with cookie diameter and flour yield, respectively, S1B_561712520 and S1B_569507932 were associated with SDS sedimentation and SKCS size, respectively, S6B_27918221 and S6B_29771821 were associated with milling score and water SRC, respectively. S7A_730416426 and S7A_731026067 were associated with flour protein and grain protein content, respectively, and S7B_624947199 and S7B_636744313 were associated with flour ash and test weight, respectively

Loci associated with two or more end-use quality traits in 672 soft white winter wheat genotypes aMarkers in bold had large effects. Markers before and after the forward slash (/) were in linkage disequilibrium (r ≥ 0.1). S1A_586706397 and S1A_587581129 were associated with cookie diameter and flour yield, respectively, S1B_561712520 and S1B_569507932 were associated with SDS sedimentation and SKCS size, respectively, S6B_27918221 and S6B_29771821 were associated with milling score and water SRC, respectively. S7A_730416426 and S7A_731026067 were associated with flour protein and grain protein content, respectively, and S7B_624947199 and S7B_636744313 were associated with flour ash and test weight, respectively

Co-localized MTAs with previously identified end-use quality genes/QTL

A total of 35 annotated genes were located close to the physical positions of the 12 large effect markers. The putative functions of these genes are described in Supplementary Table S3. In addition, we found 13 GWAS MTAs available at the IWGSC sequence repository that were within the genomic regions of the 12 large effect markers identified in our study. These GWAS MTAs from previous studies were associated with thousand kernel weight, test weight, grain fill duration, grain protein content, SDS sedimentation, and grain minerals (Cu and Zn) (Supplementary Table S3). Furthermore, comparative mapping (based on physical positions of molecular markers) between all the 178 identified MTAs from this study and end-use quality QTL/genes from previous genetic studies [2, 4, 6–17, 19–22] showed that 40 MTAs were positioned within genomic regions of previously discovered end-use quality genes/QTL (Supplementary Table S4). Of the 16 identified loci for SKCS hardness in this study, 10 loci were found within genomic regions of previously reported grain hardness QTL (Supplementary Table S4). For instance, SKCS hardness associated markers in this study, S1A_583894689, S3A_690786387, S3B_718850970, S5A_480515221, S6B_130163859, S6B_705613777, and S7A_12011069 were located close to the positions of previously reported grain hardness associated markers/QTL, QGh.caas-1A (~ 575 Mb, [9]), wPt-4725 (709 Mb [12],), QKh.WJ-3B.3 (~ 695 Mb, [8]), S5A_463766631 (464 Mb, [22]), IWB11485 (121 Mb, [2]), S6B_703822990 (704 Mb, [22]), wPt-0744 (0.2 Mb, [12]), respectively. Similarly, S1B_635869100 was positioned 8–13 Mb from Qgh-1B [13], QKh.WY-1B.1b [8], QGh.cass-1B [7], QNhi.hwwgr-1BL [10], and QKH.ksw-1B [21]. On chromosome 5B, S5B_549556474 identified in this study was found close to Qshi.hwwgr-5BL (566–571 Mb, [10]) and a QTL flanked by the SSR marker wmc289 (556 Mb, [11]). In addition, a MTA on chromosome 7B, S7B_643501407, was located at ~ 8 Mb from QKh.WJ-7B.1B [8] and QGh.caas-7B.1b [7]. Our comparative mapping showed the importance of these 10 previously characterized loci in controlling kernel hardness and suggests that the remaining six loci could be novel. Three loci associated with SKCS size were located close to previously identified kernel size QTL (Supplementary Table S4). For instance, S2B_154846350, associated with SKCS size in this study, was located at 4–5 Mb from IWB30179 [2] and QKd.cob-1A (~ 134 Mb, [14]) that were associated with SKCS size and kernel diameter, respectively. Similarly, S2D_563799166 and S5A_578074731 were close to previously identified kernel diameter QTL, QKd.hwwgr-2DL (~ 552 Mb, [10]) and QKD.ksu-5A1 (568 Mb, [21]), respectively. For SKCS weight, S2D_563799166 identified in this study was positioned at ~ 12 Mb from QSkw.hwwgr-2DL (552 Mb, [10]) (Supplementary Table S4). Two of our identified MTAs for test weight were found near previously identified test weight associated markers/QTL. This includes S2A_762292662 located close to IWB35564 (760 Mb, [2]) and S7B_40194878 located at 7–8 Mb from QTW.ksu-7B [21] and IWB54370 [2] (Supplementary Table S4). Four of the associated loci with grain protein content in this study were previously reported (Supplementary Table S4). This includes S1B_633974958, which was positioned close to QGpc.caas-1B.1 (628–638 Mb) and QWgc.caas-1B (~ 628–634 Mb), which were associated with grain protein content and wet gluten content, respectively [7]. Similarly, S4B_63121316 and S5D_543253602 were found close to associated markers with grain protein content gwm368 (60 Mb, [11]) and wPt-9788/wPt-0400 (560 Mb, [12]), respectively. On chromosome 7A, the grain protein content associated marker, S7A_731026067 was 18 Mb from Qgpc.7A.1 (wmc525, [19]), which was associated with both protein content and dry gluten content. For break flour yield, the associated locus tagged by marker S3B_630394456 was at the physical position of IWA6254 (630 Mb) also associated with break flour yield [17]. For flour yield, five of the identified MTAs were mapped close to flour yield associated loci in previous genetic studies (Supplementary Table S4). For instance, S1A_9128313 and S1A_15686346 were in close proximity to QFY.ksu-1A (7 Mb, [21]). S1A_9128313 and S1A_15686346 were also within the genomic regions of the genes TraesCS1A01G010900 (5 Mb) and TraesCS1A01G039600 (16 Mb), respectively. TraesCS1A01G010900 (5 Mb) and TraesCS1A01G039600 were annotated as low molecular weight glutenin subunit and high molecular weight glutenin subunit, respectively. Another MTA on chromosome 1B associated with flour yield, S1B_653681752, was identified close to QFY.ksu-1B (649 Mb) [21]. Similarly, S5A_382294123 and S6B_19335996 were found in proximity to IWB76667 (384 Mb, [2]) and IWA7725 (27 Mb, [17]), respectively. Two flour ash associated loci in this study, S5A_3649534 and S5B_68052478, were positioned close to Qgac.cob-5A (gwm443, 11 Mb) and Qgac.cob-5B.1 (gwm540, 67 Mb), respectively, which were previously identified by El-Feki et al. [14] (Supplementary Table S4). For flour protein, S2D_28763299 was located 15–17 Mb from QGpc.caas-2D [7], which was associated with grain protein content and QWgc.WY-2D.5 [8], which was associated with wet gluten content. Similarly, S7A_730416426 associated with flour protein content in this study was ~ 17 Mb away from qGPC.7A.1 [19], which was associated with grain protein content and dry gluten content (Supplementary Table S4). Two SDS sedimentation associated markers in this study were previously identified (Supplementary Table S4). These loci include S1B_561712520 located close to IWB14950 (558 Mb, [2]) and the gene Glu-B1 (556 Mb) and S5B_539075483, which mapped close to QSsd.caas-5B (527 Mb, [9]). Among water SRC associated markers, S1B_547973154, S1B_653681752, S2B_66559534, and S6B_29771821 were positioned near Glu-B1 (556 Mb), IWB27057 (652 Mb, [2]), IWA820 (44 Mb, [17]), and IWA7725 (27 Mb, [17]), respectively (Supplementary Table S4).

Discussion

This study used historical data that captured a wide range of phenotypic variation for end-use quality within a soft white winter wheat breeding program. Heritability estimates for end-use quality traits were moderate to high except for grain and flour protein content. This suggests that most traits are primarily controlled by genetic factors and that a genotypic selection (as opposed to phenotypic selection) is a rational strategy. This study identified the genetic architecture underlying 14 end-use quality traits among recent breeding lines and cultivars from a soft white winter wheat breeding program. Prior to this study, Jernigan et al. [2] investigated the genetic architecture of end use quality in a set of 480 advanced soft white winter wheat breeding lines and cultivars from Pacific Northwest breeding programs selected from 1992 to 2014. Thus, the germplasm used in this study is different from that used by Jernigan et al. [2]. Consequently, our investigation was expected to corroborate previous and/or discover additional QTL associated with end-use quality in soft white wheat. Identified MTAs in this study as well as genotypes with favorable alleles will be useful for end-use quality improvement in soft white and other types of wheat. The 12 large effect markers can be converted into Kompetitive Allele Specific PCR (KASP) or thermal asymmetric reverse PCR (STARP) markers for use in marker-assisted selection (MAS). Among these large effect markers, S1B_653681752 is useful to breed for higher break flour yield and flour yield and lower water SRC. Similarly, S1B_46883868 is associated with both grain protein content and flour protein. The favorable alleles of nine of the large effect markers were present in high frequencies in this germplasm. This suggested that these markers were under high selection pressure in the soft white wheat breeding program, likely the result of long-term phenotyping and selection, and the pyramiding of favorable alleles across the breeding populations. Based on our comparative mapping, eight of the large effect markers including S1B_46883868, S1D_14707739, S1D_121990680, S1D_411063068, S2B_533178165, S4A_120144412, S4B_413497949, and S5A_20640566 were not reported in previous studies, thus should be prioritized for MAS. Only a few loci were found to have large effects, suggesting that many end-use quality traits have complex genetic architecture and are mainly controlled by several minor genes with small effects. For some traits like SKCS diameter, FSV, and cookie diameter, all identified markers had small effects, suggesting that MAS may not be useful for these traits. Therefore, genomic selection might be a better approach to implement for such traits [27, 55].

Grain characteristics

Grain characteristics greatly influence wheat end-use quality [4, 7, 11, 30, 47]. Grain hardness affects most end-use quality traits including break flour yield, flour yield, flour particle size, starch damage, dough strength, and cookie diameter [27, 56–58]. The variation in grain hardness in the present soft wheat germplasm, like most soft wheat breeding populations, is independent of the puroindolines because wild-type puroindoline genes at the Ha locus are generally fixed. This is consistent that no MTAs were identified on chromosome 5DS in this study. Other grain characteristics including SKCS size, SKCS weight, test weight, and grain protein influence wheat milling performance [28, 30]. SKCS size and SKCS weight were highly correlated in this germplasm (r = 0.8; [27]) and this was reflected in the GWAS in which S2D_563799166 and S6B_583281710 were found to be associated with both traits. Grain protein content is an essential quality trait that affects flour functionality. Unlike bread, soft wheat products often require lower protein levels to minimize gluten formation and mixing strength [5]. The positive correlation (r = 0.4–05; [27]) between grain/flour protein content and SDS sedimentation (a measure of gluten strength) in this germplasm provides further evidence of their direct relationship. However, based on the GWAS, no significant markers were in common between SDS sedimentation and grain/flour protein content. Grain and flour protein were phenotypically correlated in this germplasm [27]. This relationship was also evident in our GWAS in which five markers were associated with both grain and flour protein. Grain and flour protein in this wheat collection had low heritability estimates and high genotype by environment interactions as described by Aoun et al. [27]. Consequently, most markers associated with grain/flour protein in this study had small effects, except for marker S1B_46883868.

Milling traits

Higher break flour yield, flour yield, lower flour ash, and higher milling score are desirable traits in soft wheat. Cultivars with alleles that increase these traits could lead to higher milling performance and thus greater profit for flour millers. Moderate to high heritability estimates and positive correlations among milling traits in this germplasm [27] suggest that genetic gain and simultaneous breeding for these traits is possible. Positive correlations between milling traits were also obvious in our GWAS results. For instance, S1B_653681752 and S5B_508665777 favorable alleles for break flour yield were also associated with higher flour yield. Similarly, S6D_471614981, a favorable allele for flour yield was also associated with higher milling score. Negative correlations between milling score and ash in this germplasm (r = -0.7) were discussed in Aoun et al. [27]. This desirable negative correlation was also reflected in our GWAS in which the S5B_68052478 minor allele was associated with lower ash and higher milling score. We found that S1B_100055026, which was associated with break flour yield, was located close to Glu-B3 gene flanked by the DArT marker wPt-1317 (137 Mb, [14]). Similarly, the flour yield associated marker in this study, S1B_555294134, was located 1 Mb from Glu-B1 (556 Mb). It is well known that glutenin subunit families are major components of wheat endosperm storage proteins and are associated with many end-use quality traits. The presence of break flour yield and flour yield associated loci close to Glu-B1 and Glu-B3 may suggest that there is a genetic association between endosperm storage proteins and endosperm structure as evidenced by Boehm Jr et al. [59]. The composition of the protein matrix surrounding starch granules likely contributes to the mechanical strength of the endosperm.

Flour and baking parameters

Unlike bread, confectionary products require lower gluten strength and water absorption capacity, which were measured using SDS sedimentation and water SRC, respectively. Higher water SRC is in part due to starch damage from milling and non-starch polysaccharides [5, 33, 60] and thus, lower water absorption is preferred as it results in better cookie spread and lower viscosity batters. Three water SRC associated markers co-localized with milling trait associated markers including S1B_653681752, S5A_382294123, and S6B_27918221/ S6B_29771821. Negative correlations between water SRC and milling traits previously discussed by Aoun et al. [27] were also observed in our GWAS results particularly for markers S1B_653681752 and S5A_382294123. Higher FSV is desirable for making some Asian-style noodles [1, 36]. We found that S1A_534055653, which was associated with FSV in our study was near the gene Glu-A1 flanked by the SSR marker wmc312 (511 Mb, [14]). This result suggests genetic correlation between gluten content/strength and FSV. Similar observation was also found for cookie diameter in which its associated marker S1B_573323546 was close to the position of the gene Glu-B1. The FSV associated marker S7D_38000037 from this study was 2 Mb from the waxy locus Wx-D1. The association between S7D_38000037 and any null allele at Wx-D1 is at present unknown, but is unlikely as the known Waxy allele at Wx-D1 is rare [61]. Similarly, we did not identify MTAs for FSV that were close to the locations of the other homoeologous waxy loci Wx-A1 and Wx-B1 which were located on chromosome 7A and 4A, respectively [35, 62]. Mutation/deletion in any of the three waxy loci often results in reduced amylose ‘partial waxy’ wheat which is associated with higher FSV. Therefore, the variation in FSV in this germplasm is likely independent of the waxy loci. As noted above, there were no major QTL identified for cookie baking. As such, alternative genotypic selection strategies such as genomic selection may be more appropriate for this trait.

Conclusion

In this study we investigated the phenotypic and genotypic structure of 14 end-use quality traits in 672 soft white winter wheat breeding lines and cultivars adapted to the Pacific Northwest region of the United States. A total of 178 MTAs were identified across all wheat chromosomes of which 40 MTAs were positioned within genomic regions of previously discovered end-use quality genes/QTL. These results highlight the fact that among the multitude of traits that a wheat breeder selects for, end-use quality is a relatively large proportion. The high heritability of most traits underscores the success of long-term phenotypic selection. Among the identified MTAs, 12 markers had large effects (eight of them were previously uncharacterized) and thus could be prioritized in breeding programs. For example, a relatively manageable number of lines, say, those resulting from head row selection, could be subjected to a single round of genotypic selection to fix the favorable allele at one or more of the large effect loci. Such a strategy could return benefits later on as a greater proportion of lines would meet end-use quality targets during subsequent replicated yield trials. This study also revealed that for some end-use quality traits (SKCS size, FSV, and cookie diameter), only small effect markers were identified, suggesting that these traits are controlled by multiple minor genes in this germplasm, and that alternative selection strategies such as genomic selection could augment traditional and laborious phenotyping. Additional file 1: Supplementary Fig. S1. Principal component(PC) analysis obtained from 40,518 SNPs in 672 soft white winter wheat genotypes. The first two PCs, PC1 and PC2 explaining 5.3% and 4.0% of the variation, respectively. Supplementary Fig. S2. Scatter plot representing the genome-wise linkage disequilibrium (LD) decay. The LD estimate (r2)for pairs of SNPs was plotted against the corresponding physical positions inmega base pairs (Mb) based on IWGSC Wheat Chinese Spring IWGSC RefSeq v1.0. The dashed red line represents the population-specific critical value of r2=0.1.Supplementary Fig. S3. Scatter plot representing the chromosome-wise linkage disequilibrium (LD) decay. The LD estimate (r2) for pairs of SNPs was plotted against the corresponding physical positions in megabase pairs (Mb) based on Wheat Chinese Spring IWGSC RefSeq v1.0. The dashed redline represents the LD population threshold of 0.1. Supplementary Fig. S4. Quantile-Quantile plots of the expected -log10(P) versus the observed -log10(P) for association mapping model MLM for the14 end-use quality traits. Supplementary Fig. S5. Quantile-Quantile plots of the expected -log10 (P) versus the observed -log10(P) for association mapping model FarmCPU for the 14 end-use quality traits. Supplementary Fig. S6. Quantile-Quantile plots of the expected-log10 (P) versus the observed -log10(P) for association mapping model BLINK for the 14 end-use quality traits. Supplementary Fig. S7. Summary of genome-wide association studies for 14 end-use quality traits in 672 soft winter wheat genotypes based on Fixed and random model Circulating Probability Unification model. The horizontal red line indicates significance level at FDR ≤ 0.05. Additional file 2: Table S1. Summary of GWAS for 14 end-use quality in softwhite winter wheat using mixed linear model (MLM). Table S2. Summary of GWAS for 14 end-use quality in soft whitewinter wheat using Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK) model. Table S3. Annotated genes within the genomic regions of large-effect markers identified in this study for end-use quality traits in soft white winter wheat. Table S4. Previously identified QTL within the genomic regions of Marker-Trait-Associations identified in this study.
  23 in total

1.  TASSEL: software for association mapping of complex traits in diverse samples.

Authors:  Peter J Bradbury; Zhiwu Zhang; Dallas E Kroon; Terry M Casstevens; Yogesh Ramdoss; Edward S Buckler
Journal:  Bioinformatics       Date:  2007-06-22       Impact factor: 6.937

2.  Nonstarch polysaccharides in wheat flour wire-cut cookie making.

Authors:  Mary J Guttieri; Edward J Souza; Clay Sneller
Journal:  J Agric Food Chem       Date:  2008-11-26       Impact factor: 5.279

3.  Efficient methods to compute genomic predictions.

Authors:  P M VanRaden
Journal:  J Dairy Sci       Date:  2008-11       Impact factor: 4.034

4.  Technical note: an R package for fitting generalized linear mixed models in animal breeding.

Authors:  A I Vazquez; D M Bates; G J M Rosa; D Gianola; K A Weigel
Journal:  J Anim Sci       Date:  2009-10-09       Impact factor: 3.159

5.  Mapping quantitative trait loci for quality factors in an inter-class cross of US and Chinese wheat.

Authors:  Xiaochun Sun; Felix Marza; Hongxiang Ma; Brett F Carver; Guihua Bai
Journal:  Theor Appl Genet       Date:  2009-12-13       Impact factor: 5.699

6.  Rapid classification of partial waxy wheats using PCR-based markers.

Authors:  T Nakamura; P Vrinten; M Saito; M Konda
Journal:  Genome       Date:  2002-12       Impact factor: 2.166

7.  Shifting the limits in wheat research and breeding using a fully annotated reference genome.

Authors: 
Journal:  Science       Date:  2018-08-16       Impact factor: 47.728

8.  Genome-wide association mapping of the 'super-soft' kernel texture in white winter wheat.

Authors:  Meriem Aoun; Arron H Carter; Brian P Ward; Craig F Morris
Journal:  Theor Appl Genet       Date:  2021-05-30       Impact factor: 5.699

Review 9.  Wheat quality improvement at CIMMYT and the use of genomic selection on it.

Authors:  Carlos Guzman; Roberto Javier Peña; Ravi Singh; Enrique Autrique; Susanne Dreisigacker; Jose Crossa; Jessica Rutkoski; Jesse Poland; Sarah Battenfield
Journal:  Appl Transl Genom       Date:  2016-10-29

10.  Deciphering the Genetics of Major End-Use Quality Traits in Wheat.

Authors:  Sepehr Mohajeri Naraghi; Senay Simsek; Ajay Kumar; S M Hisam Al Rabbi; Mohammed S Alamri; Elias M Elias; Mohamed Mergoum
Journal:  G3 (Bethesda)       Date:  2019-05-07       Impact factor: 3.154

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.