| Literature DB >> 35684219 |
Grant T Billings1,2, Michael A Jones3, Sachin Rustgi3, William C Bridges4, James B Holland1,5, Amanda M Hulse-Kemp1,2,6, B Todd Campbell7.
Abstract
Researchers have used quantitative genetics to map cotton fiber quality and agronomic performance loci, but many alleles may be population or environment-specific, limiting their usefulness in a pedigree selection, inbreeding-based system. Here, we utilized genotypic and phenotypic data on a panel of 80 important historical Upland cotton (Gossypium hirsutum L.) lines to investigate the potential for genomics-based selection within a cotton breeding program's relatively closed gene pool. We performed a genome-wide association study (GWAS) to identify alleles correlated to 20 fiber quality, seed composition, and yield traits and looked for a consistent detection of GWAS hits across 14 individual field trials. We also explored the potential for genomic prediction to capture genotypic variation for these quantitative traits and tested the incorporation of GWAS hits into the prediction model. Overall, we found that genomic selection programs for fiber quality can begin immediately, and the prediction ability for most other traits is lower but commensurate with heritability. Stably detected GWAS hits can improve prediction accuracy, although a significance threshold must be carefully chosen to include a marker as a fixed effect. We place these results in the context of modern public cotton line-breeding and highlight the need for a community-based approach to amass the data and expertise necessary to launch US public-sector cotton breeders into the genomics-based selection era.Entities:
Keywords: GWAS; cotton breeding; cotton yield; fiber quality; genomic prediction
Year: 2022 PMID: 35684219 PMCID: PMC9182660 DOI: 10.3390/plants11111446
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Genomic relationship matrix among the 80 lines in this study. The deeper red colors indicate a higher pairwise relationship between lines at the whole-genome level. The two large blocks in the top left consist of (1, red arrow) half- and full-sib relatives with PD695 or PD875 parents; and (2, black arrow) lines selected out of the initial gene pool from the Pee Dee breeding program in the first two breeding cycles. Breeding cycles are outlined in Campbell et al. (2011) [8].
Broad-sense heritabilities and their standard errors for 20 cotton traits. Overall is calculated using data from all 14 year–locations, and Environment is the mean of heritabilities and standard errors calculated for all environments separately.
| Overall | (Mean from up to 14 Tests in Separate Environments) | Difference in H2 (Overall-Environment) | ||||
|---|---|---|---|---|---|---|
| Trait Type | Trait | Broad-Sense Heritability (H2) | Standard | Broad-Sense Heritability (H2) | Standard | |
| Advanced Fiber | Fineness | 0.84 | 0.03 | 0.37 | 0.11 | 0.47 |
| Length Top 25% Mean by Wt | 0.96 | 0.01 | 0.79 | 0.05 | 0.17 | |
| Length Mean by Number | 0.92 | 0.01 | 0.47 | 0.11 | 0.45 | |
| Immature Fiber Content | 0.22 | 0.13 | 0.28 | 0.11 | −0.06 | |
| Maturity Ratio | 0.75 | 0.04 | 0.32 | 0.12 | 0.43 | |
| Short Fiber Content by Wt | 0.86 | 0.03 | 0.41 | 0.09 | 0.45 | |
| High | Elongation | 0.93 | 0.01 | 0.40 | 0.11 | 0.53 |
| Upper Half Mean Length | 0.96 | 0.01 | 0.64 | 0.08 | 0.32 | |
| Strength | 0.96 | 0.01 | 0.69 | 0.07 | 0.27 | |
| Micronaire | 0.88 | 0.02 | 0.68 | 0.07 | 0.20 | |
| Short Fiber Content | 0.80 | 0.03 | 0.47 | 0.10 | 0.33 | |
| Uniformity Index | 0.88 | 0.02 | 0.40 | 0.11 | 0.48 | |
| Seed | Seed Oil % | 0.88 | 0.02 | 0.50 | 0.09 | 0.38 |
| Seed Protein % | 0.27 | 0.14 | 0.12 | 0.14 | 0.15 | |
| Yield | Bolls/m−2, from Seed Cotton | 0.65 | 0.06 | 0.35 | 0.15 | 0.30 |
| Boll Wt, from Seed Cotton | 0.71 | 0.05 | 0.27 | 0.13 | 0.44 | |
| Gin Turnout | 0.75 | 0.01 | 0.69 | 0.07 | 0.27 | |
| Lint Yield | 0.75 | 0.04 | 0.52 | 0.12 | 0.23 | |
| Seed Cotton Yield | 0.55 | 0.08 | 0.48 | 0.12 | 0.07 | |
| Seed Index | 0.93 | 0.01 | 0.60 | 0.08 | 0.33 | |
|
|
|
|
|
|
| |
AFIS traits are highlighted in teal, HVI traits in yellow, seed traits in purple, and yield traits in red. Wt = weight.
Number of significant marker–trait associations discovered by genome-wide association study. “-” indicates no hits from GWAS or GWAS was not run due to near-zero heritability.
| Trait Type | Trait | Overall Means | Blackville | Florence | Hartsville | Rocky Mount | Stoneville | Tifton | Average per Trait | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2004 | 2005 | 2006 | 2004 | 2005 | 2006 | 2004 | 2005 | 2005 | 2006 | 2005 | 2006 | 2005 | 2006 | ||||
| Advanced | Fineness | 657 | 292 | 8 | 30 | 34 | 238 | 96 | - | 2 | 240 | 1 | 35 | 18 | - | 36 |
|
| Length Top 25% Mean by Wt | 212 | 762 | 41 | - | 407 | 57 | 59 | 280 | 197 | 419 | 2 | 64 | 123 | 9 | 242 |
| |
| Length Mean by Number | - | 74 | - | 5 | 24 | 9 | - | - | 53 | 174 | 1 | - | - | 9 | 141 |
| |
| Immature Fiber Content | - | 88 | - | 39 | 17 | 187 | 199 | 164 | 10 | 52 | 58 | 78 | 9 | 184 | 112 |
| |
| Maturity Ratio | 3 | 4 | 58 | 18 | - | 300 | 32 | 41 | - | 110 | 9 | 65 | 13 | - | 150 |
| |
| Short Fiber Content by Wt | 1 | 1 | 5 | 72 | 21 | 19 | 112 | 168 | 44 | 125 | - | - | 9 | 9 | 37 |
| |
| High | Elongation | - | 18 | - | 201 | 434 | 8 | - | - | 3 | 19 | - | - | 15 | - | 55 |
|
| Upper Half Mean Length | 471 | 163 | 10 | 76 | 585 | 54 | - | 451 | 193 | 12 | 278 | 203 | 108 | 18 | 281 |
| |
| Strength | 284 | 41 | 15 | 30 | 112 | 15 | 570 | 35 | 41 | 47 | - | 364 | 535 | 31 | 80 |
| |
| Micronaire | 769 | 280 | 10 | 152 | 18 | - | 66 | - | 353 | 743 | 591 | 48 | 192 | 681 | 228 |
| |
| Short Fiber Content | 58 | 40 | 10 | 4 | 123 | 307 | 269 | 231 | - | - | 3 | 167 | 29 | 426 | - |
| |
| Uniformity Index | 17 | 30 | 3 | - | 26 | 726 | - | - | 3 | - | - | 1 | 26 | 112 | - |
| |
| Seed | Seed Oil % | 180 | 89 | - | 155 | 161 | 33 | 151 | - | 89 | 129 | 156 | - | 230 | 96 | - |
|
| Seed Protein % | 104 | 21 | - | 16 | - | 299 | 119 | - | 13 | - | 454 | - | 3 | 24 | - |
| |
| Yield | Bolls/m2, from Seed Cotton | 91 | 38 | - | 2 | 150 | - | - | 9 | - | 6 | - | 6 | 381 | 47 | 440 |
|
| Boll Wt, from Seed Cotton | 9 | 105 | - | 10 | 29 | - | 3 | 33 | 7 | 22 | 34 | 30 | - | 4 | 460 |
| |
| Gin Turnout | 301 | 276 | 2 | - | 133 | 3 | 53 | 628 | 27 | 349 | 897 | 100 | 9 | - | 603 |
| |
| Lint Yield | 1007 | 9 | 38 | 201 | 192 | - | - | 1372 | - | - | - | 29 | 628 | 28 | 334 |
| |
| Seed Cotton Yield | 116 | 75 | 22 | 161 | 92 | - | - | 785 | - | 3 | - | 5 | 469 | 38 | 67 |
| |
| Seed Index | 605 | 831 | 11 | 15 | 1 | - | 278 | - | 168 | - | 732 | 237 | 8 | 1 | 815 |
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
Location abbreviations—BL: Blackville, SC; FL: Florence, SC; HV: Hartsville, SC; RM: Rocky Mount, NC; ST: Stoneville, MS; TFT: Tifton, Georgia. AFIS traits are highlighted in teal, HVI traits in yellow, seed traits in purple, and yield traits in red. Wt = weight.
Comparison of prediction ability building the model using the overall means and predicting individual environment means.
| Prediction Ability Compared to | |||||
|---|---|---|---|---|---|
| Within Location, Different Year | Within Year, | Different Year and Location | Interpretation | ||
| Advanced | Fineness | 95% | 84% | 94% | Within location is easier |
| Length Mean by Number | 100% | 88% | 96% | Within location is easier | |
| Length Top 25% Mean by Wt | 100% | 90% | 98% | Within location is easier | |
| Immature Fiber Content | 95% | 63% | 92% | Within location is easier | |
| Maturity Ratio | 93% | 67% | 91% | Within location is easier | |
| Short Fiber Content by Wt | 84% | 72% | 82% | Within location is easier | |
| High | Elongation | 95% | 92% | 94% | Within location is easier |
| Upper Half Mean Length | 100% | 90% | 98% | Within location is easier | |
| Strength | 100% | 95% | 97% | Within location is easier | |
| Micronaire | 67% | 65% | 62% | Within location is easier | |
| Short Fiber Content | 36% | 71% | 47% | Within year is easier | |
| Uniformity Index | 96% | 88% | 93% | Within location is easier | |
| Seed | Seed Oil % | 100% | 76% | 100% | Within location is easier |
| Seed Protein % | 56% | 30% | 5% | Within location is easier | |
| Yield | Bolls/m2, from Seed Cotton | 1% | 48% | 29% | Within year is easier |
| Boll Wt, from Seed Cotton | 18% | 100% | 81% | Within year is easier | |
| Gin Turnout | 98% | 96% | 95% | Within location is easier | |
| Lint Yield | 49% | 68% | 59% | Within year is easier | |
| Seed Cotton Yield | 13% | 44% | 34% | Within year is easier | |
| Seed Index | 96% | 99% | 99% | Within year is easier | |
“Within Location, Different Year”—training and testing the model in different field trials, restricted to only those comparisons that took place at the same location but in different years; “Within Year, Different Location”—training and testing the model in different field trials, restricted to only those comparisons that occurred in the same year but in different locations; and “Different Year and Location”—training and testing the model in different field trials, restricted to only those comparisons that took place in both different years and locations. AFIS traits are highlighted in teal, HVI traits in yellow, seed traits in purple, and yield traits in red. Wt = weight.
Figure 2The number of year–location environments each SNP-trait association was discovered in. Results are shown for markers identified with at least 5 of 100 resamples of the association test with FDR-corrected p-value < 20%. The y-axis is plotted with a log 10 transformation. AFIS traits are highlighted in teal, HVI traits in yellow, seed traits in purple, and yield traits in red.
Figure 3Prediction ability for the genomic best linear unbiased predictor (GBLUP) models trained and tested on the overall means. Error bars are standard errors. The red diamond corresponds to the theoretical maximum prediction ability, estimated as the square root of the trait heritability. AFIS: Advanced Fiber Information System; HVI: High Volume Instrument. AFIS traits are highlighted in teal, HVI traits in yellow, seed traits in purple, and yield traits in red.
Figure 4Prediction ability within 14 individual year–location environment means and overall means for 20 fiber, agronomic, and seed traits. The red solid line corresponds to r = 0, the black dashed line is r = 0.5, and the green dashed/dotted line is r = 0.7. AFIS traits are highlighted in teal, HVI traits in yellow, seed traits in purple, and yield traits in red. Wt = weight.
Figure 5Histograms of change in prediction ability due to inclusion of genome-wide association study (GWAS) hits as fixed effects in the genomic prediction model at differing significance thresholds. FDR: Benjamini–Hochberg False Discovery Rate p-value correction. RMIP: resample model inclusion probability.
Overview of data available for each year–location.
| Location | Year | Comments |
|---|---|---|
| Blackville, SC | 2004 | ✔ |
| 2005 | No seed oil % or seed protein % | |
| 2006 | ✔ | |
| Florence, SC | 2004 | ✔ |
| 2005 | ✔ | |
| 2006 | No bolls m−2, boll weight, lint, or seed cotton yield | |
| Hartsville, SC | 2004 | No seed oil % or seed protein % |
| 2005 | No bolls m−2, boll weight, lint, or seed cotton yield | |
| Rocky Mount, NC | 2005 | No high-volume instrument short fiber content |
| 2006 | ✔ | |
| Stoneville, MS | 2005 | No seed oil % or seed protein % |
| 2006 | ✔ | |
| Tifton, GA | 2005 | ✔ |
| 2006 | ✔ |
“✔” observations available for all traits.