| Literature DB >> 32205398 |
Alice H MacQueen1, Jeffrey W White2, Rian Lee3, Juan M Osorno3, Jeremy Schmutz4, Phillip N Miklas5, Jim Myers6, Phillip E McClean3, Thomas E Juenger7.
Abstract
Multienvironment trials (METs) are widely used to assess the performance of promising crop germplasm. Though seldom designed to elucidate genetic mechanisms, MET data sets are often much larger than could be duplicated for genetic research and, given proper interpretation, may offer valuable insights into the genetics of adaptation across time and space. The Cooperative Dry Bean Nursery (CDBN) is a MET for common bean (Phaseolus vulgaris) grown for > 70 years in the United States and Canada, consisting of 20-50 entries each year at 10-20 locations. The CDBN provides a rich source of phenotypic data across entries, years, and locations that is amenable to genetic analysis. To study stable genetic effects segregating in this MET, we conducted genome-wide association studies (GWAS) using best linear unbiased predictions derived across years and locations for 21 CDBN phenotypes and genotypic data (1.2 million SNPs) for 327 CDBN genotypes. The value of this approach was confirmed by the discovery of three candidate genes and genomic regions previously identified in balanced GWAS. Multivariate adaptive shrinkage (mash) analysis, which increased our power to detect significant correlated effects, found significant effects for all phenotypes. Mash found two large genomic regions with effects on multiple phenotypes, supporting a hypothesis of pleiotropic or linked effects that were likely selected on in pursuit of a crop ideotype. Overall, our results demonstrate that statistical genomics approaches can be used on MET phenotypic data to discover significant genetic effects and to define genomic regions associated with crop improvement.Entities:
Keywords: common bean (Phaseolus vulgaris); crop improvement; genetics of diversification; genome-wide association; multienvironment trial
Mesh:
Year: 2020 PMID: 32205398 PMCID: PMC7198278 DOI: 10.1534/genetics.120.303038
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Figure 1The CDBN is an extensive multienvironment trial with hundreds of germplasm entries for common bean (P. vulgaris) grown at 75 locations over 35 years. The relationships between germplasm entries (first row and column), locations (second row and column), and years (third row and column) used in the CDBN are shown in the grid. Purple represents genotyped germplasm entries from the Durango race of the Mesoamerican gene pool, blue represents genotyped entries in the Mesoamerican race of the Mesoamerican gene pool, green represents genotyped entries in the Nueva Granada race of the Andean gene pool, and yellow represents entries that were not genotyped. Beans of the two most commonly genotyped market classes from each race are pictured to the right of the color key, and the number of entries of each race that were genotyped is displayed to the left of the color key. On the maps of CDBN locations, pie chart size is scaled relative to the total number of entries grown at that location, and circle saturation represents locations with a greater number of years of data available. CDBN, Cooperative Dry Bean Nursery.
Figure 2Phenotypic data available in the CDBN, and genetic variation within the two of these phenotypes with the most data. (A) Details of 21 phenotypes present in the CDBN; BLUPs from these phenotypes were used for genome-wide association. Black circles left of the y-axis indicate phenotypes with one or more genetic associations that had P-values above the Benjamini–Hochberg false discovery rate. (B) Manhattan plot of BLUPs for seed yield (kg/ha) from the CDBN data. The dashed lines are the cutoff values for peak significance. SNPs above the Benjamini–Hochberg false discovery rate are above the black line, and those in the 0.001 percentile are above the gray line. (C) Quantile–quantile plot of the goodness of fit of the model on seed yield. (D) Manhattan plot of BLUPs for seed weight (mg) from the CDBN data, as in (B). (E) Quantile–quantile plot of the goodness of fit of the model on seed weight. Distribution of the BLUPs for (F) seed yield and (G) seed weight. Black represents the presence of the reference allele for the top SNP labeled in panels (B) and (D), respectively, and gray represents the presence of the alternate allele. BCMV, bean common mosaic virus; BLUP, best linear unbiased predictor; CBB, common bacterial blight; CDBN, Cooperative Dry Bean Nursery; CTV, curly top virus.
BLUP statistics, with phenotypes ordered as in Figure 2
| Phenotype | Va | Ve | BLUP range | Units | |
|---|---|---|---|---|---|
| Seed yield | 53,173 | 222,409 | 0.193 | 1,727 | kg ha−1 |
| Seed weight | 2,340 | 1,076 | 0.685 | 443 | mg |
| Days to maturity | 12.3 | 18.3 | 0.402 | 17.0 | Days |
| Days to flowering | 4.68 | 6.11 | 0.434 | 11.7 | Days |
| Seed fill duration | 5.584 | 17.812 | 0.239 | 12.7 | Days |
| Lodging score | 0.244 | 0.466 | 0.344 | 2.97 | 1–5 scale |
| Harvest index | 13.5 | 31.5 | 0.299 | 20.1 | % |
| Plant height | 13.8 | 42.5 | 0.245 | 18.1 | cm |
| Biomass | 2.46E+05 | 6.72E+05 | 0.268 | 2,990 | kg |
| Growth habit | 0.160 | 0.154 | 0.509 | 2.19 | 1–3 scale |
| Seed appearance score | 0.017 | 0.241 | 0.067 | 0.346 | 1–3 scale |
| CBB damage score | 0.282 | 1.127 | 0.200 | 2.32 | 1–9 scale |
| Rust damage score | 3.201 | 1.957 | 0.621 | 7.35 | 1–9 scale |
| Early vigor score | 0.064 | 0.747 | 0.078 | 0.957 | 1–9 scale |
| White mold damage score | 0.143 | 0.631 | 0.185 | 2.60 | 1–5 scale |
| CTV presence/absence | 0.025 | 0.132 | 0.157 | 0.432 | 0–1 scale |
| Halo blight damage score | 0.176 | 0.708 | 0.199 | 1.12 | 1–5 scale |
| BCMV blackroot response | 0.049 | 0.086 | 0.363 | 0.843 | 0–1 scale |
| BCMV presence/absence | 0.016 | 0.097 | 0.145 | 0.428 | 0–1 scale |
| Root rot damage score | 0.455 | 3.193 | 0.125 | 2.25 | 1–9 scale |
| Zinc deficiency damage score | 3.831 | 1.395 | 0.733 | 8.96 | 1–9 scale |
BCMV, bean common mosaic virus; BLUP, best linear unbiased predictor; CBB, common bacterial blight; CTV, curly top virus; REML, restricted maximum likelihood.
Va is the REML estimate of the additive genetic variance from rrBLUP.
Ve is the REML estimate of the error variance.
h2 is narrow-sense heritability, defined as Va / (Va + Ve).
Major associations in GWAS from phenotypes from the CDBN and from previously published GWAS
| FDR | Trait | Study | Chr | Position in v2.0 | Candidate gene |
|---|---|---|---|---|---|
| Plant height (cm) | This study | 1 | 6.13 | ||
| FDR | Growth habit | This study | 1 | 6.28 | |
| Biomass (kg) | This study | 1 | 6.49 | ||
| Lodging score | Resende | 1 | 13.76 | ||
| Growth habit | Resende | 1 | 13.76 | ||
| Days to flowering | 1 | 13.76 | |||
| FDR | Days to flowering | This study | 1 | 13.45–15.36 | |
| Root rot damage | 1 | 23.92 | |||
| Days to flower | 1 | 27.68 | |||
| Root rot damage | 1 | 33.03 | |||
| Halo blight damage score | This study | 1 | 36.72 | ||
| Root rot damage | 1 | 37.20 | |||
| Plant height (cm) | This study | 1 | 38.74 | ||
| Root rot damage | 1 | 40.20 | |||
| FDR | Growth habit | This study | 1 | 42.17 | |
| FDR | Seed yield | This study | 1 | 42.23 | |
| Growth habit | 1 | 42.23 | |||
| Biomass (kg) | This study | 1 | 42.27 | ||
| Growth habit | 1 | 44.80 | |||
| Growth habit | 1 | 44.80 | |||
| Days to flowering | 1 | 47.07 | |||
| Days to flowering | 1 | 48.86 | |||
| Days to flowering | 1 | 49.65 | |||
| Halo blight damage score | This study | 2 | 16.17 | ||
| FDR | Seed weight | This study | 2 | 30.38 | |
| Days to flowering | 2 | 38.07 | |||
| Halo blight damage score | 2 | 49.08 | |||
| FDR | Harvest index (%) | This study | 3 | 2.16 | |
| FDR | Seed weight | This study | 3 | 4.40 | |
| Days to maturity | This study | 3 | 15.75 | ||
| Days to maturity | This study | 3 | 32.04 | ||
| Harvest index (%) | This study | 3 | 36.82 | ||
| Days to maturity | This study | 3 | 36.83 | ||
| Seed yield | 3 | 37.60 | |||
| Days to flowering | 3 | 40.27 | |||
| Days to flowering | 3 | 41.09 | |||
| Harvest index (%) | 3 | 46.70 | |||
| Harvest index (%) | 3 | 47.17 | |||
| Days to flower | 3 | 47.35 | |||
| Seed yield | Resende | 3 | 49.28–50.33 | ||
| Days to flowering and days to maturity | 3 | 51.48 | |||
| Days to flowering and days to maturity | 3 | 52.32 | |||
| Days to flowering and days to maturity | 3 | 52.6 | |||
| Halo blight damage score | 4 | 0.55–1.899 | |||
| Days to maturity | 4 | 1.94 | |||
| FDR | Lodging score | This study | 4 | 2.87 | |
| Growth habit indeterminate | 4 | 3.20 | |||
| Halo blight damage score | MacQueen | 4 | 6.79 | ||
| Days to flowering | 4 | 16.37 | |||
| Days to flowering | 4 | 36.88 | |||
| Halo blight damage score | 4 | 46.20 | |||
| Days to flowering and days to maturity | 4 | 46.33 | |||
| Days to flowering and days to maturity | 4 | 47.06 | |||
| Halo blight damage score | This study | 5 | 13.25 | ||
| Halo blight damage score | 5 | 39.00 | |||
| Root rot damage | 6 | 0.57 | |||
| Root rot damage | 6 | 5.75 | |||
| Root rot damage | 6 | 6.89 | |||
| Root rot damage | 6 | 8.16 | |||
| Root rot damage | 6 | 12.20 | |||
| Root rot damage | 6 | 17.85 | |||
| Plant height (cm) | This study | 6 | 20.89 | ||
| Growth habit indeterminate | 6 | 29.92 | |||
| Days to flowering | 6 | 31.60 | |||
| Days to maturity | This study | 7 | 1.15 | ||
| Growth habit indeterminate | 7 | 34.12 | |||
| Lodging score | 7 | 34.20 | |||
| FDR | Lodging score | This study | 7 | 33.60–34.51 | |
| Plant height (cm) | 7 | 34.20 | |||
| Growth habit indeterminate | 7 | 34.20 | |||
| Biomass (kg) | This study | 7 | 35.74 | ||
| Seed weight | 8 | 1.10 | |||
| Root rot damage score | This study | 8 | 1.34 | ||
| Days to flowering | 8 | 4.93 | |||
| Biomass (kg) | 8 | 6.86 | |||
| Biomass (kg) | 8 | 7.60 | |||
| Root rot damage | 8 | 15.26 | |||
| Root rot damage | 8 | 17.72 | |||
| Days to flowering and days to maturity | 8 | 24.95 | |||
| Days to flowering | 8 | 26.40 | |||
| Halo blight damage score | 8 | 61.34 | |||
| Root rot damage score | This study | 8 | 61.98 | ||
| Halo blight damage score | This study | 9 | 5.42 | ||
| Days to maturity | This study | 9 | 5.85 | ||
| Seed yield | 9 | 10.00 | |||
| Plant height (cm) | This study | 9 | 27.98 | ||
| FDR | Growth habit | This study | 9 | 30.93 | |
| Biomass (kg) | 10 | 0.60 | |||
| Seed weight | 10 | 2.60 | |||
| Root rot damage | 10 | 16.40 | |||
| Root rot damage | 10 | 19.51 | |||
| Root rot damage | 10 | 25.44 | |||
| Root rot damage | 10 | 29.88 | |||
| Biomass (kg) | 10 | 36.45 | |||
| Halo blight damage score | This study | 10 | 41.52 | ||
| Days to flowering | 10 | 42.50 | |||
| FDR | Growth habit | This study | 10 | 42.79 | |
| Biomass (kg) | 11 | 1.59 | |||
| Days to flowering | 11 | 4.02 | |||
| Days to maturity | 11 | 4.46 | |||
| Days to flowering | 11 | 10.66 | |||
| days to maturity | MacQueen | 11 | 14.8 | ||
| Root rot damage | 11 | 26.41 | |||
| Days to flowering | 11 | 27.27 | |||
| Days to flowering | 11 | 36.22 | |||
| Root rot damage | 11 | 41.36 | |||
| Days to maturity | 11 | 45.09 | |||
| Days to flowering | 11 | 45.29 | |||
| Growth habit indeterminate | 11 | 46.75 | |||
| Days to flowering | 11 | 47.3 | |||
| Root rot damage | 11 | 50.59 | |||
| FDR | Rust | This study | 11 | 50.67 | |
| Rust | Phil McClean | 11 | 50.67 | ||
| Biomass (kg) | 11 | 52.55 |
Chr, chromosome; FDR, false discovery rate; GWAS, genome-wide association study; CDBN, common dry bean nursery.
FDR indicates associations from this paper, which were above the Benjamini–Hochberg FDR correction.
Positions (in Mb) indicate associations in more than one published GWAS that are within 200 kb of one another.
Associations in more than one published GWAS, which fall within 20 kb of one another and have the same candidate gene.
Figure 3Patterns of phenotypic effects of genetic associations for 22 phenotypes from the CDBN, determined using multivariate adaptive shrinkage. (A) SNPs with significant effects on 1 or more of the 22 phenotypes in the CDBN. (B) Manhattan plot of the Bayes factor (log10) comparing the model likelihood that the SNP has significant effects to the likelihood that it has no significant effects. Bayes factors of > 102 are considered decisive evidence in favor of the alternate model. Point color represents the number of phenotypes for which the SNP has a local false-sign rate < 0.05. Squares represent even chromosomes, while circles represent odd chromosomes. The top associations for three regions of the genome are highlighted. (C) Correlation in the sign and magnitude of significant effects in all pairwise comparisons of the 22 CDBN phenotypes. Circle size and color indicate the fraction of all significant SNPs that have the same effect sign and similar effect magnitude. (D–F) Effect estimates and standard errors for 22 phenotypes for the top associations from three regions of the genome, (D) P. vulgaris chromosome 1 (Pv01) at 15.4 Mb, (E) Pv01 at 42.2 Mb, and (F) Pv07 at 14.5 Mb. Genomic locations are based on the P. vulgaris v2.1 genome annotation. Standard error bars are colored by the six groups present in (C). CBB, common bacterial blight; CDBN, Cooperative Dry Bean Nursery; CTV, curly top virus.