| Literature DB >> 30455185 |
Nicholas Santantonio1, Jean-Luc Jannink2,3, Mark Sorrells2.
Abstract
Whole genome duplications have played an important role in the evolution of angiosperms. These events often occur through hybridization between closely related species, resulting in an allopolyploid with multiple subgenomes. With the availability of affordable genotyping and a reference genome to locate markers, breeders of allopolyploids now have the opportunity to manipulate subgenomes independently. This also presents a unique opportunity to investigate epistatic interactions between homeologous orthologs across subgenomes. We present a statistical framework for partitioning genetic variance to the subgenomes of an allopolyploid, predicting breeding values for each subgenome, and determining the importance of inter-genomic epistasis. We demonstrate using an allohexaploid wheat breeding population evaluated in Ithaca, NY and an important wheat dataset from CIMMYT previously shown to demonstrate non-additive genetic variance. Subgenome covariance matrices were constructed and used to calculate subgenome interaction covariance matrices for variance component estimation and genomic prediction. We propose a method to extract population structure from all subgenomes at once before covariances are calculated to reduce collinearity between subgenome estimates. Variance parameter estimation was shown to be reliable for additive subgenome effects, but was less reliable for subgenome interaction components. Predictive ability was equivalent to current genomic prediction methods. Including only inter-genomic interactions resulted in the same increase in accuracy as modeling all pairwise marker interactions. Thus, we provide a new tool for breeders of allopolyploid crops to characterize the genetic architecture of existing populations, determine breeding goals, and develop new strategies for selection of additive effects and fixation of inter-genomic epistasis.Entities:
Keywords: Allopolyploidy; Epistasis; Genomic prediction; Heterosis
Mesh:
Year: 2019 PMID: 30455185 PMCID: PMC6404612 DOI: 10.1534/g3.118.200613
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Means (μ) and standard deviations (σ) of four traits in the CNLM population
| units | |||
|---|---|---|---|
| GY | kg ha | 5315.20 | 1015.76 |
| PH | cm | 90.84 | 11.99 |
| HD | Julian days | 151.64 | 3.87 |
| TW | g L | 74.95 | 3.09 |
Figure A1Distribution of minor allele frequencies for 11,604 GBS markers in the CNLM population.
Figure 1Estimates and standard errors of variance components from the full model (red) compared to the sampling distribution of variance component estimates from the cross-validation scheme (black violins). Two traits from the CNLM population, (A) PH and (B) HD, with contrasting genetic architectures are shown. GG (5) and ABDABD (7) models are shown to the left and right of the dotted line, respectively. The sum of the additive and epitstatic variance components is also shown for the ABDABD model.
Figure 2Plot of whole genome additive effects (GEBV) by subgenome additive effects (SGEBV) for GY in the CNLM populations. The dotted line indicates the 95% quantiles for whole or subgenome effects. Blue squares, triangles and diamonds indicate the line with the highest SGEBV for each of the A, B and D subgenomes, respectively.
Table of genomic prediction accuracies for eight traits in the CNLM (GY, PH, TW and HD) or W-GY (E1, E2, E3, E4) populations with and . k is the number of principal components removed from the marker matrix prior to calculating subgenome covariance matrices. The first 1 to k principal components were included as fixed effects in the model fit for
| CNLM | GY | PH | TW | HD | |
|---|---|---|---|---|---|
| G (2) | 0 | 0.601 | 0.559 (0.007) | 0.515 (0.010) | 0.664 (0.009) |
| ABD (4) | 0.600 (0.008) | 0.557 (0.008) | 0.514 (0.011) | 0.679 (0.007) | |
| G | 0.604 (0.008) | 0.637 (0.004) | 0.576 (0.010) | 0.712 (0.008) | |
| ABD | 0.603 (0.008) | 0.638 (0.005) | 0.569 (0.011) | 0.720 (0.006) | |
| G | 5 | 0.600 (0.009) | 0.558 (0.007) | 0.514 (0.011) | 0.663 (0.010) |
| ABD | 0.600 (0.009) | 0.556 (0.008) | 0.513 (0.011) | 0.678 (0.008) | |
| G | 0.602 (0.008) | 0.624 (0.005) | 0.560 (0.010) | 0.701 (0.008) | |
| ABD | 0.602 (0.007) | 0.618 (0.005) | 0.557 (0.010) | 0.708 (0.006) | |
| W-GY | E1 | E2 | E3 | E4 | |
| G | 0 | 0.501 (0.010) | 0.493 (0.016) | 0.356 (0.008) | 0.457 (0.010) |
| ABD | 0.492 (0.012) | 0.481 (0.023) | 0.346 (0.010) | 0.449 (0.011) | |
| G | 0.568 (0.010) | 0.494 (0.017) | 0.396 (0.013) | 0.520 (0.010) | |
| ABD | 0.549 (0.011) | 0.484 (0.023) | 0.393 (0.015) | 0.509 (0.013) | |
| G | 5 | 0.502 (0.010) | 0.491 (0.017) | 0.354 (0.007) | 0.458 (0.010) |
| ABD | 0.495 (0.011) | 0.475 (0.024) | 0.345 (0.010) | 0.453 (0.011) | |
| G | 0.526 (0.010) | 0.491 (0.017) | 0.381 (0.007) | 0.493 (0.011) | |
| ABD | 0.520 (0.012) | 0.475 (0.023) | 0.373 (0.013) | 0.486 (0.012) |
Equation.
Mean genomic prediction accuracy across ten replicates of five fold cross validation.
Standard deviation of prediction accuracy across ten replicates are shown in parentheses.
Figure 3Subgenome additive and interaction variance parameter estimates from the ABDABD (7) model correcting for population structure with principal components as fixed effects. Models were fit with four traits for the CNLM population.
Estimated genetic correlation of traits with additive (below diagonal) and independent genetic relationships (above diagonal). Genetic standard deviations of scaled traits estimated with a realized additive covariance between individuals and assuming independence are shown in parentheses on the diagonal, respectively
| GY | PH | TW | HD | |
|---|---|---|---|---|
| GY | (0.29, 0.36) | −0.39 | −0.24 | 0.16 |
| PH | −0.44 | (0.72, 0.65) | 0.31 | 0.05 |
| HD | −0.05 | 0.11 | (0.44, 0.44) | −0.28 |
| TW | −0.04 | 0.3 | −0.22 | (0.5, 0.49) |