| Literature DB >> 33903985 |
Seema Yadav1, Xianming Wei2, Priya Joyce3, Felicity Atkin4, Emily Deomano3, Yue Sun3, Loan T Nguyen1, Elizabeth M Ross1, Tony Cavallaro1, Karen S Aitken5, Ben J Hayes1, Kai P Voss-Fels6.
Abstract
KEY MESSAGE: Non-additive genetic effects seem to play a substantial role in the expression of complex traits in sugarcane. Including non-additive effects in genomic prediction models significantly improves the prediction accuracy of clonal performance. In the recent decade, genetic progress has been slow in sugarcane. One reason might be that non-additive genetic effects contribute substantially to complex traits. Dense marker information provides the opportunity to exploit non-additive effects in genomic prediction. In this study, a series of genomic best linear unbiased prediction (GBLUP) models that account for additive and non-additive effects were assessed to improve the accuracy of clonal prediction. The reproducible kernel Hilbert space model, which captures non-additive genetic effects, was also tested. The models were compared using 3,006 genotyped elite clones measured for cane per hectare (TCH), commercial cane sugar (CCS), and Fibre content. Three forward prediction scenarios were considered to investigate the robustness of genomic prediction. By using a pseudo-diploid parameterization, we found significant non-additive effects that accounted for almost two-thirds of the total genetic variance for TCH. Average heterozygosity also had a major impact on TCH, indicating that directional dominance may be an important source of phenotypic variation for this trait. The extended-GBLUP model improved the prediction accuracies by at least 17% for TCH, but no improvement was observed for CCS and Fibre. Our results imply that non-additive genetic variance is important for complex traits in sugarcane, although further work is required to better understand the variance component partitioning in a highly polyploid context. Genomics-based breeding will likely benefit from exploiting non-additive genetic effects, especially in designing crossing schemes. These findings can help to improve clonal prediction, enabling a more accurate identification of variety candidates for the sugarcane industry.Entities:
Mesh:
Year: 2021 PMID: 33903985 PMCID: PMC8263546 DOI: 10.1007/s00122-021-03822-1
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.574
Fig. 2Genomic additive relationship matrix showing the proportion of genome shared amongst a total of 2,909 clones from 2013 to 2017 that were evaluated in final assessment trials of the Sugar Research Australia breeding program. The top and the side axis both represent the clones. Each coloured point represents the proportion of the genome each pair of clones have in common. Higher degrees of genomic relationships between clones are represented by a light colour (e.g. diagonal elements), while a pink shading represents a weaker genomic relationship.
Fig. 1Accuracies of genomic prediction using a single kernel approach in a reproducible kernel Hilbert space (RKHS) model for a range of bandwidth parameters, h. This validation of h values was performed in order to select h values associated the highest prediction accuracy. A validation data set independent of the three forward prediction scenarios was chosen by using 1,320 clones from 2013 and 2014 as training population to predict 662 clones from 2015. TCH = tonnes of cane per hectare; CCS = commercial cane sugar; Fibre = Fibre content
Estimates of variance components, narrow-sense heritability, dominance ratio, epistatic ratio, broad-sense heritability, heterozygosity effects, and the maximum log-likelihood (Log L) for forward prediction scenario 1a/b (1,825 clones from 2013–2015 used as training population)
| Trait | Model | LogLik | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| TCH | A | 98.13 (3.64) | 36.09 (0.39) | 0.73 (0.008) | − 45,439.59f | ||||||
| AH | 95.78 (3.56) | 36.09 (0.39) | 0.73 (0.008) | 125.72 (19.88) | − 45,416.81e | ||||||
| AD | 49.91 (5.21) | 35.84 (4.70) | 36.08 (0.39) | 0.41 (0.04) | 0.29 (0.04) | 0.70 (0.008) | -45,403.70d | ||||
| ADH | 53.41 (5.33) | 31.70 (4.63) | 36.08 (0.39) | 0.44 (0.04) | 0.26 (0.04) | 0.70 (0.008) | 133.89 (29.95) | ||||
| ADE | 21.91 (4.19) | 9.71 (4.19) | 44.33 (5.43) | 36.06 (0.39) | 0.20 (0.04) | 0.09 (0.04) | 0.40 (0.05) | 0.68 (0.009) | |||
| ADEH | 23.00 (4.22) | 3.13 (3.68) | 48.45 (5.30) | 36.06 (0.39) | 0.21 (0.04) | 0.03 (0.03) | 0.44 (0.05) | 0.67 (0.008) | 118.83 (20.66) | ||
| CCS | A | 0.41 (0.02) | 0.134 (0.001) | 0.75 (0.007) | 6586.014b | ||||||
| AH | 0.41 (0.02) | 0.134 (0.001) | 0.75 (0.007) | 0.093 (1.30) | 6586.277b | ||||||
| AD | 0.40 (0.02) | 0.006 (0.014) | 0.134 (0.001) | 0.74 (0.029) | 0.010 (0.026) | 0.75 (0.008) | 6586.060b | ||||
| ADH | 0.40 (0.02) | 0.008 (0.014) | 0.134 (0.001) | 0.74 (0.03) | 0.014 (0.03) | 0.75 (0.008) | 0.142 (1.36) | 6586.363b | |||
| ADE | 0.256 (0.03) | ~ 0 (NA) | 0.110 (0.02) | 0.134 (0.002) | 0.51 (0.05) | ~ 0 (NA) | 0.22 (0.05) | 0.73 (0.008) | 6596.317a | ||
| ADEH | 0.256 (0.03) | ~ 0 (NA) | 0.11 (0.02) | 0.134 (0.002) | 0.51 (0.05) | ~ 0 (NA) | 0.22 (0.05) | 0.73 (0.008) | 0.204 (1.30) | 6596.591a | |
| Fibre | A | 1.54 (0.05) | 0.16 (0.002) | 0.91 (0.003) | 4118.252b | ||||||
| AH | 1.54 (0.05) | 0.16 (0.002) | 0.91 (0.003) | 1.29 (2.47) | 4119.29b | ||||||
| AD | 1.43 (0.08) | 0.08 (0.05) | 0.16 (0.002) | 0.86 (0.03) | 0.05 (0.03) | 0.91 (0.003) | 4119.76b | ||||
| ADH | 1.43 (0.08) | 0.08 (0.05) | 0.16 (0.002) | 0.86 (0.03) | 0.05 (0.03) | 0.91 (0.003) | 0.921 (2.78) | 4120.84b | |||
| ADE | 1.14 (0.12) | 0.01 (0.05) | 0.28 (0.10) | 0.16 (0.002) | 0.72 (0.06) | 0.006 (0.03) | 0.18 (0.06) | 0.91 (0.004) | 4123.30a | ||
| ADEH | 1.14 (0.12) | 0.01 (0.05) | 0.27 (0.10) | 0.16 (0.002) | 0.72 (0.06) | 0.008 (0.03) | 0.17 (0.06) | 0.91 (0.004) | 0.841 (2.53) | 4124.28a |
= additive genetic variance; = dominance genetic variance; = epistatic (additive–additive) genetic variance; = residual variance; = total phenotypic variance; HET = heterozygosity effect. standard error(se) in parantheses.
a–f models without a common superscript are significantly different at p < 0.05
Model A = additive model; Model AH additive plus heterozygosity, Model AD additive plus dominance model, Model ADH additive, dominance plus heterozygosity, Model ADE additive, dominance and epistatic effect, Model ADEH additive, dominance, epistatic plus heterozygosity. TCH Tonnes of cane per hectare, CCS Commercial cane sugar, measured in percent, Fibre = Fibre content, measured in percent
Estimates of variance components, narrow-sense heritability, dominance ratio, epistatic ratio, broad-sense heritability, heterozygosity effects, and the maximum log-likelihood (LogL) for forward prediction scenario 2 (2,397clones from 2013–2016 used as training population)
| Trait | Model | LogLik | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| TCH | A | 85.99 (2.75) | 37.39 (0.33) | 0.70 (0.007) | − 66,907.76f | ||||||
| AH | 83.26 (2.67) | 37.39 (0.33) | 0.69 (0.007) | 139.70 (16.78) | − 66,870.77e | ||||||
| AD | 48.84 (4.14) | 29.59 (3.80) | 37.38 (0.33) | 0.42 (0.033) | 0.26 (0.032) | 0.68 (0.007) | − 66,872.13d | ||||
| ADH | 53.52 (4.25) | 23.87 (3.70) | 37.38 (0.33) | 0.47 (0.03) | 0.21 (0.03) | 0.67 (0.007) | 157.38 (25.40) | − 66,850.68c | |||
| ADE | 22.82 (3.72) | 11.09 (3.49) | 38.89 (4.82) | 37.37 (0.33) | 0.21 (0.033) | 0.10 (0.032) | 0.35 (0.043) | 0.66 (0.007) | − 66,845.05b | ||
| ADEH | 23.30 (3.67) | 1.24 (2.67) | 46.47 (4.54) | 37.37 (0.33) | 0.21 (0.033) | 0.011 (0.025) | 0.43 (0.04) | 0.66 (0.008) | 140.57 (17.01) | − 66,813.82a | |
| CCS | A | 0.367 (0.012) | 0.125 (0.001) | 0.746 (0.006) | 10,890.366b | ||||||
| AH | 0.367 (0.012) | 0.125 (0.001) | 0.746 (0.006) | − 0.801 (1.106) | 10,890.728b | ||||||
| AD | 0.338 (0.019) | 0.021 (0.012) | 0.125 (0.001) | 0.698 (0.028) | 0.044 (0.025) | 0.74 (0.007) | 10,891.675b | ||||
| ADH | 0.338 (0.019) | 0.022 (0.012) | 0.125 (0.001) | 0.697 (0.028) | 0.045 (0.025) | 0.74 (0.006) | − 0.764 (1.27) | 10,892.093b | |||
| ADE | 0.228 (0.024) | ~ 0 (NA) | 0.110 (0.019) | 0.125 (0.001) | 0.493 (0.045) | ~ 0 (NA) | 0.237 (0.043) | 0.73 (0.007) | 10,903.98a | ||
| ADEH | 0.228 (0.024) | ~ 0 (NA) | 0.109 (0.019) | 0.125 (0.001) | 0.49 (0.045) | ~ 0 (NA) | 0.24 (0.043) | 0.73 (0.007) | − 0.811 (1.11) | 10,904.35a | |
| Fibre | A | 1.38 (0.04) | 0.16 (0.001) | 0.90 (0.002) | 6449.904b | ||||||
| AH | 1.38 (0.04) | 0.16 (0.001) | 0.90 (0.002) | 0.562 (2.12) | 6450.688b | ||||||
| AD | 1.31 (0.06) | 0.05 (0.04) | 0.16 (0.001) | 0.86 (0.03) | 0.03 (0.03) | 0.90 (0.003) | 6450.904b | ||||
| ADH | 1.31 (0.06) | 0.06 (0.04) | 0.16 (0.001) | 0.86 (0.03) | 0.03 (0.03) | 0.90 (0.003) | 0.445 (2.34) | 6451.769b | |||
| ADE | 1.13 (0.10) | 0.02 (0.04) | 0.18 (0.09) | 0.16 (0.001) | 0.76 (0.06) | 0.012 (0.03) | 0.123 (0.06) | 0.89 (0.003) | 6452.861a | ||
| ADEH | 1.13 (0.10) | 0.02 (0.04) | 0.18 (0.09) | 0.16 (0.001) | 0.76 (0.06) | 0.014 (0.03) | 0.120 (0.06) | 0.89 (0.003) | 0.263 (2.21) | 6453.659a |
= additive genetic variance; = dominance genetic variance; = epistatic (additive–additive) genetic variance; = residual variance; = total phenotypic variance; HET = heterozygosity effect. standard error(se) in parantheses.
a–f models without a common superscript are significantly different at p < 0.05
Model A; additive model, Model AH, additive plus heterozygosity; Model AD = additive plus dominance model; Model ADH = additive, dominance plus heterozygosity; Model ADE additive, dominance and epistatic effect, Model ADEH additive, dominance, epistatic plus heterozygosity. TCH Tonnes of cane per hectare, CCS Commercial cane sugar, measured in percent; Fibre Fibre content, measured in percent
Fig. 3Decomposition of genetic variance into additive, dominance, additive–additive epistatic, and residual variance in two forward prediction scenarios. a Proportion of genetic variance in forward prediction scenarios 1 a/1b (1,825 clones from 2013–2015 used as training population) for six different covariance structures (see Table 2). b Proportion of genetic variance in forward prediction scenario 2 (2,397 clones from 2013–2016 used as training population) for six different covariance structures (see Table 2). Va = additive genetic variance; Vd = dominance genetic variance; Vaa = additive–additive epistasis variance; Ve = error variance; Model A = additive model; Model AH additive plus heterozygosity; Model AD additive plus dominance model; Model ADH additive, dominance plus heterozygosity; Model ADE additive, dominance and epistatic effect; Model ADEH additive, dominance, epistatic plus heterozygosity; TCH tonnes of cane per hectare; CCS commercial cane sugar; Fibre = Fibre content
Number of records for Final Assessment Trial (FAT) clones in reference and prediction sets per region
| Year | Burdekin | Central | Northern | Southern | Total |
|---|---|---|---|---|---|
| 2013 | 322 | 185 | 156 | 155 | 818 |
| 2014 | 339 | 139 | 163 | 178 | 819 |
| 2015 | 298 | 125 | 156 | 172 | 751 |
| 2016 | 320 | 132 | 162 | 196 | 810 |
| 2017 | 306 | 187 | 147 | 157 | 797 |
| Total | 1585 | 768 | 784 | 858 | 3995 |
In forward prediction scenario1a/b 1,825 unique clones from 2013–2015 were used as training set to predict 739 and 691 unique clones from 2016 and 2017, respectively. In forward prediction scenario 2, 2,397 unique clones from 2013–2016 were used as training set to predict 691 unique clones from 2017. Clones that overlapped between the training and prediction sets were removed from the analyses. Overall, 739 clones and 25,714 SNPs overlapped between training populations for prediction scenarios 1a/b and scenario 2
Fig. 4Prediction accuracies for three key traits in different forward prediction scenarios measured as a Pearson’s correlation between genomic prediction and adjusted phenotypes of clones. Prediction accuracies for tonnes cane per hectare (TCH), commercial cane sugar (CCS), and Fibre content in, scenario 1a (1,825 clones from 2013–2015 used as training population to predict 739 clones from 2016); scenario 1b (1,825 clone from 2013–2015 used as training population to predict 691 clones from 2017); and scenario 2 (2,397 clones from 2013–2016 used as training population to predict 691 clones from 2017). Model A = additive model; Model AH additive plus heterozygosity; Model AD additive plus dominance model; Model ADH additive, dominance plus heterozygosity; Model ADE additive, dominance and epistatic effect; Model ADEH additive, dominance, epistatic plus heterozygosity. Error bars show the standard errors of the correlations between the genomic prediction and the adjusted phenotypes
Prediction accuracies averaged across four regions of Sugar Research Australia’s breeding program, comparing GBLUP, extended-GBLUP, and RKHS models in forward prediction scenarios 1a (1,825 clones from 2013–2015 used as training population, to predict 739 clones from 2016), scenario 1b (1,825 clones from 2013–2015 used as training population to predict 691 clones from 2017) and scenario 2 (2,397 clones from 2013–2016 used as training population to predict clones from 2017)
| Scenario 1a | Scenario 1b | Scenario 2 | |||||||
|---|---|---|---|---|---|---|---|---|---|
| TS: 2013–2015 | TS: 2013–2015 | TS: 2013–2016 | |||||||
| VS: 2016 | VS: 2017 | VS: 2017 | |||||||
| Model | TCH | CCS | Fibre | TCH | CCS | Fibre | TCH | CCS | Fibre |
| A | 0.248 | 0.405 | 0.445 | 0.218 | 0.297 | 0.396 | 0.247 | 0.348 | 0.431 |
| AH | 0.291 | 0.404 | 0.446 | 0.255 | 0.297 | 0.395 | 0.28 | 0.347 | 0.431 |
| AD | 0.264 | 0.406 | 0.442 | 0.235 | 0.304 | 0.390 | 0.262 | 0.353 | 0.429 |
| ADH | 0.287 | 0.406 | 0.442 | 0.258 | 0.305 | 0.390 | 0.28 | 0.352 | 0.429 |
| ADE | 0.283 | 0.409 | 0.443 | 0.257 | 0.315 | 0.394 | 0.27 | 0.354 | 0.429 |
| ADEH | 0.325 | 0.408 | 0.444 | 0.286 | 0.315 | 0.393 | 0.29 | 0.354 | 0.428 |
| RKHS | 0.272 | 0.390 | 0.407 | 0.246 | 0.318 | 0.389 | 0.247 | 0.348 | 0.431 |
Prediction accuracies were calculated as the Pearson’s correlation between genomic prediction and the adjusted phenotypes
Model A = additive model; Model AH = additive plus heterozygosity; Model AD = additive plus dominance model; Model ADH = additive, dominance plus heterozygosity; Model ADE = additive, dominance and epistatic effect; Model ADEH = additive, dominance, epistatic plus heterozygosity; RKHS = Reproducible kernel Hilbert space. TCH = Tonnes of cane per hectare; CCS = Commercial cane sugar; Fibre = Fibre content. TS = Training data set; VS = Validation data set