| Literature DB >> 35792875 |
Alain J Mbebi1,2, Jean-Christophe Breitler3, Mélanie Bordeaux4, Ronan Sulpice5, Marcus McHale5, Hao Tong1,2,6, Lucile Toniutti3, Jonny Alonso Castillo4, Benoît Bertrand3, Zoran Nikoloski1,2,6.
Abstract
Genomic prediction has revolutionized crop breeding despite remaining issues of transferability of models to unseen environmental conditions and environments. Usage of endophenotypes rather than genomic markers leads to the possibility of building phenomic prediction models that can account, in part, for this challenge. Here, we compare and contrast genomic prediction and phenomic prediction models for 3 growth-related traits, namely, leaf count, tree height, and trunk diameter, from 2 coffee 3-way hybrid populations exposed to a series of treatment-inducing environmental conditions. The models are based on 7 different statistical methods built with genomic markers and ChlF data used as predictors. This comparative analysis demonstrates that the best-performing phenomic prediction models show higher predictability than the best genomic prediction models for the considered traits and environments in the vast majority of comparisons within 3-way hybrid populations. In addition, we show that phenomic prediction models are transferrable between conditions but to a lower extent between populations and we conclude that chlorophyll a fluorescence data can serve as alternative predictors in statistical models of coffee hybrid performance. Future directions will explore their combination with other endophenotypes to further improve the prediction of growth-related traits for crops.Entities:
Keywords: 3-way coffee hybrids; GenPred; Shared Data Resource; chlorophyll a fluorescence; genomic prediction; phenomic prediction
Mesh:
Substances:
Year: 2022 PMID: 35792875 PMCID: PMC9434219 DOI: 10.1093/g3journal/jkac170
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Successive treatment conditions applied on the H3W coffee populations before their transfer to the field.
| Treatment | Altitude | Temperature | Condition | Duration | Mimicking |
|---|---|---|---|---|---|
| 1 | 600 | 23.6 | Shade | 3 | n/a (acclimation) |
| 2 | 600 | 24.5 | Full sun | 2 | Open field |
| 3 | 600 | 23.5 | Shade | 2.5 | AFS established |
| 4 | 1300 | 20 | Full sun | 2 | Cooler temperatures |
Altitude, duration, and temperature are, respectively, measured in meters, months, and °C. AFS denotes agroforestry system.
Fig. 1.Predictability of traits in H3W coffee families based on GP and PP models. We used the following models: L21-joint, RR, mLASSO, EN, BL, mBayesB, and GBLUP to predict LC (left), TH (middle), and TD (right). This is setting S1 with traits and phenomic data obtained by concatenating the respective measurements over all conditions after the acclimation. The predictability is computed as the average Pearson correlation coefficient between observed and predicted values for the 9 traits (i.e. 3 traits for each treatment) in the validation set, based on 20 repetitions of 3-fold cross-validation. Two H3W coffee populations were considered for the comparative analysis: H1xET47 and H1xG, where Centroamericano (H1) is an F1 hybrid cultivated clonally and results from a cross between T.05296 and Rume Sudan, and Geisha 3 (G) and ET47 (the mother plant) are 2 Ethiopian landrace varieties. The average accuracy obtained from repeated cross-validations are reported as the height of the bars, and standard errors are included.
Fig. 2.Comparison between GP and PP models under AFS conditions. We used L21-joint, RR, mLASSO, EN, BL, and mBayesB for PP and the best-performing GP model for each H3W coffee plant and trait. For the selected traits, BL and GBLUP are the best-performing GP models for H1xG, while EN, L21-joint, and GLUP are the best GP models for H1xET47. The predictability is computed as the average Pearson correlation coefficient between observed and predicted trait values in the validation set based on 20 replicates of 3-fold cross-validation. The comparative analysis is concerned with setting S2 where the best-performing genomic prediction models for H1xET47 and H1xG populations (i.e. GP-H1xET47 and GP-H1xG) using their respective SNP data, are contrasted with phenomic predictions of the same hybrid families (PP-H1xET47 and PP-H1xG) under established AFS. Models were evaluated after treatment 3 (Table 1) with phenotypic and phenomic data following setting S2. The average accuracy obtained from repeated cross-validations are presented as the height of the bars along with their corresponding standard errors.
Comparison between GP and PP models based on condition-ahead predictive abilities.
| H1xET47 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GP of treatment 3 using treatment 2 | PP of treatment 3 using treatment 2 | |||||||||||
| BL | EN | GBLUP | L21-joint | mBayesB | RR | BL | EN | L21-joint | mBayesB | RR | ||
| LC | 0.276 | 0.164 | 0.086 | 0.06 | 0.032 |
| LC | 0.429 |
| 0.376 | 0.387 | 0.315 |
| TH | 0.055 | 0.324 |
| 0.307 | 0.063 | 0.186 | TH | 0.297 | 0.076 | 0.303 |
| 0.083 |
| TD | 0.016 | 0.049 | 0.079 | 0.164 |
| 0.115 | TD | 0.273 |
| 0.205 | 0.104 | 0.314 |
|
| ||||||||||||
|
|
| |||||||||||
|
|
| |||||||||||
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| ||||||||||||
| LC |
| 0.124 | 0.012 | 0.111 | 0.153 | 0.068 | LC |
| 0.167 | 0.111 | 0.012 | 0.163 |
| TH | 0.182 | 0.168 |
| 0.483 | 0.188 | 0.271 | TH | 0.146 | 0.107 |
| 0.317 | 0.051 |
| TD | 0.202 |
| 0.028 | 0.23 | 0.082 | 0.095 | TD | 0.36 |
| 0.165 | 0.042 | 0.056 |
|
| ||||||||||||
|
|
| |||||||||||
|
|
| |||||||||||
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| ||||||||||||
| LC | 0.279 | 0.122 | 0.106 | 0.173 | 0.173 |
| LC | 0.0187 |
| 0.023 | 0.083 | 0.084 |
| TH | 0.204 | 0.168 | 0.427 |
| 0.036 | 0.052 | TH | 0.359 | 0.131 |
| 0.382 | 0.223 |
| TD | 0.004 | 0.154 | 0.047 |
| 0.061 | 0.13 | TD | 0.069 | 0.138 | 0.0425 |
| 0.287 |
|
| ||||||||||||
|
| ||||||||||||
|
| ||||||||||||
|
|
| |||||||||||
|
|
| |||||||||||
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| ||||||||||||
| LC |
| 0.081 | 0.026 | 0.218 | 0.191 | 0.142 | LC |
| NA | 0.069 | 0.098 | 0.052 |
| TH | 0.043 | 0.009 |
| 0.112 | 0.065 | 0.022 | TH | 0.105 | 0.09 | 0.07 | 0.194 |
|
| TD |
| 0.125 | 0.116 | 0.069 | 0.025 | 0.372 | TD | 0.094 | 0.072 |
| 0.024 | 0.023 |
|
| ||||||||||||
|
|
| |||||||||||
|
|
| |||||||||||
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| ||||||||||||
| LC |
| 0.037 | 0.133 | 0.121 | 0.072 | 0.005 | LC | 0.294 | 0.186 | 0.332 | 0.181 |
|
| TH | 0.08 | 0.076 | 0.053 | 0.188 | 0.094 |
| TH |
| 0.37 | 0.03 | 0.13 | 0.136 |
| TD | 0.012 | 0.23 | 0.059 |
| 0.132 | 0.155 | TD | 0.207 | 0.01 | 0.151 | 0.097 |
|
|
| ||||||||||||
|
|
| |||||||||||
|
|
| |||||||||||
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| ||||||||||||
| LC | 0.334 | 0.103 | 0.136 | 0.167 | 0.096 |
| LC | 0.279 | NA | 0.191 |
| 0.386 |
| TH | 0.027 | 0.11 | 0.068 | 0.081 | 0.063 |
| TH | 0.434 | 0.381 | 0.381 |
| 0.305 |
| TD | 0.015 | 0.175 | 0.036 | 0.149 | 0.19 |
| TD | 0.018 | 0.087 | 0.231 |
| 0.149 |
We used L21-joint, RR, mLASSO, EN, BL, and mBayesB. The performance is computed as the correlation coefficient between measured and predicted LC, TH, and TD, for H1xET47 (i.e. top panel) and H1xG (i.e. bottom panel). This is setting S3, where models are trained on the current environmental condition to predict the next one. Numbers in bold represent the best performance and mLasso is not represented because all the corresponding standard deviations were zero.
Comparison between GP and PP models based on between-family predictive abilities.
| GP of H1xG using H1xET47 | PP of H1xG using H1xET47 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BL | EN | GBLUP | L21-joint | mBayesB | mLasso | RR | BL | EN | L21-joint | mBayesB | mLasso | RR | |
| LC2 | 0.038 | 0.13 | 0.056 | 0.038 | 0.114 | 0.03 |
| 0.099 | 0.137 | 0.009 | 0.102 | NA | 0.02 |
| TH2 | 0.002 | 0.095 | 0.022 | 0.061 | 0.168 | 0.072 | 0.103 |
| 0.123 | 0.236 | 0.294 | NA | 0.094 |
| TD2 | 0.025 | 0.07 | 0.078 | 0.007 | 0.079 | 0.168 | 0.042 | 0.111 | 0.273 | 0.01 | 0.225 | NA |
|
| LC3 | 0.1 | 0.118 | 0.046 | 0.003 | 0.052 | 0.194 | 0.222 | 0.027 | 0.164 | 0.128 | 0.177 | NA |
|
| TH3 | 0.066 | 0.203 | 0.035 | 0.083 | 0.127 | 0.039 | 0.073 |
| 0.153 | 0.168 | 0.322 | NA | 0.032 |
| TD3 | 0.078 |
| 0.066 | 0.168 | 0.076 | 0.114 | 0.079 | 0.14 | 0.018 | 0.138 | 0.096 | NA | 0.058 |
| LC4 | 0.025 | 0.095 | 0.064 | 0.037 | 0.143 | 0.108 | 0.138 | 0.144 | 0.087 | 0.096 | 0.143 | NA |
|
| TH4 | 0.009 | 0.09 | 0.068 | 0.128 | 0.018 | 0.015 | 0.224 | 0.252 | 0.084 | 0.277 |
| NA | 0.051 |
| TD4 | 0.069 | 0.074 | 0.154 | 0.027 | 0.114 | 0.139 | 0.051 | 0.211 | 0.096 |
| 0.163 | NA | 0.039 |
|
| |||||||||||||
|
|
| ||||||||||||
|
|
| ||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| |||||||||||||
| LC2 | 0.201 | 0.025 |
| 0.072 | 0.29 | NA | 0.008 | 0.118 | 0.066 | 0.081 | 0.018 | NA | 0.098 |
| TH2 | 0.143 | 0.169 | 0.078 | 0.04 | 0.006 | NA |
| 0.008 | 0.002 | 0.116 | 0.047 | NA | 0.053 |
| TD2 | 0.009 | 0.196 |
| 0.111 | 0.127 | NA | 0.161 | 0 | 0.037 | 0.054 | 0.007 | NA | 0.052 |
| LC3 | 0.07 |
| 0.052 | 0.157 | 0.106 | NA | 0.025 | 0.005 | 0.11 | 0.018 | 0.021 | NA | 0.144 |
| TH3 | 0.016 |
| 0.077 | 0.122 | 0.029 | NA | 0.194 |
| 0.002 | 0.126 | 0.04 | NA | 0.082 |
| TD3 | 0.038 | 0.021 | 0.142 | 0.099 | 0.117 | NA | 0.093 | 0.086 | 0.068 | 0.067 | 0.151 | NA |
|
| LC4 |
| 0.101 | 0.036 | 0.002 | 0.142 | NA | 0.036 | 0.096 | 0.103 | 0.033 | 0.064 | NA | 0.058 |
| TH4 | 0.004 | 0.116 |
| 0.118 | 0.002 | NA |
| 0.096 | 0.09 | 0.048 | 0.003 | NA | 0.049 |
| TD4 |
| 0.186 | 0.194 | 0.125 | 0.294 | NA | 0.244 | 0.133 | 0.203 | 0.178 | 0.111 | NA | 0.08 |
We used L21-joint, RR, mLASSO, EN, BL, and mBayesB. The performance is computed as the correlation coefficient between measured and predicted LC, TH, and TD at each treatment condition and for H1xET47 (i.e. top panel) and H1xG (i.e. bottom panel). This is setting S4, where models are trained with data from one family to predict traits of the other one, with traits and phenomic data constructed by concatenating the respective measurements over all treatment conditions after the acclimation period. Numbers in bold represent the best performance and NA is used to denote that the corresponding standard deviation was zero.
Selection performance of L21-joint, RR, mLASSO, EN, BL, and mBayesB.
| Selected proportion of best-performing lines | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RR | Mlasso | EN | GBLUP | BL | mBayesB | L21-joint | RR | Mlasso | EN | GBLUP | BL | mBayesB | L21-joint | |
|
|
| |||||||||||||
|
| ||||||||||||||
| LC | 15 |
| 5 |
| 10 | 5 |
| 15 | 20 | 25 | 25 | 25 | 35 |
|
| TH | 15 | 10 |
| 5 | 15 | 15 | 15 | 30 | 20 | 20 | 20 | 30 | 30 |
|
| TD | 15 | 10 |
| 20 | 15 | 15 | 20 | 10 | 10 | 15 |
| 10 | 20 |
|
|
| ||||||||||||||
|
|
| |||||||||||||
|
| ||||||||||||||
| LC | 15 | 10 | 10 | xx |
|
|
| 25 | 5 | 25 | xx | 30 | 25 | 30 |
| TH | 0 | 10 | 15 | xx | 15 | 5 | 10 | 20 | 15 |
| xx | 30 | 25 | 30 |
| TD | 5 | 10 |
| xx | 15 | 15 | 25 |
| 25 | 25 | xx | 25 |
|
|
The performance is computed as the proportion of correctly selected best-performing lines with respect to LC, TH, and TD. For populations H1xG (i.e. left panel) and H1xET47 (i.e. right panel), the assessment is conducted for genomic and phenomic prediction models accounting for environmental conditions. Numbers in bold represent the best performance and we write xx to express that the corresponding statistical approach was not used for phenomic prediction.