| Literature DB >> 33953221 |
Henry O Awika1, Amit K Mishra2, Haramrit Gill3, James DiPiazza2, Carlos A Avila1,3, Vijay Joshi4,5.
Abstract
The efficient acquisition and transport of nutrients by plants largely depend on the root architecture. Due to the absence of complex microbial network interactions and soil heterogeneity in a restricted soilless medium, the architecture of roots is a function of genetics defined by the soilless matrix and exogenously supplied nutrients such as nitrogen (N). The knowledge of root trait combinations that offer the optimal nitrogen use efficiency (NUE) is far from being conclusive. The objective of this study was to define the root trait(s) that best predicts and correlates with vegetative biomass under differed N treatments. We used eight image-derived root architectural traits of 202 diverse spinach lines grown in two N concentrations (high N, HN, and low N, LN) in randomized complete blocks design. Supervised random forest (RF) machine learning augmented by ranger hyperparameter grid search was used to predict the variable importance of the root traits. We also determined the broad-sense heritability (H) and genetic (rg) and phenotypic (rp) correlations between root traits and the vegetative biomass (shoot weight, SWt). Each root trait was assigned a predicted importance rank based on the trait's contribution to the cumulative reduction in the mean square error (MSE) in the RF tree regression models for SWt. The root traits were further prioritized for potential selection based on the rg and SWt correlated response (CR). The predicted importance of the eight root traits showed that the number of root tips (Tips) and root length (RLength) under HN and crossings (Xsings) and root average diameter (RAvdiam) under LN were the most relevant. SWt had a highly antagonistic rg (- 0.83) to RAvdiam, but a high predicted indirect selection efficiency (- 112.8%) with RAvdiam under LN; RAvdiam showed no significant rg or rp to SWt under HN. In limited N availability, we suggest that selecting against larger RAvdiam as a secondary trait might improve biomass and, hence, NUE with no apparent yield penalty under HN.Entities:
Year: 2021 PMID: 33953221 PMCID: PMC8100178 DOI: 10.1038/s41598-021-87870-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Model training and grid search across 192 hyperparametric combinations. Random forest out-of-bag error rate (a and b) compared to the corresponding validation error rate (c and d) averaged along tree optimization splits. The optimal number of trees and the corresponding mean squared error (out-of-bag error) are shown by the orange arrows on the graphs (a–d). The root mean squared error (RMSE) at which all hyperparameters converged is shown in the histograms (e and f). RMSE at optimal hyperparameters (obtained for training model and used for independent validation sets) are shown by arrows in the histograms Also see Supplementary Table S2-tuning hyperparameters).
Summary of model evaluation and test (validation) by random forest machine learning.
| N-management | Model name | Tuning RMSE at optimal hyperparameters | Optimal model parameters | Plit rule | OOB prediction error (MSE, mean of squared residuals) | Regression R-squared (OOB) | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Treesa | Nodes | Sample fractionφ | Variablesb | Sample size | ||||||
| LN | Default | 0.4530081 | 500 | 5 | 0.632 | 2 | 127 | Variance | 0.227015 | 0.568 |
| Training | 500 | 8 | 0.7 | 3 | 141 | Variance | 0.239117 | 0.572 | ||
| Validation | 500 | 8 | 0.7 | 3 | 61 | Variance | 0.209563 | 0.612 | ||
| HN | Default | 1.581197 | 500 | 5 | 0.632 | 2 | 121 | Variance | 2.793847 | 0.602 |
| Training | 500 | 2 | 0.56 | 7 | 121 | Variance | 2.798641 | 0.613 | ||
| Validation | 500 | 2 | 0.56 | 7 | 81 | Variance | 2.712218 | 0.646 | ||
aTrees maintained at 500 since all optimal trees (in Fig. 2) fell within this limit.
bNumber of predictor root traits sampled at each tree node; HN, high N; LN, low N.
ϕThe validation sample was kept independent of the training sample, and the same hyperparameters were applied as for the training model.
The genetic correlations and phenotypic correlations of root traits.
| Traits | SWt | RWt | RLenght | RSarea | RAvdiam | RVol | Tips | Forks | Xsings |
|---|---|---|---|---|---|---|---|---|---|
| SWt | 0.789 | 0.766 | 0.759 | 0.168 ns | 0.732 | 0.794 | 0.709 | 0.702 | |
| RWt | 0.804 | 0.921 | 0.950 | 0.348 | 0.955 | 0.869 | 0.886 | 0.856 | |
| RLenght | 0.769 | 0.952 | 0.987 | 0.144 ns | 0.951 | 0.947 | 0.968 | 0.972 | |
| RSarea | 0.757 | 0.967 | 0.992 | 0.282 | 0.988 | 0.915 | 0.953 | 0.939 | |
| RAvdiam | 0.045 ns | 0.199 ns | 0.010 ns | 0.135 ns | 0.402 | 0.042 ns | 0.102 ns | 0.021 ns | |
| RVol | 0.730 | 0.964 | 0.968 | 0.992 | 0.254 | 0.864 | 0.916 | 0.887 | |
| Tips | 0.815 | 0.923 | 0.968 | 0.936 | − 0.112 ns | 0.890 | 0.930 | 0.931 | |
| Forks | 0.703 | 0.921 | 0.989 | 0.973 | − 0.053 ns | 0.941 | 0.958 | 0.988 | |
| Xsings | 0.706 | 0.902 | 0.988 | 0.965 | − 0.107 ns | 0.927 | 0.966 | 0.995 |
The table shows the genetic correlations (lower triangle) and phenotypic correlations (upper triangle) of eight root trait and one shoot trait under high N-management.
SWt, shoot weight, RWt, root weight, RLength, root length, RSarea, root surface area, RAVdiam, root average diameter, RVol, root volume, Tips, number of root tips, Forks, number of forks, Xsings, number of crossings, ns, not significant at P ≤ 0.05; all others, significant at 0.05 > p < 0.001.
Figure 2Ranking of root traits based on machine learning and multi-environment trait analysis methods. (a) and (b) are rankings of the variable importance of eight root traits (vertical axis) in a model for predicting shoot weight under high N-management, HN (a) and low N-management, LN (b) using random forest machine learning algorithm. Each point in (a) and (b) is a cumulative reduction in mean square error (MSE) of regression models each time the corresponding predictor root trait variable is used as a node split in a decision tree. The larger the cumulative reduction in the MSE, the greater the important ace and the higher a root trait ranks across all models tested. The traits have been ranked in ascending order with the least important at the bottom and the most important at the top on the vertical axis. (c) and (d) show the genetic correlation coefficients (r), and (e) and (f) the phenotypic correlation coefficient (r) between each variable and shoot weight. Correlation size and significance are shown on each bar. A negative ( −) before a number indicates it is a negative correlation, *significant at P = 0.05 and **significant at p < 0.01.
The genetic correlations (lower triangle) and phenotypic correlations (upper triangle) of eight root trait and one shoot trait under low N-management.
| Traits | SWt | RWt | RLenght | RSarea | RAvdiam | RVol | Tips | Forks | Xsings |
|---|---|---|---|---|---|---|---|---|---|
| SWt | 0.515 | 0.529 | 0.284 | − 0.481 | 0.112 ns | 0.586 | 0.472 | 0.728 | |
| RWt | 0.521 | 0.880 | 0.849 | 0.104 ns | 0.769 | 0.777 | 0.887 | 0.793 | |
| RLenght | 0.539 | 0.929 | 0.869 | − 0.034 ns | 0.639 | 0.881 | 0.936 | 0.870 | |
| RSarea | 0.047 ns | 0.676 | 0.744 | 0.431 | 0.931 | 0.662 | 0.871 | 0.587 | |
| RAvdiam | − 0.827 | − 0.553 | − 0.139 | 0.533 | 0.690 | − 0.238 | 0.054 ns | − 0.310 | |
| RVol | − 0.296 | 0.218 | 0.439 | 0.920 | 0.794 | 0.419 | 0.685 | 0.324 | |
| Tips | 0.623 | 0.745 | 0.822 | 0.405 | − 0.424 | 0.115 ns | 0.837 | 0.875 | |
| Forks | 0.465 | 0.986 | 0.916 | 0.788 | − 0.004 ns | 0.538 | 0.755 | 0.831 | |
| Xsings | 0.890 | 0.871 | 0.833 | 0.357 | − 0.452 | 0.052 ns | 0.885 | 0.760 |
SWt, shoot weight, RWt, root weight, RLength, root length, RSarea, root surface area, RAVdiam, root average diameter, RVol, root volume, Tips, number of root tips, Forks, number of forks, Xsings, number of crossings, ns, not significant at P ≤ 0.05; all others, significant at 0.05 > p < 0.001.
The genetic correlations (lower triangle) and phenotypic correlations (upper triangle) between traits across the two N managements.
| Traits | SWt | RWt | RLenght | RSarea | RAvdiam | RVol | Tips | Forks | Xsings |
|---|---|---|---|---|---|---|---|---|---|
| SWt | 0.759 | 0.684 | 0.576 | 0.020 ns | 0.406 | 0.618 | 0.640 | 0.692 | |
| RWt | 0.952 | 0.878 | 0.843 | 0.194 | 0.692 | 0.760 | 0.861 | 0.817 | |
| RLenght | 0.999 | 0.999 | 0.934 | 0.116 ns | 0.755 | 0.903 | 0.957 | 0.929 | |
| RSarea | 0.999 | 0.999 | 0.999 | 0.423 | 0.936 | 0.785 | 0.906 | 0.788 | |
| RAvdiam | 0.999 | 0.999 | 0.999 | 0.999 | 0.664 | − 0.099 ns | 0.132 ns | − 0.038 ns | |
| RVol | 0.988 | 0.999 | 0.999 | 0.999 | 0.999 | 0.587 | 0.750 | 0.574 | |
| Tips | 0.629 | 0.911 | 0.905 | 0.719 | 0.702 | 0.751 | 0.885 | 0.869 | |
| Forks | 0.999 | 0.999 | 0.976 | 0.855 | 0.999 | 0.961 | 0.980 | 0.940 | |
| Xsings | NA | NA | NA | NA | NA | NA | NA | NA |
SWt, shoot weight, RWt, root weight, RLength, root length, RSarea, root surface area, RAVdiam, root average diameter, RVol, root volume, Tips, number of root tips, Forks, number of forks, Xsings, number of crossings, ns, not significant at P ≤ 0.05; all others, significant at 0.05 > p < 0.001, NA Not available.
Figure 3The structure of genetic correlations between traits within and between the two N-managements. Ward method was used to construct the dendrograms. In the correlations between N-Managements, Xsings is missing because its heritability was < 0.1 threshold we had set when calculating the genetic correlation coefficients.
Predicting the correlated response and indirect selection efficiency for shoot weight using root traits as the secondary traits under high N-management and low N-management.
| Hswt | Vg-swt | Root trait | rg | Htrait | CRswt | Rswt | CRswt/Rswt | Indirect selection efficiency | |
|---|---|---|---|---|---|---|---|---|---|
| High N-management | 0.646 | 4.740 | RWt | 0.804 | 0.504 | 1.243 | 1.750 | 0.710 | 0.627 |
| Rlenght | 0.769 | 0.649 | 1.349 | 0.771 | 0.772 | ||||
| RSarea | 0.757 | 0.605 | 1.282 | 0.733 | 0.709 | ||||
| Ravdiam | 0.045 | 0.536 | 0.072 | 0.041 | 0.037 | ||||
| RVol | 0.730 | 0.554 | 1.183 | 0.676 | 0.626 | ||||
| Tips | 0.815 | 0.655 | 1.436 | 0.821 | 0.826 | ||||
| Forks | 0.703 | 0.634 | 1.219 | 0.696 | 0.690 | ||||
| Xsings | 0.706 | 0.658 | 1.247 | 0.713 | 0.719 | ||||
| Low N-management | 0.701 | 0.430 | RWt | 0.521 | 0.506 | 0.243 | 0.549 | 0.443 | 0.376 |
| Rlenght | 0.539 | 0.417 | 0.228 | 0.416 | 0.321 | ||||
| RSarea | 0.047 | 0.475 | 0.021 | 0.039 | 0.032 | ||||
| Ravdiam | − 0.827 | 0.957 | − 0.530 | − 0.966 | − 1.128 | ||||
| RVol | − 0.296 | 0.634 | − 0.155 | − 0.281 | − 0.268 | ||||
| Tips | 0.623 | 0.454 | 0.275 | 0.501 | 0.403 | ||||
| Forks | 0.465 | 0.472 | 0.209 | 0.381 | 0.313 | ||||
| Xsings | 0.890 | 0.588 | 0.447 | 0.815 | 0.746 |
V genetic variance of shoot weight (SWt), r, genetic correlation between SWt and a root trait, H broad sense heritability (repeatability) of SWt on a line mean basis; H broad sense heritability (repeatability) a root trait on a line mean basis, CR, Correlated response; R, response to direct selection; Indirect selection efficiency compares H to [(r between SWt and Trait) × H].
Variance, heritability and means separation for shoot weight and root traits.
| Statistic | N management | SWt | RWt | RLength | RSarea | RAdiam | RVol | Tips | Forks | Xsings |
|---|---|---|---|---|---|---|---|---|---|---|
| Heritability | HN | 0.646 | 0.503 | 0.649 | 0.605 | 0.536 | 0.554 | 0.655 | 0.634 | 0.658 |
| LN | 0.701 | 0.506 | 0.417 | 0.475 | 0.957 | 0.634 | 0.455 | 0.471 | 0.588 | |
| Combined | 0.118 | 0.165 | 0.081 | 0.205 | 0.0996 | 0.229 | 0.141 | 0.166 | 0.0415 | |
| Genotype Variance | HN | 4.740*** | 0.073*** | 83,728.7*** | 476.8*** | 0.00023*** | 0.0179*** | 171,457.8*** | 2,050,402.7*** | 171,116.6*** |
| LN | 0.430*** | 0.0404*** | 24,232.9*** | 341.8*** | 0.0039*** | 0.055*** | 115,541.5*** | 754,145.1*** | 35,187.6*** | |
| Combined | 0.265 | 0.0103 | 3942.1 | 86.02 | 0.000118 | 0.0078 | 19,511.98 | 218,626.9 | 3391.99 | |
| Management Variance | Combined | 6.275*** | 0.020** | 5767.0* | 16.6 | 0.0035*** | 0.0155*** | 75.89 | 0 | 46,191.8*** |
| Genotype × Management Variance | Combined | 2.495*** | 0.048**** | 50,039.5*** | 323.3 | 0.0019*** | 0.029*** | 123,987.7*** | 1,179,157.3*** | 99,761.8*** |
| Residual Variance | HN | 7.791 | 0.215 | 135,956.9 | 934.52 | 0.0006 | 0.044 | 271,071.4 | 3,549,899 | 266,834.7 |
| LN | 0.549 | 0.119 | 101,570.5 | 1132.45 | 0.0005 | 0.096 | 416,459.7 | 2,539,164 | 73,997.1 | |
| Combined | 4.461 | 0.171 | 118,741.2 | 1033.6 | 0.0006 | 0.0697 | 343,826.2 | 3,043,825 | 170,311.1 | |
| Trait Mean | HN | 5.172 | 0.657 | 694.3 | 52.8 | 0.238 | 0.324 | 1106.1 | 2417.2 | 672.2 |
| LN | 1.623 | 0.456 | 581.7 | 59.22 | 0.322 | 0.501 | 1157.7 | 2358.7 | 365.4 | |
| LSD on Mean | Combined | 1.356* | 0.261 | 167.9 | 23.08 | 0.0287* | 0.216 | 361.1 | 1190.7 | 159.02* |
| CV | HN | 53.97 | 70.61 | 53.11 | 57.9 | 10.281 | 64.44 | 47.07 | 77.947 | 76.85 |
| LN | 45.65 | 75.54 | 54.79 | 56.82 | 7.133 | 61.83 | 55.74 | 67.56 | 74.44 | |
| Combined | 62.23 | 73.94 | 54.01 | 57.4 | 8.474 | 64.06 | 51.81 | 73.06 | 79.54 |
LN, low N; HN, high N; LSD, least significant difference; CV, coefficient of variations; *, **, ***significant at p = 0.05, 0.05 > p ≥ 0.01, p < 0.01, respectively.