| Literature DB >> 34211487 |
André Froes de Borja Reis1, Luiz Moro Rosso1, Larry C Purcell2, Seth Naeve3, Shaun N Casteel4, Péter Kovács5, Sotirios Archontoulis6, Dan Davidson7, Ignacio A Ciampitti1.
Abstract
Biological nitrogen (N)-fixation is the most important source of N for soybean [Glycine max (L.) Merr.], with considerable implications for sustainable intensification. Therefore, this study aimed to investigate the relevance of environmental factors driving N-fixation and to develop predictive models defining the role of N-fixation for improved productivity and increased seed protein concentration. Using the elastic net regularization of multiple linear regression, we analyzed 40 environmental factors related to weather, soil, and crop management. We selected the most important factors associated with the relative abundance of ureides (RAU) as an indicator of the fraction of N derived from N-fixation. The most relevant RAU predictors were N fertilization, atmospheric vapor pressure deficit (VPD) and precipitation during early reproductive growth (R1-R4 stages), sowing date, drought stress during seed filling (R5-R6), soil cation exchange capacity (CEC), and soil sulfate concentration before sowing. Soybean N-fixation ranged from 60 to 98% across locations and years (n = 95). The predictive model for RAU showed relative mean square error (RRMSE) of 4.5% and an R2 value of 0.69, estimated via cross-validation. In addition, we built similar predictive models of yield and seed protein to assess the association of RAU and these plant traits. The variable RAU was selected as a covariable for the models predicting yield and seed protein, but with a small magnitude relative to the sowing date for yield or soil sulfate for protein. The early-reproductive period VPD affected all independent variables, namely RAU, yield, and seed protein. The elastic net algorithm successfully depicted some otherwise challenging empirical relationships to assess with bivariate associations in observational data. This approach provides inference about environmental variables while predicting N-fixation. The outcomes of this study will provide a foundation for improving the understanding of N-fixation within the context of sustainable intensification of soybean production.Entities:
Keywords: LASSO; elastic net; relative abundance of ureides; ridge; symbiotic nitrogen fixation
Year: 2021 PMID: 34211487 PMCID: PMC8239404 DOI: 10.3389/fpls.2021.675410
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Crop, soil, and weather variable description, units, means, and range of observations.
| Variable | Description | Unit | Mean, [min, max] | Group |
|---|---|---|---|---|
| MG | Maturity group | - | 3.0, [1.0, 4.6] | Crop |
| S.Phe | Ureide sampling relative stage | - | 1.6, [1.3, 1.9] | Crop |
| S.Length | Season length | days | 98, [75, 112] | Crop |
| Sowing | Sowing date of the treatments | day of year | 151, [126, 177] | Crop |
| CEC | Soil cation exchange capacity | cmolc dm−3 | 17, [3, 27] | Soil |
| Clay | Clay relative content | % | 20, [7, 31] | Soil |
| N-NO3 | Soil nitrate content before crop sowing | mg dm−3 | 6.0, [1.0, 10.6] | Soil |
| OMM | Organic matter mineralization | % | 0.04, [0.01, 0.07] | Soil |
| pH | Soil pH | - | 6.5, [5.6, 7.1] | Soil |
| Sand | Sand relative content | % | 29, [9, 50] | Soil |
| Silt | Silt relative content | % | 47, [34, 63] | Soil |
| S-SO4 | Soil sulfate content before crop sowing | mg dm−3 | 7.7, [0.9, 21.3] | Soil |
| SOM | Soil organic matter | g kg−1 | 27, [6.4, 48] | Soil |
| ET0 | Cumulative reference evapotranspiration | mm | 440, [337, 602] | Weather |
| Hum | Daily mean relative air humidity | kPa | 2.1, [1.7, 2.4] | Weather |
| Prec | Cumulative rainfall precipitation | mm | 438, [170, 805] | Weather |
| Rad | Cumulative solar radiation | MJ m−2 | 2,186, [1,680, 2,595] | Weather |
| SDI | Precipitation evenness: SDI | - | 0.66, [0.53, 0.72] | Weather |
| D.str | Drought stress: Cumulative ET reduction | mm | 4.5, [0.0, 19.4] | Weather |
| T.Amp | Daily mean temperature amplitude | °C | 11, [10, 12] | Weather |
| Tmean | Daily mean temperature | °C | 22.9, [19.9, 25.6] | Weather |
| VPD | Daily mean vapor pressure deficit | kPa | 0.78, [0.64, 1.08] | Weather |
Relative phenological stages according to SoySim. Values range from 0 to 2, where 1 and 2 represent R1 and physiological maturity, respectively.
SDI: Shannon diversity index, where 1 denotes an uneven distribution and 0 implies a skewed distribution.
Weather was segmented in vegetative (Wv), pre-seed filling (Wr), and seed-filling (Ws).
Figure 1Locations of the field studies during the 2018 and 2019 growing seasons. Red points are environments with control, reinoculation, and N-fertilizer; and blue points are environments with only control and N-fertilizer.
Figure 2Predicted and observed data for the relative abundance of ureides (RAU; A), seed yield (B), and seed protein concentration (C). Elastic net regression model considering all the environment, crop, and management covariables (full model).
Full models precision and accuracy metrics for the RAU, yield, and protein concentration.
| Model | MAE | RMSE | RRMSE | R2 | |
| RAU (%) | Null | 6.1 (±2.8) | 7.0 (±3.3) | 8.1 (±4.6) | |
| Full | 3.4 (±1.5) | 3.9 (±1.6) | 4.5 (±2.2) | 0.66 (±0.3) | |
| Yield (Mg ha−1) | Null | 0.4 (±0.1) | 0.5 (±0.2) | 11.9 (±4.2) | |
| Full | 0.2 (±0.1) | 0.3 (±0.1) | 6.9 (±3.2) | 0.68 (±0.3) | |
| Protein (g kg−1) | Null | 10 (±3.3) | 12 (±3.8) | 2.9 (±0.9) | |
| Full | 4.8 (±1.6) | 5.45 (±1.8) | 1.4 (±0.4) | 0.70 (±0.3) | |
| Environment | Residual | ||||
| RAU | Null | 31.5 (±3.1) | 26.6 (±1.4) | ||
| Full | 0 (±0) | 9.2 (±0.5) | |||
| Yield | Null | 0.10 (±0.01) | 0.13 (±0.01) | ||
| Full | 0.001 (±0) | 0.01 (±0.01) | |||
| Protein | Null | 118.5 (±4.6) | 56.5 (±1.7) | ||
| Full | 0 (−±0) | 16.1 (±0.7) | |||
Mean absolute error (MAE), root mean squared error (RMSE), relative root mean squared error (RRMSE), and coefficient of determination (R2). (a), Full model variance partition; (b) Random effects variance.
Median value followed by the standard deviation from the cross-validation procedure.
Figure 3Centered and scaled covariables coefficients of the full model for the RAU (A), seed yield (B), and protein concentration (C). Coefficients were grouped in high (red), medium-high (blue), medium-low (green), and low (purple) magnitude clusters by the k-means algorithm. Coefficients with red letters were shrunk to zero by the elastic net regularization process. Points represent the median from the 20 training-test combinations, while horizontal lines range from the minimum to the maximum coefficient values across sets.
Figure 4The overlaying of predicted values of the full and reduced model for the RAU. The dashed black line indicates the 1:1 relation (A). Centered and scaled covariables coefficients of the reduced linear regression model. Colors indicate the coefficients directions: negative are reds and positive are blues. Points represent the median from the 20 training-test combinations, while horizontal lines range from the minimum to the maximum coefficient values across sets (B).
Figure 5Partial dependency plots of environmental effects affecting RAU: Nitrogen (N)-fertilization (A); sowing date (DOY; B); soil nitrate concentration (C); sand concentration (D); soil organic matter (SOM; E); soil sulfate concentration (F); soil cation exchange capacity (CEC; G); precipitation during pre-seed filling (H); radiation during pre-seed filling (I); vapor pressure during pre-seed filling (J); vapor pressure during seed filling (K); drought stress during seed filling (L). Solid lines represent a segmented mean for seven intervals comprised within the variables range. Red lines represent the absence of observations.