| Literature DB >> 34075079 |
Faezeh Akhavizadegan1, Javad Ansarifar2, Lizhi Wang2, Isaiah Huber3, Sotirios V Archontoulis3.
Abstract
The performance of crop models in simulating various aspects of the cropping system is sensitive to parameter calibration. Parameter estimation is challenging, especially for time-dependent parameters such as cultivar parameters with 2-3 years of lifespan. Manual calibration of the parameters is time-consuming, requires expertise, and is prone to error. This research develops a new automated framework to estimate time-dependent parameters for crop models using a parallel Bayesian optimization algorithm. This approach integrates the power of optimization and machine learning with prior agronomic knowledge. To test the proposed time-dependent parameter estimation method, we simulated historical yield increase (from 1985 to 2018) in 25 environments in the US Corn Belt with APSIM. Then we compared yield simulation results and nine parameter estimates from our proposed parallel Bayesian framework, with Bayesian optimization and manual calibration. Results indicated that parameters calibrated using the proposed framework achieved an 11.6% reduction in the prediction error over Bayesian optimization and a 52.1% reduction over manual calibration. We also trained nine machine learning models for yield prediction and found that none of them was able to outperform the proposed method in terms of root mean square error and R2. The most significant contribution of the new automated framework for time-dependent parameter estimation is its capability to find close-to-optimal parameters for the crop model. The proposed approach also produced explainable insight into cultivar traits' trends over 34 years (1985-2018).Entities:
Year: 2021 PMID: 34075079 PMCID: PMC8169860 DOI: 10.1038/s41598-021-90835-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The structure of parameter calibration in the simulation of crop growth and development.
Figure 2The proposed framework of time-dependent parameter calibration.
Figure 3Algorithmic diagrams for PBO and BO.
Figure 4(Left) 34-year average thermal time sum (C-days) from June 1 to August 31 for 3 states in the US Corn Belt. The five counties used to test the framework are shown on the map. (Right) The plot shows the scattering of selected locations in terms of soil organic carbon (in %) and plant available water (in cm). The circles with the same color illustrate locations in one specific county. The cultivars maturity in locations #1 to #5 are 114-day hybrids, locations #6 to #10 are 115-day hybrids, location #11 to #15 are 112-day hybrids, for locations #16 to #20 are 110-day hybrids, locations #21 to #22 are 107-day hybrids, and locations #23 to #25 are 108-day hybrids.
List of parameters and acronyms used in this study, with their definitions, values, and units.
| Parameter or abbreviation | Definition | Value or Range | Unit |
|---|---|---|---|
| tt_emerg_to_endjuv | Thermal time from emergence to end of juvenile phase | [150,300] | |
| tt_flower_to_maturity | Thermal time from flowering to maturity | [600,900] | |
| head_grain_no_max | Potential grains per head | [600,850] | kernel/ear |
| grain_gth_rate | Potential grain growth rate | [5,9] | mg/grain/d |
| tt_flower_to_start_grain | Thermal time from flowering to start grain fill | [120,200] | |
| n_conc_crit_grain | Critical N concentration of grain | [0.008,0.016] | g N/g biomass |
| leaf_app_rate1 | Thermal time required to develop a leaf ligule for first leaves | [50,75] | deg day |
| Rue (max_stage) | Radiation use efficiency in each stage | [1.4,1.85] | g dm/mj |
| transp_eff_cf | Transpiration efficiency coefficient | [0.075,0.095] | kpa |
RMSE in kg/ha (and RRMSE) for five selected counties over simulation period (1985 to 2018).
| Method | Logan (IL) | Greene (IN) | Keokuk (IA) | Boone (IA) | Obrien (IA) |
|---|---|---|---|---|---|
| APSIM-Manual | 2386 (23.03%) | 2820 (33.48%) | 2612 (29.36%) | 2357 (23.69%) | 2482 (24.42%) |
| APSIM-BO | 1442 (13.93%) | 1520 (18.05%) | 1590 (17.88%) | 1288 (12.95%) | 992 (9.77%) |
| APSIM-PBO | 1266 (12.23%) | 1459 (17.32%) | 1358 (15.26%) | 1076 (10.82%) | 865 (8.51%) |
Figure 5Yield predictions of five selected counties for the simulation period (1985–2018). Five sub-figure corresponds to five counties, and one indicates the average of all five counties. The black line indicates the observed county-level yield from the National Agricultural Statistics Service of the United States. For each county, the median predicted yields of five locations from PBO, BO, and manual methods are shown by a solid blue line, dashed red line, and dash-dot green line, respectively. The error bar illustrates our prediction interval that shows the standard deviation of prediction over five locations at each year.
Prediction performance of the proposed approach and 8 other approaches for five selected counties for five test years (2014 to 2018) at the end of the growing season.
| Criteria | Method | Test year | ||||
|---|---|---|---|---|---|---|
| 2014 | 2015 | 2016 | 2017 | 2018 | ||
| RMSE | Linear regression I | 2170 | 738 | 1567 | 742 | 1463 |
| Linear regression II | 1734 | 1476 | 1723 | 1110 | 1405 | |
| Lasso | 1143 | 927 | 999 | 834 | 657 | |
| Ridge | 1153 | 2137 | 1565 | 1427 | 1123 | |
| Elastic net | 1214 | 1430 | 1848 | 979 | 997 | |
| Bayesian ridge | 1153 | 1099 | 1382 | 758 | 698 | |
| Gradient boosting | 944 | 840 | 1162 | 713 | 359 | |
| Random forest | 910 | 547 | 1103 | 430 | ||
| Deep neural network | 923 | 705 | 992 | 717 | 493 | |
| APSIM-Manual | 1549 | 2041 | 1623 | 2926 | 1337 | |
| APSIM-BO | 955 | 631 | 904 | 685 | 498 | |
| APSIM-PBO | 630 | |||||
| RRMSE | Linear regression I | 17.80% | 6.43% | 12.45% | 6.18% | 11.48% |
| Linear regression II | 14.22% | 12.86% | 13.68% | 9.24% | 11.03% | |
| Lasso | 9.38% | 8.08% | 7.94% | 6.94% | 5.16% | |
| Ridge | 9.45% | 18.62% | 12.43% | 11.88% | 8.82% | |
| Elastic net | 9.96% | 12.46% | 14.67% | 8.15% | 7.83% | |
| Bayesian ridge | 9.46% | 9.58% | 10.98% | 6.31% | 5.48% | |
| Gradient boosting | 7.74% | 7.32% | 9.22% | 5.94% | 2.82% | |
| Random forest | 7.46% | 4.77% | 8.76% | 3.38% | ||
| Deep neural network | 7.57% | 6.14% | 7.87% | 5.97% | 3.87% | |
| APSIM-Manual | 12.70% | 17.79% | 12.89% | 24.36% | 10.50% | |
| APSIM-BO | 7.84% | 5.50% | 7.18% | 5.71% | 3.91% | |
| APSIM-PBO | 5.25% | |||||
| R2 | Linear regression I | -2.36 | 0.36 | -0.21 | 0.66 | -0.53 |
| Linear regression II | 0.03 | -0.24 | -0.42 | 0.75 | 0.37 | |
| Lasso | 0.09 | 0.11 | 0.63 | 0.60 | 0.69 | |
| Ridge | 0.43 | -0.20 | -0.19 | 0.71 | 0.40 | |
| Elastic net | 0.42 | 0.01 | -0.57 | 0.81 | 0.55 | |
| Bayesian ridge | 0.30 | 0.00 | 0.05 | 0.78 | 0.70 | |
| Gradient boosting | 0.47 | 0.31 | 0.36 | 0.81 | 0.91 | |
| Random forest | 0.46 | 0.66 | 0.42 | 0.87 | ||
| Deep neural network | 0.39 | 0.43 | 0.59 | 0.73 | 0.82 | |
| APSIM-Manual | -0.05 | 0.16 | -0.19 | 0.11 | -0.02 | |
| APSIM-BO | 0.37 | 0.56 | 0.69 | 0.75 | 0.84 | |
| APSIM-PBO | 0.79 | |||||
Bold values are used to highlight the result of the best models.
Figure 6Each subplot indicates the linear trend of each county’s tuned parameter and its interval value during the simulation period (1985–2018). For each county, the average trend of its locations is shown with the dash-dot line. The dash-dot colored lines show the linear trend of tuned parameters at different counties. The solid black lines and dashed black lines illustrate the average ± standard deviation of linear parameters’ trends over all five counties.
The linear trends of estimated parameters with the proposed parameter calibration framework with PBO method.
| Parameter or abbreviation | Measurement | Change per year |
|---|---|---|
| tt_emerg_to_endjuv | Determines silking time | |
| tt_flower_to_maturity | Determines the silking to maturity duration | 2.32 |
| head_grain_no_max | Determines the potential number of kernels per ear | 1.41 kernel/ear |
| grain_gth_rate | Determines the grain growth rate (proxy for harvest index) | 0.03 mg/grain/d |
| tt_flower_to_start_grain | Determines the duration from silking to start grain filling | |
| n_conc_crit_grain | Determines the critical grain N concentration | |
| leaf_app_rate1 | Determines the how fast the first 10 leaves appear | |
| Rue (max_stage) | Reflects the canopy photosynthetic capacity | 0.001 g dm/mj |
| transp_eff_cf | Determines the efficiency of water |