Literature DB >> 35024393

Simulated dataset of corn response to nitrogen over thousands of fields and multiple years in Illinois.

German Mandrini1, Sotirios V Archontoulis2, Cameron M Pittelkow3, Taro Mieno4, Nicolas F Martin1.   

Abstract

Nitrogen (N) fertilizer recommendations for corn (Zea mays L.) in the US Midwest have been a puzzle for several decades, without agreement among stakeholders for which methodology is the best to balance environmental and economic outcomes. Part of the reason is the lack of long-term data of crop responses to N over multiple fields since trial data is often limited in the number of soils and years it can explore. To overcome this limitation, we designed an analytical platform based on crop simulations run over millions of farming scenarios over extensive geographies. The database was calibrated and validated using data from more than four hundred trials in the region. This dataset can have an important role for research and education in N management, machine leaching, and environmental policy analysis. The calibration and validation procedure provides a framework for future gridded crop model studies. We describe dataset characteristics and provide thorough descriptions of the model setup.
© 2021 The Author(s).

Entities:  

Keywords:  APSIM; Crop modeling; Maize; Nitrogen fertilizer; Nitrogen leaching

Year:  2021        PMID: 35024393      PMCID: PMC8728579          DOI: 10.1016/j.dib.2021.107753

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

This datasets provide an enormous amount of calibrated response curves of several variables to increasing N rates in one of the most productive areas in the world It can be used in many different types of studies focused on N management, from an agricultural and environmental perspective Some possible ideas are comparing different strategies can be N prediction methods, evaluate policies designed to lower N leaching, evaluation of variable rate N applications Machine learning researchers can use the datasets for benchmarking the performance of different algorithms for predicting N rates; Educators can use the datasets for machine learning problems, statistics, or data mining training. The simulation and calibration methodology is innovative and can be used for other simulations, including different crops or areas than the ones shown here

Data Description

We provide several datasets in this paper used in the research article “Understanding differences between static and dynamic nitrogen fertilizer tools using simulation modeling” [1]. The datasets consist of soil and weather information, the fields’ locations, and the simulations’ output for 4270 fields over 30 years.

Spatial files

cells_sf: polygons of the 10 x 10 km cells on which the state of Illinois was divided. The id_10 is an identifier of each cell. It also includes the region (South, Central, North), the county, and the average area planted to corn (ha/year) in 2008 to 2019. fields_sf: polygons of the 40-ha fields. The id_field (1–4) is the identifier of each field inside a cell. It also contains the id_10, the region, and if a field was used as a trial or evaluation field. soils_sf: polygons of the soils inside each field. The mukey is the identifier of each soil. It also contains the id_10, the id_field. Only the three main soils were selected for the simulations, and the column mukey_rank identifies them with a number from 1 to 3 (being 1 the largest and 3 the smallest).

Weather data series

The file weather_historic_dt is a table that describes the weather based on the grid of cells (10 x 10 km) provided by Daymet [2]. The table contains the id_10, the year, the daily temperature (minimum, medium, and maximum), rainfall, and radiation.

Soil information

The file soils_horizons_dt is a table that describes the soils’ layers of each field. The mukey is the identifier of each soil. It includes the water table depth, slope, sand, clay, organic matter, vertical saturated hydraulic conductivity (ksat), lower and upper volumetric water content limit (ll and dul), and ph.

APSIM output

The file yield_curve_soil_dt contains the output of the simulations at the soil level (mukey). The file yield_curve_field_dt includes the output of the simulations at the field level (id_10 and id_field), aggregated considering the area of each of the soils inside a field. A description of the columns is provided (Table 1).
Table 1

Database characterization, with identification variables and APSIM output variables.

VariableDescriptionUnits
regionregion identification (1-South, 2-Cental, 3-North)
id_10cell identification number
id_fieldfield identification number (1 to 4)
stationtrial field (1) or evaluation field (0)
yearyear of the corn simulation (1989-2018)
N_fertNitrogen added as fertilizer in v5kg/ha
YieldYield of the corn in with 15% Moisturekg/ha
LTotal 2-years N leaching during corn and soybean.From April 1st year (x) to March 31st year (x+2)N kg/ha
clay_40cmClay content (0-20 cm)%
day_sowPlanting dateJulian date
day_v5Date when the corn reached v5Julian date
dul_depDrained upper limit (DUL) soil water capacitymm
esw_pct_v5Extractable soil water (ESW) at v5%
LAI_maxmaximum LAI achieved by the cornm2/m2
lai_v5Leaf Area Index at v5m2/m2
ll15_depCrop lower limit soil water capacitymm
n_0_60cm_v5Soil N (NO3 and NH4) from 0 to 60 cm at v5kg/ha
n_20cm_v5Soil N (NO3 and NH4) from 0 to 20 cm at v5kg/ha
n_40cm_v5Soil N (NO3 and NH4) from 0 to 40 cm at v5kg/ha
n_60cm_v5Soil N (NO3 and NH4) from 0 to 60 cm at v5kg/ha
n_deep_v5Soil N (NO3 and NH4) from top to bottom at v5kg/ha
n_uptakeTotal N uptaken by the corn crop during the seasonkg/ha
oc_20cm_v5Soil Organic Carbon at v5 (0-20 cm)%
oc_40cm_v5Soil Organic Carbon at v5 (0-40 cm)%
rad_1Average solar radiation during first period (1 Jan. to planting)MJ/m2/day
rad_2Average solar radiation during second period (planting to v5)MJ/m2/day
rad_3Average solar radiation during third period (v5- R1)MJ/m2/day
rad_4Average solar radiation during fourth period (R1-R3)MJ/m2/day
rad_5Average solar radiation during fifth period (R3-R6)MJ/m2/day
rad_6Average solar radiation during sixth period (harvest-Dec 31)MJ/m2/day
rain_1Total precipitation during first period (1 Jan. to planting)mm
rain_2Total precipitation during second period (planting to v5)mm
rain_3Total precipitation during third period (v5-R1)mm
rain_4Total precipitation during fourth period (R1-R3)mm
rain_5Total precipitation during fifth period (R3-R6)mm
rain_6Total precipitation during sixth period (darvest-Dec 31)mm
restrictionSoil restrictionmm
sand_40cmSand content (0-20 cm)%
surfaceom_wt_v5Surface residue weight at v5kg/ha
sw_dep_v5Soil water content at v5mm
swdef_expan_fwMean water stress on expansion around flowering (APSIM corn stages 6 to 8)0-1
swdef_pheno_fwMean water stress on phenology around flowering (APSIM corn stages 6 to 8)0-1
swdef_photo_fwMean water stress on photosinthesis around flowering (APSIM corn stages 6 to 8)0-1
tmean_1Average air temperature during first period (1 Jan. to planting)C
tmean_2Average air temperature during second period (planting to v5)C
tmean_3Average air temperature during third period (v5-R1)C
tmean_4Average air temperature during fourth period (R1-R3)C
tmean_5Average air temperature during fifth period (R3-R6)C
tmean_6Average air temperature during sixth period (harvest-Dec 31)C
whcWater holding capacitymm
Y_corn_lt_avgMean yield at EONR (for the other 29 years)kg/ha
Y_soyYield of soy with 13% Moisture (year+1)kg/ha
Yld_lt_avgMean yield at EONR (for the other 29 years)kg/ha
Database characterization, with identification variables and APSIM output variables.

Tutorial script

This R Markdown file shows an example of how the data can be used for education or research purposes. It loads the needed files on the script and trains a static and dynamic model with the research fields and the first 15 years of data. Then, it evaluates both models in the evaluation fields in the following 15 years. It finally shows the economic and environmental value of dynamic recommendations on a map.

Experimental Design, Materials and Methods

The APSIM software

The main goal of the simulations is to obtain information on corn response to increasing N rates for a broad combination of weather and soil conditions. For that, we built upon the simulations presented on [3] with the following adjustments: no-spin up simulations were run, and initial N in the soil was set randomly among a reasonable range, simulations were updated to include a water table when needed, and hybrid parameters were modified to match corn yields per region better. More details about these adjustments are explained later in detail in this work, and the validation results will be presented. The simulations were conducted using the Agricultural Production Systems sIMulator (APSIM) [4] version 7.10 to generate and calibrate a database for thousands of fields in Illinois. A total of more than 6 million simulations were executed using the Illinois Campus Cluster, a computing resource supported by the University of Illinois at Urbana-Champaign. It is operated by the Illinois Campus Cluster Program (ICCP) in conjunction with the National Center for Supercomputing Applications (NCSA). The APSIM simulation framework reproduces various processes related to the crop-soil system and environmental factors, allowing for the interaction of these processes in daily simulations. Related processes are grouped into modules. In this study, we used the maize and soybean modules to simulate crop growth, SoilWat for water balance simulation, SurfaceOM for simulation of residue decomposition, and soilN for simulation of soil carbon and N cycle. In order to reflect the conditions of the Midwest, we modified the soybean and SurfaceOM models according to [5] and the maize module according to [6]. These parameters were guided by calibration and literature and allowed APSIM to better represent the Midwest’s growing conditions, as evidenced in the results.

Input files creation

To guide the simulations and represent the soil and weather variation seen in the region, we divided the state of Illinois into a grid of 10 x 10 km “cells” (Fig. 1a). Four 40-ha square, artificially determined “fields” were then located within each cell (Fig. 1b). These fields were selected from within areas that had been planted to corn for at least five years between 2008 and 2019 according to the USDA Crop Frequency Layer (target area). The fields did not follow actual field boundaries and were allowed to contain parts of multiple actual fields. For cells that did not contain enough target area to create four fields, the maximum possible number of fields were selected, even if this equaled less than four fields. This process yielded 4270 fields, and provided a strong foundation for simulations by increasing simulations in areas with high contribution to crop production and limiting simulations in areas with a low contribution to crop production.
Fig. 1

(a) Map of Illinois, showing the grid of cells, the three regions, and the 4270 fields (blue dots). (b) One cell with four fields randomly placed in the target area. (c) Soils obtained for one of those fields.

(a) Map of Illinois, showing the grid of cells, the three regions, and the 4270 fields (blue dots). (b) One cell with four fields randomly placed in the target area. (c) Soils obtained for one of those fields. Simulations were conducted using the historical weather for the period 1989-2019. The weather information was obtained from DAYMET [2], using the R “daymetr” package [7]. The weather information consisted of daily radiation, temperature (minimum and maximum), and precipitation, and the same weather was used for all fields in a particular cell. The makeup of the soil on each of the fields was determined using the USDA Soil Survey Geographic Database (SSURGO) [8]. For this, we first obtain the soil map unit polygons (Fig. 1 c). These polygons provided the identifier of the soil (known as mukey) and the area. For each soil, we obtained profile information by searching gSSURGO using the R soilDB2 package [9], and transformed it into APSIM parameters following the methodology found in [10]. If SSURGO (through the mukey) indicated the presence of several soils in a particular field, the three largest ones were chosen for the simulation, and additional soils from the field were distributed proportionally among the main three. A maximum, constant soil depth of 200 cm was applied to all fields. The Root exploration factor (xF) in the south region was set to 0.1 for soil layers below 1.5 m, slowing the root’s front advance, while in the other two regions, it was set to 1 for all the layers. The adjusted FBiom and FInert are presented in Table 2.
Table 2

Soil parameters for each region in the state of Illinois.

South
Central
North
Depth (cm)FbiomFinertFbiomFinertFbiomFinert
0-50.030.50.070.450.080.4
5-100.0250.550.060.50.070.45
10-150.020.60.050.550.060.47
15-200.0150.750.0350.60.050.48
20-400.0150.80.0150.650.040.49
40-600.010.850.010.70.020.5
60-800.0050.90.0050.750.010.55
80-1000.0010.950.0050.80.010.75
100-1500.0010.970.0010.920.0050.9
150-2000.0010.990.0010.980.0010.98
Soil parameters for each region in the state of Illinois. The creation of input files also requires setting for the initial conditions from which simulations are started. The initial N concentration of the soil was randomly selected between 1 and 40 kg/ha of N–. This range of N concentrations was decided by performing a 9-year “spin-up” period test on sampled fields to determine the distribution of possible initial N rates. Soil water was set to field capacity. The initial soybean surface residue was 2000 kg/ha with a C:N ratio of 20. The initial root weight was set to 1000 kg/ha, with a C:N ratio of 13. The organic carbon was obtained from SSURGO data, and the calibration procedure obtained the fractions of the different organic pools (FBiom and FInert), explained later. Previous studies have shown that shallow water tables in the US Midwest can have a significant effect on root growth, crop growth, and yield [5]. The simulation of the water table and its impacts on the soil-plant system is complex. Previous field-scale studies used the Richard equation (SWIM model within APSIM) to enable simulation of the water table [5]. However, using this soil water module for big runs across the landscape is challenging because of the lack of physical-based parameters. In this study, we developed a simple approach to account for the impact of the water table using the SoilWat soil water module in APSIM. For this, we included a rule that saturates the soil layer at the water table depth indicated by SSURGO data. The rule was not included in soils that did not have a water table, and in them, a free drainage condition was assumed. If SSURGO informed a water table above 1 meter, it was set at 1 meter because the SSURGO database does not consider installing tile drainage systems in production fields (about 1 m depth) that decrease the depth of the water table to tile depth. This simple addition, increased crop yields in dry years, decreased root depth in wet years, and increased N losses in wet years and overall was a significant addition to more accurate optimal N rate simulation (Fig. 3, Fig. 2, Fig. 4 and 4).
Fig. 3

Validation of the EONR distribution by region, comparing multiple real trials from the MRTN data-set with simulated trials.

Fig. 2

Validation of the average shape by region of the response yield to N, comparing multiple real trials from the MRTN dataset with simulated trials.

Fig. 4

Validation of the yield distribution by region, comparing multiple real trials from the MRTN data-set with simulated trials.

Validation of the average shape by region of the response yield to N, comparing multiple real trials from the MRTN dataset with simulated trials. Validation of the EONR distribution by region, comparing multiple real trials from the MRTN data-set with simulated trials. Validation of the yield distribution by region, comparing multiple real trials from the MRTN data-set with simulated trials. Our approach to simulate a water table allows us to keep the water table at the depth informed by SSURGO. Nevertheless, the SoilWat module does not allow the inclusion of subsurface tile drainage. Tiles had been shown to accelerate N leaching [11], since when the water table reaches them, the water drains from the soil carrying all the dissolved N. To compensate for the absence of tile drainage N losses, we measured N-leaching over the corn that received the fertilizer and the following soybean. This two-year period allowed the excess N to leave the soil.

Management practices configuration

Simulations were conducted for the period 1989–2019, following a corn and soybean rotation, which is the most common cropping system in the Midwest. For that, we numbered the fields on each cell from 1 to 4, and simulations were started so that odd-numbered fields had corn in odd-numbered years and vice versa. This way, approximately half of the fields had corn, and half had soybean each year. Since our primary goal is to generate the dataset with the response of the crop and environmental variables to increasing N rates, every time a field was assigned to corn, simulations with increasing N rates, from 0 to 320 kg/ha with 10 kg/ha increments were performed -i.e., 33 N rates on each soil every two years. At the end of the period, each field provided fifteen N response curves for each soil it contains, one every two years. All simulations were divided into two-year individual simulations, one for each year x field x soil x N treatment combination. Simulations were started on January 1st on the corn year and extended until December 31st during the soybean year. Management practices for the crops included tilling of the soil every year on March 20th. Corn was planted on the mean historic last frost date of each cell (ranging from April 1st to April 30th). The plant population was nine plants/m2. All N fertilizer was applied when the crop reached the stage of five expanded leaves. We used the “B_110” hybrid included in APSIM installation, for which we adjusted the following genetic parameters: radiation use efficiency (), length from emergence to end of juvenil face (), cycle length from flower to maturity (), maximum grain number (), and maximum kernel weight (). For soybeans, the planting date was twenty days after each cell’s mean historic last frost date. This rule also determined the maturity group of the cultivar used. A group III variety (“MG_4”) of seed was planted up to May 5th, after which a group III variety (“MG_3”) was planted up to May 10th and a group II variety (“MG_2”) was planted later than May 10th. The plant population was 30 plants/m2, and no fertilizer was applied.

Validation

We used field data from the Maximum Return to Nitrogen (MRTN) [12] calculator tool (available at http://cnrc.agron.iastate.edu/) to calibrate genetic and soil parameters. MRTN is among the most significant trial networks in the area, summarizing multiple N rate trials under different weather years (461 trials at the time of accessing the tool). The MRTN tool divided the state into three regions (southern, central, and northern Illinois) whose soil and crops each differ in their response to N (Fig. 1a), and they show the results of their trials aggregated by region. We focused on three variables that summarize the response of corn to N. The first one was the shape of the yield response to N, expressed in relative values to the maximum (Fig. 2). The second and third one compared the distribution of the EONR (Fig. 3) and the yield (figure 4) for the base-level condition. The APSIM model reproduced the response of yield to N in the region accurately, including the year-to-year variations in these variables. Additionally, the simulations captured the differences in south-to-north observed in the state. Two main factors impact this pattern of response. First, the temperature change (a decrease from south to north) delays planting dates and reduces the length of the growing season. Second, the more northern areas have a higher percentage of organic matter in the soil, increasing soil N mineralization. These factors and their interaction are responsible for the lower need for N fertilizer in the northern region, demonstrated by both the higher relative yield with zero N applied (Fig. 2) and the shift of the EONR histogram towards lower N rates (Fig. 3). Simultaneously, deeper soils and milder weather growing conditions create conditions for higher yields (Fig. 4). The yield distribution showed that simulations provided lower values than the observed data. We attribute this to a “trial bias” that could have affected the MRTN experiments, where low-yielding areas were avoided to place a trial, or trials that explored extreme weather were dismissed. We decided to keep the simulations since they are still representative of the growing conditions of the region. We also validated our simulated state-wide N leaching flow. In this fourth validation, we used a methodology similar to [13] and compared the simulated N leaching for the whole state with averages of N–NO3 reported on the Mississippi River near Grafton. The simulated N leaching consisted of the base-level situation (using N rates recommended by a tool based on the MRTN methodology [1] for the period between 1990 and 2018. The N leaching for the fields was area-weighted averaged for the whole state, considering each cell average area of soybean and corn planted from USDA Crop Frequency Layer. The streamflow and water nitrate concentration was obtained from the National Water Information System (https://waterdata.usgs.gov/nwis) for the same period (1990–2018). The chosen measurement station is located at Grafton, Illinois (#05587455), and the reason is that this station represents the N loss from the state since it is located at the last point of the Mississippi River in its flow through Illinois. The measurements were cleaned with the following procedure: all values for the same month and year were averaged; if some months did not have values, they were linearly interpolated. Then, the N–NO3 concentration was multiplied by water flow to estimate monthly N–NO3-flow. Finally, the simulated N-leaching and N-flow monthly values were averaged across the different years into twelve monthly values. It is important to note that these variables are expected to show a cause-effect relationship since agricultural N loses flow slowly to the Mississippi River. However, there are other sources of N into the Mississippi river other than Illinois cropland, like urban runoff and livestock operations [14]. Additionally, other states located in the Upper Mississippi River Basin also contribute to the streamflow of N at this location. Consequently, we do not expect the relationship to be perfect, but we expect some association between both variables since Illinois agriculture is one of the major sources of N leaching in the mentioned basin. The time graph shows an association between the simulated N leaching and real N–NO3 flow. Moreover, causation is suggested since there is a lag of approximately one month between N leaching peak and valley and the corresponding peak and valley in N-NO3 flow (Fig. 5). In early spring, the increase in temperature and rain causes an increase in N mineralization, which, since there is no crop growing at that time of the year, is transported outside of the soil-crop system. At the end of spring, crops start to uptake water and N from the soil, and the flow of N leaching decreases. The flow starts to increase again at the end of the summer when crops reduce uptake when getting closer to maturity. At this time of the year, flow is lower than in spring because temperature decreases, reducing N mineralization and freezing water streams. This validation is encouraging, suggesting that our N leaching simulations capture the pattern of N losses in the state.
Fig. 5

Comparison of simulated N-leaching and reported N–NO3 flow at the USGS Gage Station on the Mississippi River near Grafton, Illinois. Monthly flow of both sources was averaged across years.

Comparison of simulated N-leaching and reported N–NO3 flow at the USGS Gage Station on the Mississippi River near Grafton, Illinois. Monthly flow of both sources was averaged across years.

Ethics Statement

This work presented involved extensive crop simulations across the state of Illinois and it did not involve working with animals or humans. This manuscript presents a dataset that is the authors’ original work and co-submitted with the manuscript “Understanding differences between static and dynamic nitrogen fertilizer tools using simulation modeling” published at Agricultural Systems (https://doi.org/10.1016/j.agsy.2021.103275) and is not currently being considered for publication elsewhere. The paper reflects the authors’ own research and analysis in a truthful and complete manner. In addition, the paper properly credits the meaningful contributions of co-authors and co-researchers. All sources used are adequately disclosed. All authors have been personally and actively involved in substantial work leading to the paper and will take public responsibility for its content

CRediT authorship contribution statement

German Mandrini: Conceptualization, Methodology, Software, Validation, Writing – original draft. Sotirios V. Archontoulis: Methodology, Validation, Writing – review & editing. Cameron M. Pittelkow: Methodology, Validation, Writing – review & editing. Taro Mieno: Methodology, Software, Writing – review & editing. Nicolas F. Martin: Supervision, Conceptualization, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.
SubjectAgronomy and Crop Sciences
Specific subject areaCrop and N leaching response to N fertilizer
Type of dataR objects
How data were acquiredWeather data from Daymet
Soil data from SSURGO
Simulations using APSIM version 7.1
Data formatRaw
Analyzed
Parameters for data collectionSoil level simulations were performed. Inputs for the crop model were obtained from public available data sets (DAYMET, SSURGO)
Description of data collectionData were collected for 4270 fields. The fields had a soy-corn rotation from 1989-2018. During the corn year, the field received N rates from 0 to 320 kg/ha, with 10 kg/ha. Yield, N leaching and multiple other variables were obtained as output from APSIM
Data source locationCity/Town/Region:Illinois
Country:US
Data accessibilityRepository Name: Mendeley Data:
Data identification number: 10.17632/xs5nbm4w55.1
Direct URL to data: https://data.mendeley.com/datasets/xs5nbm4w55/1
Related research articleMandrini, C. M. Pittelkow, S. V. Archontoulis, T. Mieno, N. F. Martin, Understanding differences between static and dynamic nitrogen fertilizer tools using simulation modeling, Agricultural Systems 194 (2021) 103275
  1 in total

1.  4R Water Quality Impacts: An Assessment and Synthesis of Forty Years of Drainage Nitrogen Losses.

Authors:  L E Christianson; R D Harmel
Journal:  J Environ Qual       Date:  2015-11       Impact factor: 2.751

  1 in total
  1 in total

1.  Exploring Trade-Offs Between Profit, Yield, and the Environmental Footprint of Potential Nitrogen Fertilizer Regulations in the US Midwest.

Authors:  German Mandrini; Cameron Mark Pittelkow; Sotirios Archontoulis; David Kanter; Nicolas F Martin
Journal:  Front Plant Sci       Date:  2022-04-15       Impact factor: 6.627

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.