| Literature DB >> 36052879 |
Fabrizio Albanito1, David McBey1, Matthew Harrison2, Pete Smith1, Fiona Ehrhardt3,4, Arti Bhatia5, Gianni Bellocchi6, Lorenzo Brilli7, Marco Carozzi8, Karen Christie9, Jordi Doltra10, Christopher Dorich11, Luca Doro12,13, Peter Grace14, Brian Grant15, Joël Léonard16, Mark Liebig17, Cameron Ludemann18, Raphael Martin6, Elizabeth Meier19, Rachelle Meyer20, Massimiliano De Antoni Migliorati14,21, Vasileios Myrgiotis22, Sylvie Recous23, Renáta Sándor24, Val Snow25, Jean-François Soussana3, Ward N Smith15, Nuala Fitton1.
Abstract
There is a growing realization that the complexity of model ensemble studies depends not only on the models used but also on the experience and approach used by modelers to calibrate and validate results, which remain a source of uncertainty. Here, we applied a multi-criteria decision-making method to investigate the rationale applied by modelers in a model ensemble study where 12 process-based different biogeochemical model types were compared across five successive calibration stages. The modelers shared a common level of agreement about the importance of the variables used to initialize their models for calibration. However, we found inconsistency among modelers when judging the importance of input variables across different calibration stages. The level of subjective weighting attributed by modelers to calibration data decreased sequentially as the extent and number of variables provided increased. In this context, the perceived importance attributed to variables such as the fertilization rate, irrigation regime, soil texture, pH, and initial levels of soil organic carbon and nitrogen stocks was statistically different when classified according to model types. The importance attributed to input variables such as experimental duration, gross primary production, and net ecosystem exchange varied significantly according to the length of the modeler's experience. We argue that the gradual access to input data across the five calibration stages negatively influenced the consistency of the interpretations made by the modelers, with cognitive bias in "trial-and-error" calibration routines. Our study highlights that overlooking human and social attributes is critical in the outcomes of modeling and model intercomparison studies. While complexity of the processes captured in the model algorithms and parameterization is important, we contend that (1) the modeler's assumptions on the extent to which parameters should be altered and (2) modeler perceptions of the importance of model parameters are just as critical in obtaining a quality model calibration as numerical or analytical details.Entities:
Keywords: AgMIP; biogeochemical models; climate change; greenhouse gases; model calibration; model ensembles; model intercomparison; multi-criteria decision-making; soil carbon
Mesh:
Substances:
Year: 2022 PMID: 36052879 PMCID: PMC9494747 DOI: 10.1021/acs.est.2c02023
Source DB: PubMed Journal: Environ Sci Technol ISSN: 0013-936X Impact factor: 11.357
Figure 1Framework of the variable partition between input categories and variables used in the five stages of the model ensemble protocol described in Ehrhardt et al. (2018).[3]
Figure 2Steps of the multi-criteria decision method process combining the decision-making trial and evaluation laboratory (DEMATEL) and the analytic network process (ANP) methods. Through DEMATEL, we visualize the perceived relationship existing between different variable categories. While in ANP, the strength of the relationships outlined in DEMATEL is integrated in a network of dependencies and feedback among input variables to determine their relative importance across the five stages of the modeling protocol.
Background Information Reported by Age Class of the Modelers Participating in the Multi-stage Intercomparison Protocol and MCDM Surveya
| AC | N | F (%) | PhD (%) | FTC (%) | E (%) | MU | MP | MT |
|---|---|---|---|---|---|---|---|---|
| 25–34 | 4 | 0 | 50 | 75 | 50 | from 1 to 4 | from 1 to 4 | Daycent, DNDC, Manure DNDC, Century, SPA/DALEC, EU-Rotate_N, FASSET, FarmAC |
| 35–44 | 7 | 29 | 100 | 71 | 86 | from 1 to 7 | from 1 to 4 | CERES-EGC, PaSim, FarmSim, EcoSys, Armosa, Daycent, DSSAT, EPIC, APEX, ModVege, Gemini, DairyMod, APSIM, GrassGro, AusFarm, GrazFeed, SGS, FarMax |
| 45–54 | 7 | 43 | 57 | 43 | 100 | from 1 to 7 | from 1 to 7 | AusFarm, DNDC, Daycent, Century, Tier II IPCC, RZWQM2, LEACHM, InfoCrop, DSSAT, STICS, Daycent, Century, SGS, DairyMod, RothC, DairyMod, GrassGro |
| 55–64 | 1 | 100 | 0 | 0 | 100 | 3 | 3 | Overseer, FarMax, APSIM |
N = number of modelers, F = proportion of modelers identified as female, PhD = proportion of modelers holding a PhD degree, FTC = the proportion of modelers with a fix-term contract, MU = knowledge on the number of models, MP = number of models published in peer-reviewed articles, and MT = type of models used.
Summary of the Importance of the Input Variables and their Categories, the Consistency Ratio of the Modeler’s Judgments in the Pairwise Comparison Matrix of Each Input Category, and the Level of Kendall Concordance between the Modelers
The ranking scores (i.e., importance) with letters m and e are significantly different at the p < 0.05 level between model types and modeler’s experience groups, respectively. * indicates the level of concordance significant within each variable category at the p < 0.05 level. Footnote: the color gradient indicates where the relative importance of each input variable falls within each variable category.
Cumulative Importance of the Five Stages of the Model Ensemble Protocola
| input category | input variable | stage 1 (0.67 ± 0.02) | stage 2 (0.11 ± 0.01) | stage 3 (0.06 ± 0.03) | stage 4 (0.05 ± 0.01) | stage 5(0.11 ± 0.02) |
|---|---|---|---|---|---|---|
| soil information | soil type | 0.01 ± 0.01 | ||||
| soil texture | 0.05 ± 0.04 | |||||
| bulk density | 0.04 ± 0.15 | |||||
| SOC stock | 0.03 ± 0.13 | |||||
| SON stock | 0.02 ± 0.09 | |||||
| pH | 0.02 ± 0.09 | |||||
| soil mineral N | 0.02 ± 0.07 | |||||
| FC, WFPS, and CEC | 0.04 ± 0.19 | |||||
| other soil information | 0.01 ± 0.06 | |||||
| climate exp. | air temperature | 0.03 ± 0.03 | ||||
| precipitation | 0.04 ± 0.03 | |||||
| solar radiation | 0.03 ± 0.01 | |||||
| air humidity | 0.01 ± 0.01 | |||||
| atm. Pressure | 0.01 ± <0.00 | |||||
| other climate factors | 0.01 ± <0.00 | |||||
| management practices during experiment | crop residues | 0.02 ± 0.01 | ||||
| fertilization rates | 0.06 ± 0.03 | |||||
| fertilization mode | 0.02 ± 0.01 | |||||
| fertilizer type | 0.03 ± 0.02 | |||||
| irrigation | 0.05 ± 0.02 | |||||
| frequency of plowing | 0.02 ± 0.01 | |||||
| frequency other activities | 0.02 ± 0.01 | |||||
| intercropping | 0.02 ± 0.01 | |||||
| freq. harvest, grazing, and cut in grass | 0.03 ± 0.01 | |||||
| general site information | crop type | 0.02 ± 0.01 | ||||
| location | 0.01 ± 0.01 | |||||
| terrain info | <0.00 ± <0.00 | |||||
| experimental length | 0.01 ± 0.02 | |||||
| mean regional yield | 0.01 ± 0.01 | |||||
| long-term climate | air temperature | 0.01 ± <0.00 | ||||
| precipitation | 0.01 ± 0.01 | |||||
| solar radiation | 0.01 ± 0.01 | |||||
| air humidity | <0.00 ± <0.00 | |||||
| atm. Pressure | <0.00 ± <0.00 | |||||
| other climate factors | <0.00 ± <0.00 | |||||
| long-term management practices | fertilization rates | 0.01 ± 0.01 | ||||
| fertilization mode | <0.00 ± <0.00 | |||||
| fertilizer type | <0.00 ± <0.00 | |||||
| irrigation | 0.01 ± 0.01 | |||||
| frequency of harvest | 0.01 ± 0.01 | |||||
| frequency of plowing | 0.01 ± <0.00 | |||||
| frequency other activities | <0.01 ± <0.00 | |||||
| crop residues | 0.01 ± <0.00 | |||||
| intercropping | 0.01 ± <0.00 | |||||
| land use history | 0.01 ± 0.01 | |||||
| experimental data from site | annual extracted yield | 0.03 ± 0.03 | ||||
| vegetation data (phenology, LAI) | 0.03 ± 0.03 | |||||
| soil temperature | 0.01 ± 0.01 | |||||
| soil moisture | 0.03 ± 0.01 | |||||
| soil mineral N | 0.02 ± 0.01 | |||||
| SOC and SON | 0.02 ± 0.01 | |||||
| GPP and NEP | 0.02 ± 0.02 | |||||
| NEE and Reco | 0.02 ± 0.02 | |||||
| soil N losses | 0.02 ± 0.01 | |||||
| N2O and/or CH4 | 0.03 ± 0.03 |
Within each stage, the ranking of the input variables shown in Table was normalized over the importance score of their corresponding categories. Footnote: SOC = soil organic carbon, SON = soil organic nitrogen, FC = field capacity, WFPS = water field pore space, CEC = cation exchange capacity, GPP = gross primary production, NEP = net ecosystem production, NEE = net ecosystem exchange, and Reco = ecosystem respiration.
Figure 3Table reports the total relation matrix of DEMATEL summarizing the level (mean ± sd) of direct and indirect influence given (G) and received (R) in each input category, the net influence (G – R), and the total level of influence (or dominance) (G + R) of the model input category used in the model ensemble study. Categories with positive G – R have a net influence toward the value of other variable categories and are denoted as “influential” categories. The circular diagram outlines the causal relationship in the model ensemble protocol between general site information (SI; red lines), climate during experiment (CL; green lines), long-term climate (LTCL; purple lines), management practices during experiments (MPDE), long-term management practices (LTMP), environmental data from site (EDS), and soil information (SOI). The arrows in the diagram show the direction and the level of influence that each input category gives and receives from other categories. The colored arrows highlight the three variable categories that resulted in being net influencers in the model ensemble protocol (i.e., positive G – R). Radial bar numbers represent the total level of influence R + C and the relative percentage of the casual relationship within each input category.
Figure 4Table summarizes the relative root mean square error (RRMSE) averaged across 19 models for the ensemble simulations of soil N2O emissions from arable and grassland systems, crop yields of annual crop monocultures such as maize, wheat, and rice, and above-ground net primary productivity in grassland (ANPP). Pi corresponds to the cumulative modeling importance of the input variable accessed in the five stages of the model ensemble framework. MER represents the rate of model simulation error for yield, N2O, and ANPP per unit of modeling importance in each stage. The bar chart below the table outlines the trend of MER across the five stages of the model ensemble protocol.