Literature DB >> 32336915

Importance of genetic parameters and uncertainty of MANIHOT, a new mechanistic cassava simulation model.

Leidy Patricia Moreno-Cadena1,2,3,4, Gerrit Hoogenboom3,5, Myles James Fisher1, Julian Ramirez-Villegas1,6, Steven Dean Prager1, Luis Augusto Becerra Lopez-Lavalle1, Pieter Pypers4, Maria Sara Mejia de Tafur2, Daniel Wallach7, Rafael Muñoz-Carpena3, Senthold Asseng3.   

Abstract

We identified the most sensitive genotype-specific parameters (GSPs) and their contribution to the uncertainty of the MANIHOT simulation model. We applied a global sensitivity and uncertainty analysis (GSUA) of the GSPs to the simulation outputs for the cassava development, growth, and yield in contrasting environments. We compared enhanced Sampling for Uniformity, a qualitative screening method new to crop simulation modeling, and Sobol, a quantitative, variance-based method. About 80% of the GSPs contributed to most of the variation in maximum leaf area index (LAI), yield, and aboveground biomass at harvest. Relative importance of the GSPs varied between warm and cool temperatures but did not differ between rainfed and no water limitation conditions. Interactions between GSPs explained 20% of the variance in simulated outputs. Overall, the most important GSPs were individual node weight, radiation use efficiency, and maximum individual leaf area. Base temperature for leaf development was more important for cool compared to warm temperatures. Parameter uncertainty had a substantial impact on model predictions in MANIHOT simulations, with the uncertainty 2-5 times larger for warm compared to cool temperatures. Identification of important GSPs provides an objective way to determine the processes of a simulation model that are critical versus those that have little relevance.
© 2020 The Authors.

Entities:  

Keywords:  DSSAT; Enhanced sampling uniformity (eSU); Sensitivity analysis; Sobol

Year:  2020        PMID: 32336915      PMCID: PMC7161911          DOI: 10.1016/j.eja.2020.126031

Source DB:  PubMed          Journal:  Eur J Agron        ISSN: 1161-0301            Impact factor:   5.124


Introduction

Crop models require many input variables and parameters, which can limit their usefulness to represent crop performance (Grassini et al., 2015; Hunt and Boote, 1998; Makowski et al., 2006; Zhang et al., 2016). For example, genotype specific parameters (GSPs) must apply to crops grown under both limiting and non-limiting conditions. Estimating them, therefore, requires observations from experiments for a wide range of environments. The quantity and quality of the data used to estimate the GSPs is important; inaccurate GSPs cause large uncertainty in simulated crop performance (Hoogenboom et al., 2012; Makowski et al., 2006; Muñoz-Carpena et al., 2007). It is, therefore, critical to understand the sensitivity of a model to each GSP and its impact on model uncertainty (Confalonieri et al., 2010c; Esmaeili et al., 2014; Saltelli et al., 2000b; Xing et al., 2017). A sensitivity analysis of a model evaluates the sensitivity of model outputs based on the variation of the input variables and GSPs (Cariboni et al., 2007; Saltelli et al., 2008a, 2000a; Vazquez-Cruz et al., 2014). A sensitivity analysis can be local or global, depending on whether it evaluates the effect of changing parameters one at a time, or varying all the parameters simultaneously (Confalonieri et al., 2010a; Saltelli et al., 2008a, 2000a). Uncertainty analysis, in contrast, quantifies how variation in input parameters affects model outputs and is used to select the factors that cause most uncertainty in the outputs (Saltelli et al., 2008b, 2004). Sensitivity and uncertainty analysis are often conducted at the same time (Saltelli et al., 2008a). Cassava (Manihot esculenta Crantz) is an important crop for food security and income for small-holder farmers in developing countries. Cassava is resilient to drought and high temperatures and responds positively to an increase in the atmospheric CO2 concentration. Cassava may, therefore, become more relevant under climate change (Jarvis et al., 2012; Rosenthal and Ort, 2012). A new simulation model for cassava, MANIHOT, was incorporated in the Decision Support System for Agrotechnology Transfer package (DSSAT;www.DSSAT.net) (Hoogenboom et al., 2019a, 2019b; Jones et al., 2003). MANIHOT is based on the earlier CROPSIM cassava model (Matthews and Hunt, 1994). The changes to the CROPSIM model required new or modified GSPs. The definition and function of each GSP is specified, but we do not know the range each can take for different cultivars nor how it might interact with other GSPs. Moreover, MANIHOT has only been evaluated for a limited number of different regions. We undertook a sensitivity and uncertainty analysis of MANIHOT to identify which parameters contribute the most to model uncertainty. Sensitivity and uncertainty analyses have been applied extensively in crop modeling. Most studies, however, used uniform distribution to sample the parameter space (e.g. Esmaeili et al., 2014; Makowski et al., 2006; Vazquez-Cruz et al., 2014; White et al., 2005), and rarely sampled from a normal distribution (e.g. Confalonieri et al., 2010b; Dzotsi et al., 2013; Richter et al., 2010). Only one study so far has used distributions of the variables that were analyzed beyond a uniform and normal distribution (Ma et al., 2000). The goal of this study was to determine the nature of the distribution of each GSP and to conduct a global sensitivity and uncertainty analysis (GSUA) based on an enhanced sampling for uniformity (eSU).

Materials and methods

We conducted the GSUA with the MANIHOT crop simulation model of DSSAT (Hoogenboom et al., 2019a) for four contrasting growing environments using a two-step process. We first defined the probability distribution of each parameter based on a literature review and we then used the eSU method to select parameter values (Chitale et al., 2017; Khare et al., 2015). The eSU is a new qualitative method untested for crop models. Next, we used a quantitative variance-based method (Sobol, 1993) to determine the percentage of variance in the outputs that was explained by each parameter (Saltelli et al., 2009).

The MANIHOT model

MANIHOT was released as a new module for cassava in version 4.7 of DSSAT (Hoogenboom et al., 2019b). It has a simplified branching, treats leaves as cohorts, and new algorithms for the rates of leaf formation and stem growth. It also includes a new water stress factor, a spill-over strategy for biomass allocation, and different cardinal temperatures for branching, leaf growth and leaf age. MANIHOT represents the indeterminate growth and development of cassava, which, unlike many crops, does not have critical phenological phases, nor distinct physiological maturity. Crop development is driven by accumulated thermal time with different cardinal temperatures for the process of branching, and for potential size, age, and growth of leaves. The main changes to the GSPs from the CROPSIM model are: The GSPs defining branching time were reduced from 6 to 2; The GSPs that define the fraction of assimilate designated for storage root growth (SRFR) and storage root number per unit of canopy weight (SR#W) were removed; The modified algorithm for leaf growth reduced the GSPs from 6 to 1; and The GSPs for leaf appearance slope (LNSLP), node weight (NODWT) and node length (NODLT) were added. We refer to the GSPs by the mnemonic used in the DSSAT genetic descriptions for cultivar and ecotype (Table 1). B01ND (time to first branching) establishes the difference between early- and late-branching cultivars. After the first branch has formed, branching continues at a constant rate specified by B12ND. The rate of leaf formation is described by a saturation growth rate, where the interval between the appearance of new nodes increases as the crop ages. MANIHOT uses node as the basic growth unit, which includes the leaf and internode section of stem. The nodal growth rate is represented by a logistic function based on node age and cumulative number of leaves when the node appears. Potential leaf size increases from planting and reaches a maximum value when the cumulative thermal age of the crop is 900-degree days (°Cd). LAXS is the potential area of an individual leaf (Table 1) at 900 °C d and declines as the crop ages. LLIFA is the duration in thermal time after the leaf reaches its full size and before it senesces. A cohort of nodes is a group that are all created at the same time. All new branches and nodes of a cohort are symmetric. The number of nodes in a cohort is determined by the total number of apices, which is defined by the number of branches per fork (BR1F–BR4F, Table 1).
Table 1

Input probability distribution function (PDF) for the crop parameters in the MANIHOT cassava model. The PDFs were used to define the sampling for the global sensitivity and uncertainty analysis and were obtained from reported data of each parameter through a literature review.

GSPParameter descriptionPDFaStatisticsK–SbAICcndSources
B01NDThermal time from planting to first branching (°Cd)Triangularemin(a) = 189; max(b) = 1447; mode(c)=7640.105 (0.97)313.422i
B12NDMean thermal time between branching levels after the first branching (°Cd)Triangularmin = 284; max = 899; mode=4560.084 (0.99)279.922i
LAXSMaximum individual leaf area (cm2)LognormalμY = 5.748; σY = 0.314; truncation= (0.001-0.9)0.070 (0.93)725.260j
SLASSpecific leaf area (cm2/g)Normalμ = 242.613; σ = 59.975; truncation= (0.1-0.9)0.217 (0.25)246.622k
LLIFAActive leaf area duration in thermal time (°Cd) after full expansionWeibullshape(a) = 4.183; scale(b) = 1015.34; left truncation (value, c) = 100; right truncation (probability) = 0.90.039 (0.69)4682.5338l
LPEFRLeaf-petiole weight fraction (-)Gammashape(a) = 8.984; scale(s) = 0.0237; left truncation (value, b) = 0.1; right truncation (probability) = 0.90.087 (0.01)−845.9337m
LNSLPLeaf appearance slope (-) as proportion of the leaf appearance curve of reference fUniformmin = 0.7; max = 1.30g
NODWTIndividual node weight (g)Weibullshape(a) = 3.157; scale(b) = 9.253; left truncation (value, c) = 1; right truncation (probability) = 0.90.082 (0.69)359.972n
NODLInternode length (cm)LognormalμY = 0.502; σY = 0.398; truncation= (0.001-0.9)0.090 (0.76)113.955o
PARUERadiation use efficiency (g dry matter MJ−1)LognormalμY = 0.337; σY = 0.310; truncation= (0.1-0.9)0.201 (0.20)35.627p
TBLSZBase temperature for leaf development (˚C)Uniformmin = 11; max = 173hq
BR1F-BR4FBranch number per fork at fork 1–4 (#)Uniformmin = 1; max=42r
KCANPhotosynthetically active radiation (PAR) extinction parameter (-)Uniformmin = 0.58; max = 1.010.219 (0.61)12s

See Appendix B. Probability Density Functions (PDF).

Numbers in parenthesis are the p-values of the Kolmogorov-Smirnov. Null hypothesis: True distribution function of the data is equal to the hypothesized distribution function.

Akaike information criterion.

n: Number of observations.

Weibull distribution had the best fit followed by the triangular distribution; however triangular distribution was selected because it is already truncated.

The leaf appearance curve of reference was estimated in thermal time from 4 varieties in 3 locations using data from Irikura et al. (1979).

The threshold of the parameter LNSLP was defined based on initial attempts of calibration of this parameter for different varieties and locations.

The uniform distribution was set for the parameters with few observations.

(Bolaños, 1987; Irikura et al., 1979; Lian and Cock, 1979; Veltkamp, 1986).

(Alves, 2002; Alves and Setter, 2004, 2000; Ceballos and Cruz, 2012; CIAT, 1978, 1975; Connor et al., 1981; Gabriel et al., 2014; Irikura et al., 1979; Keating et al., 1982; Lian and Cock, 1979; Matthews and Hunt, 1994; Okogbenin et al., 2013; Pellet and El-Sharkawy, 1993; Pinheiro et al., 2014; Santhosh Mithra et al., 2013; Streck et al., 2014; Veltkamp, 1986).

(Fukai and Hammer, 1987; Gabriel et al., 2014; Gijzen et al., 1990; Keating et al., 1982; Matthews and Hunt, 1994; Pellet and El-Sharkawy, 1993; Rosenthal and Ort, 2012; Vandegeer et al., 2013).

(Alves, 2002; Bolaños, 1987; CIAT, 1978; Connor et al., 1981; Fukai and Hammer, 1987; Howeler, 2011; Irikura et al., 1979; Lian and Cock, 1979; Veltkamp, 1986).

(Cadavid, 1988; CIAT, 2013a; Cock, 2011; De Tafur et al., 1994; Fukai and Hammer, 1987).

(CIAT. Unpublished results., n.d.; CIAT, 2013b; Lian and Cock, 1979; Porto, 1983).

(CIAT. Unpublished results., n.d.).

(De Souza et al., 2016; El-Sharkawy and Mejia de Tafur, 2010; Ezui, 2017; Leepipatpaiboon et al., 2009; Pellet and El-Sharkawy, 1997; Veltkamp, 1986; Yamamoto et al., 2004).

(Fukai and Hammer, 1987; Keating et al., 1982; Manrique, 1992).

(Ceballos and Cruz, 2012; Lian and Cock, 1979).

(Cock, 2011; Ezui, 2017; Fukai and Hammer, 1987; June, 1993; Pellet and El-Sharkawy, 1993; Veltkamp, 1986).

Input probability distribution function (PDF) for the crop parameters in the MANIHOT cassava model. The PDFs were used to define the sampling for the global sensitivity and uncertainty analysis and were obtained from reported data of each parameter through a literature review. See Appendix B. Probability Density Functions (PDF). Numbers in parenthesis are the p-values of the Kolmogorov-Smirnov. Null hypothesis: True distribution function of the data is equal to the hypothesized distribution function. Akaike information criterion. n: Number of observations. Weibull distribution had the best fit followed by the triangular distribution; however triangular distribution was selected because it is already truncated. The leaf appearance curve of reference was estimated in thermal time from 4 varieties in 3 locations using data from Irikura et al. (1979). The threshold of the parameter LNSLP was defined based on initial attempts of calibration of this parameter for different varieties and locations. The uniform distribution was set for the parameters with few observations. (Bolaños, 1987; Irikura et al., 1979; Lian and Cock, 1979; Veltkamp, 1986). (Alves, 2002; Alves and Setter, 2004, 2000; Ceballos and Cruz, 2012; CIAT, 1978, 1975; Connor et al., 1981; Gabriel et al., 2014; Irikura et al., 1979; Keating et al., 1982; Lian and Cock, 1979; Matthews and Hunt, 1994; Okogbenin et al., 2013; Pellet and El-Sharkawy, 1993; Pinheiro et al., 2014; Santhosh Mithra et al., 2013; Streck et al., 2014; Veltkamp, 1986). (Fukai and Hammer, 1987; Gabriel et al., 2014; Gijzen et al., 1990; Keating et al., 1982; Matthews and Hunt, 1994; Pellet and El-Sharkawy, 1993; Rosenthal and Ort, 2012; Vandegeer et al., 2013). (Alves, 2002; Bolaños, 1987; CIAT, 1978; Connor et al., 1981; Fukai and Hammer, 1987; Howeler, 2011; Irikura et al., 1979; Lian and Cock, 1979; Veltkamp, 1986). (Cadavid, 1988; CIAT, 2013a; Cock, 2011; De Tafur et al., 1994; Fukai and Hammer, 1987). (CIAT. Unpublished results., n.d.; CIAT, 2013b; Lian and Cock, 1979; Porto, 1983). (CIAT. Unpublished results., n.d.). (De Souza et al., 2016; El-Sharkawy and Mejia de Tafur, 2010; Ezui, 2017; Leepipatpaiboon et al., 2009; Pellet and El-Sharkawy, 1997; Veltkamp, 1986; Yamamoto et al., 2004). (Fukai and Hammer, 1987; Keating et al., 1982; Manrique, 1992). (Ceballos and Cruz, 2012; Lian and Cock, 1979). (Cock, 2011; Ezui, 2017; Fukai and Hammer, 1987; June, 1993; Pellet and El-Sharkawy, 1993; Veltkamp, 1986). Potential crop growth is estimated by summing potential stem and leaf growth plus 10% to account for the growth of fibrous roots. The stem growth rate is calculated by adding the growth rate for all the cohorts, while the leaf growth rate is calculated by summing the potential leaf size for each cohort multiplied by the specific leaf area (SLAS). MANIHOT uses a spill-over strategy where the daily assimilation is first allocated to satisfy the requirements for growth of aboveground biomass and fibrous roots with only the remainder allocated to growth of storage roots. Storage roots therefore have no fixed initiation. Daily assimilate from photosynthesis in MANIHOT is calculated by multiplying the solar radiation intercepted each day by radiation use efficiency (PARUE). The potential crop growth rate is estimated each day as the total demand for assimilate to satisfy the growth of leaves, stems and fibrous roots. When the daily assimilation is less than the amount required to satisfy the potential growth, the actual growth of nodes and fibrous roots is reduced proportionally. Only when daily assimilation is greater than the demand to satisfy the potential growth is the excess allocated to the storage roots. MANIHOT (Moreno-Cadena, 2018), includes a new drought stress factor based on soil water content rather than the ratio between potential and actual transpiration as used in several other DSSAT crop models (Ritchie, 1998). The drought stress factor affects the germination, leaf appearance, branching, leaf size and biomass increase. The sensitivity to drought can be different for germination but it is the same for the other processes. Appendix A includes a brief evaluation of the results. The source code is available from GitHub under the 3-Clause BSD License (https://github.com/DSSAT/dssat-csm-os).

Input parameters and their distributions

MANIHOT currently has 18 GSPs. The effects of photoperiod (photoperiod sensitivity, PPS1) and nitrogen (nitrogen concentration in the storage roots, SRN%S) so far have not implemented in the code and were, therefore, omitted from this study. The distribution of the remaining 16 GSPs was defined based on a literature review of 35 publications and an unpublished database of the International Center for Tropical Agriculture (CIAT) (Table 1 and Fig. 1). Where necessary, we converted reported chronological time to thermal time using the mean temperature of the study site.
Fig. 1

Cumulative distribution functions (red line) of ten GSPs based on literature data (dots) (Table 1). GSPs with less than ten observations were assumed uniform and are not presented. (a) Thermal time from planting to first branching (B01ND); (b) thermal time between branches after first branching (B12ND); (c) maximum individual leaf area (LAXS); (d) specific leaf area (SLAS); (e) active leaf area duration in thermal time (LLIFA), (f) leaf-petiole weight fraction (LPEFR), (g) individual node weight (NODWT), (h) internode length (NODL), (i) radiation use efficiency (PARUE), (j) photosynthetically active radiation (PAR) extinction (KCAN). (For interpretation of the references to colour in the Figure, the reader is referred to the web version of this article).

Cumulative distribution functions (red line) of ten GSPs based on literature data (dots) (Table 1). GSPs with less than ten observations were assumed uniform and are not presented. (a) Thermal time from planting to first branching (B01ND); (b) thermal time between branches after first branching (B12ND); (c) maximum individual leaf area (LAXS); (d) specific leaf area (SLAS); (e) active leaf area duration in thermal time (LLIFA), (f) leaf-petiole weight fraction (LPEFR), (g) individual node weight (NODWT), (h) internode length (NODL), (i) radiation use efficiency (PARUE), (j) photosynthetically active radiation (PAR) extinction (KCAN). (For interpretation of the references to colour in the Figure, the reader is referred to the web version of this article). The distribution of each GSP represents the structure of the variability among cultivars including different timing and patterns of branching, leaf sizes and leaf retention. We fitted different distributions to the data of each GSP using the R fitdistrplus package. We selected the distribution that provided the best fit according to the Kolmogorov-Smirnov (K–S) statistic and the Akaike information criterion (AIC) (Delignette-Muller and Dutang, 2015; R Core Team, 2018). We used a uniform distribution for those GSPs that had less than 10 observations to avoid unrealistic assumptions about the distribution (Muñoz-Carpena et al., 2007; Vanuytrecht et al., 2014). MANIHOT uses a reference curve for the rate of leaf appearance based on experimental data for four varieties from three locations (Irikura et al., 1979). The GSP LNSLP is an index that adjusts the slope of the reference curve to provide the observed values of leaf number for the relevant cultivar.

Study sites and model input data

The sensitivity of a simulation model to the variation of a GSP depends on the growth environment (Confalonieri et al., 2010b; Silvestro et al., 2017; Vanuytrecht et al., 2014). We, therefore, conducted the global sensitivity analysis of MANIHOT under combinations of warm and cool temperatures under either rainfed or no water limitations (subsequently termed unlimited). We selected two sites in Colombia, Popayan (2.4278 °N, 76.6208 °W, 1750 masl), and Cereté (8.8397 °N, 75.8019 °W, 20 masl) with contrasting temperatures because of their different altitudes. Both are important national cassava areas and represent 27 % and 62 % of the global and equatorial cassava production, respectively (IFPRI and IIASA, 2005; Kottek et al., 2006). The mean temperature for the growing season from late April to February was 18 °C in Popayan (cool), and 28 °C in Cereté (warm). Rainfall and mean solar radiation during the growing season were 890 mm and 16 MJ/m2/day for Popayan and 1130 mm and 17 MJ/m2/day for Cereté (Table 2).
Table 2

Description of the study sites used for the sensitivity and uncertainty analysis. The temperature, solar radiation and rainfall are mean values during the growing season for the study period. The temperature classifications reflect the 10 °C difference between the two sites due to altitude.

SiteTemperature classificationLatitude (°)Longitude (°)Altitude (masl)Mean temperature (°C)Mean solar radiation (MJ/m2/day)Mean rainfall (mm)
PopayanCool2.4278−76.620817501816890
CeretéWarm8.8397−75.80192028171130
Description of the study sites used for the sensitivity and uncertainty analysis. The temperature, solar radiation and rainfall are mean values during the growing season for the study period. The temperature classifications reflect the 10 °C difference between the two sites due to altitude. We obtained 30 years’ data of maximum and minimum temperature, precipitation and solar radiation for each site (Popayan, 1984–2013, Cereté, 1980–2009). The weather data for Cereté were obtained from IDEAM, the Colombian National Institute of Meteorology; the weather data for Popayan were obtained from NASA/POWER (Stackhouse et al., 2018; Van Wart et al., 2015). We quality checked the data (Esquivel et al., 2018) with the RClimTool software (Llanos-Herrera, 2014). The soil data for Cereté were from the IRI database (IRI et al., 2015) and for Popayan the site’s profile data in the DSSAT database (see Table C1, Table C2 in Appendix C).
Table C1

Physical and chemical soil properties of the experiment in Popayan.

IDColorAlbedoEvaporation limit (mm)Drainage coefficientRunoff curvePhotosynthesis factor1
CCBN880060Black0.098.70.6761
Soil depth (cm)Lower limit (cm3 cm−3)Drained upper limit (cm3 cm−3)Saturation (cm3 cm−3)Soil root growth factorSaturated hydraulic conductivity (cm h−1)Bulk density (g cm−3)Organic carbon (%)Clay (%)Silt (%)pH (water)
50.0750.2150.4060.5−990.5816.8961.65.7
130.0750.2150.4060.5−990.5816.8961.65.7
300.0390.1750.3710.2−990.5515.80.9534.9
390.050.1640.3020.1−990.526.87018.55.1
660.0430.1390.3020.1−990.483.80145.3
820.0410.130.3020.1−990.512.19012.45.3
1020.0440.140.3020.1−990.621.88014.25.3
1240.040.1280.3020.1−990.641.510125.2
1370.0450.1450.3020−990.511.66015.25.2
1760.0370.1170.3020−990.481.310105.2
2000.0330.1040.3020−990.41.6507.75.6

The photosynthesis factor was set to 1 although the original value in the database of DSSAT is 0.9.

Table C2

Physical and chemical soil properties of the experiment in Cereté.

IDColorAlbedoEvaporation limit (mm)Drainage coefficientRunoff curvePhotosynthesis factor
CO04204610Black0.160.5751
Soil depth (cm)Lower limit (cm3 cm−3)Drained upper limit (cm3 cm−3)Saturation (cm3  cm−3)Soil root growth factorSaturated hydraulic conductivity (cm h-1)Bulk density (g cm−3)Organic carbon (%)Clay (%)Silt (%)pH (water)
50.1540.2650.39910.621.422.0125.6225.315.78
150.1660.2780.4030.850.491.441.727.6824.55.85
300.1820.2950.4080.70.371.471.330.2623.335.94
600.1970.3120.4140.50.271.520.8332.7822.116.06
1000.1960.310.4130.380.281.580.4832.7221.546.2
2000.1860.2980.4080.050.341.630.2831.0621.336.37

Global sensitivity analysis by enhanced sampling Uniformity (eSU)

We used eSU to rank GSPs according their relative influence on MANIHOT’s outputs of the simulated processes. We selected eSU because it creates multiple combinations of GSPs with improved sample uniformity, sample spread and screening efficiency compared with other methods (Chitale et al., 2017; Khare et al., 2015). We used the eSU procedure in the MATLAB package (The MathWorks Inc, 2015) to create a set of 408 combinations within 8 levels of each of the 16 GSPs (k) specifying 24 trajectories (r). Trajectories are defined as the succession of points within the sampling space that create the r*(k+1) combinations (Chitale et al., 2017). We specified 8 levels of each GSP, which is more than 4 that is commonly used for these analyses (Chitale et al., 2017; Khare et al., 2015; Saltelli et al., 2008b). The number of trajectories specified must be a multiple of the number of levels and typically ranges 10–30 (Chitale et al., 2017). We ran MANIHOT using each of the 408 GSPs combinations for each location for 30 years. We selected six output variables: aboveground biomass, yield, maximum leaf area index (LAI), number of leaves at harvest, and time to appearance of the first and the second branch and calculated their 30-year means. We used sensitivity analysis to identify the first- and high-order effects of the individual GSPs. First-order or elementary effects (μ) are a direct influence of a GSP in an output. High-order effects of GSPs on an output are either interactions between one GSPs and all the others or the non-linear effects of a particular GSP (Saltelli et al., 2004; Wallach et al., 2014). We used a modified version of the elementary effects (μ*) (Campolongo et al., 2007) and standard deviation (σ) for the high interactions or non-linear effects of the GSPs on the output variables. We normalized values of the elementary effects (μ*) for each output variable by dividing them by the maximum value of the elementary effects of all GSPs across the four treatments. We similarly normalized the values of the standard deviations (σ) for each output by dividing by the maximum value of the σ for all GSPs.

Global uncertainty and sensitivity analysis using the Sobol methodology

We selected as important the GSPs with normalized μ* or σ greater than 0.5 for at least one output variable for any of the two locations following Loubière et al. (2016) and Chitale et al. (2017). We used the Sobol method to evaluate the uncertainty of the model outputs due to the uncertainty of the selected GSPs. Again, we used the mean value of the output variables over the 30 years of simulations to estimate the cumulative probability function for the selected output variables. We also estimated the 95% confidence intervals due to the weather variation over the 30 years. Sobol decomposes output variability into the contributions of the individual GSPs and their corresponding interactions, termed first and total order sensitivity indices, respectively. The first sensitivity index of a given GSP is the proportion of the total variance of the output variable explained by its main effect (Giglioli and Saltelli, 2003). The total sensitivity index of a GSP includes the first order sensitivity index plus the interactions of the GSP with all others. We used the same probability distribution functions as in the eSU analysis to create the Monte Carlo samples for the Sobol method (Table 1 and Fig. 1). The Sobol analysis required 15,360 GSP combinations, given by 2n(k+1), where k is the number of GSPs and n (>500) is the number of samples to estimate the individual effects. We specified 14 GSPs (k) and a sample size of 512 (n) in SimLab v2.2.1 (Khare et al., 2015). We used the data for the same six output variables as in the eSU analysis. To evaluate the similarities between the two methods for each GSP, we compared the elementary effects from eSU with the total order indices from Sobol.

Crop model simulations

The simulations for both sites started 30 days prior to planting with the initial soil water content set to field capacity. We did not simulate additional limitations due to nutrients, weeds or pests.

Results

Distribution of reported data of GSPs

The number of observations in the literature of the GSPs differed widely (Table 1). We found more than 330 observations for the active leaf area duration (LLIFA) and leaf-petiole weight fraction (LPEFR). In contrast, we found few data for the base temperature for leaf development (TBLSZ) and for the number of branches per fork (BR1F-BR4F). Either normal, lognormal, gamma, Weibull, uniform or triangular distributions fitted the data of those GSPs for which we found enough data for the fitting routine to converge (Fig. 1). The fits met the K–S test at a confidence level P < 0.05 for all the GSPs except for LPEFR, which indicates that the functions describe their true distribution. Although the gamma function gave an acceptable fit to the observed data of LPEFR (Fig. 1(f)) the K–S test rejected the null hypothesis.

Global sensitivity analysis: Screening method using the enhanced Sampling Uniformity (eSU)

For all six output variables there was no difference in sensitivity to the GSPs between rainfed and unlimited conditions (Fig. 2), in contrast to the warm and cold environments. The base temperature for leaf development (TBLSZ) is more important under low temperatures. Values of both μ* and σ were higher for the warm environment because thermal time accumulates faster, which increases the values of the simulated outputs. The most important GSPs were individual node weight (NODWT), maximum individual leaf size (LAXS), thermal time from planting to first branching (B01ND) and PAR extinction coefficient (KCAN). Most GSPs affected the maximum LAI (Fig. 2 a, d) with 10 of the 16 values of μ* values above 0.5 in the warm environment with water unlimited. In contrast, only seven of them had μ* values above 0.15 in the cool environment. As expected, branching behavior simulated by MANIHOT is sensitive to the GSPs B01ND and B12ND (data not shown).
Fig. 2

Normalized elementary effects (μ*) in the output variables of MANIHOT due to the variation of each of the GSPs using the enhanced Sampling for Uniformity method of maximum leaf area index (LAI) (a, d), aboveground biomass (b, e), and yield (c, f) at warm (top) and cool (bottom) environments under unlimited (green) and rainfed (red) conditions. Note that the maximum values of μ* are shown at the left top corner of each column. The minimum value was 0 overall. The definitions for the GSPs are listed in Table 1.

Normalized elementary effects (μ*) in the output variables of MANIHOT due to the variation of each of the GSPs using the enhanced Sampling for Uniformity method of maximum leaf area index (LAI) (a, d), aboveground biomass (b, e), and yield (c, f) at warm (top) and cool (bottom) environments under unlimited (green) and rainfed (red) conditions. Note that the maximum values of μ* are shown at the left top corner of each column. The minimum value was 0 overall. The definitions for the GSPs are listed in Table 1. Like μ*, the values of σ under rainfed and unlimited conditions were similar, but differed between warm and cool environments (Fig. 3). Most of the GSPs that had high direct effects (μ*) (Fig. 2) also had high interactions (σ). For warm temperatures, active leaf area duration (LLIFA) and number of branches at the third branching level (BR3F) showed a larger interaction than direct effects. Radiation use efficiency (PARUE) also had larger values of σ than of μ*.
Fig. 3

Normalized standard deviation of elementary effects (σ) in the output variables of MANIHOT due to the variation of each of the GSPs using the enhanced Sampling for Uniformity method of maximum leaf area index (LAI) (a, d), aboveground biomass (b, e), and yield (c, f) at warm (top) and cool (bottom) environments under unlimited (green) and rainfed (red) conditions. Note that the maximum values of σ are shown at the left top corner of each column. The minimum value was 0 overall. The definitions for the GSPs are listed in Table 1.

Normalized standard deviation of elementary effects (σ) in the output variables of MANIHOT due to the variation of each of the GSPs using the enhanced Sampling for Uniformity method of maximum leaf area index (LAI) (a, d), aboveground biomass (b, e), and yield (c, f) at warm (top) and cool (bottom) environments under unlimited (green) and rainfed (red) conditions. Note that the maximum values of σ are shown at the left top corner of each column. The minimum value was 0 overall. The definitions for the GSPs are listed in Table 1. As expected, both PARUE and LAXS are key GSPs that increase LAI, aboveground biomass and yield. High values for the base temperature for leaf development (TBLSZ) decreased maximum LAI, aboveground biomass and yield. The effect of node weight (NODWT) on aboveground biomass depended on its magnitude (termed non-monotonic). Increasing values up to a threshold increased aboveground biomass while values higher than the threshold also affected partitioning, because it also reduced the assimilate available for leaf growth.

Uncertainty analysis: Sobol method

As expected, the Sobol method provided a wide range of values for each GSP, which caused the simulation outputs of MANIHOT to vary widely. All six outputs (Fig. 4) for both warmer environments covered wider ranges than those for the cool environment except for the time to appearance of the first branch (Fig. 4a). Under cool growing conditions, uncertainty in the MANIHOT outputs due to uncertainty in the input GSPs was similar for both rainfed and water-unlimited conditions (Fig. 4). For the warm environment, maximum LAI and number of leaves at harvest with unlimited water were higher than rainfed, which increased aboveground biomass but reduced yield slightly.
Fig. 4

Cumulative probability for the output variables of MANIHOT obtained from the Sobol method at cool and warm environments rainfed and under unlimited conditions: (a) Time for appearance of first branch; (b) Time for appearance of second branch, (c) aboveground biomass at harvest (kg/ha); (d) Yield (kg/ha); (e) Maximum LAI; and (f) Number of leaves at harvest. The solid lines represent the cumulative probability of the 30-year means of the simulated values. The shaded areas are the cumulative probabilities of the 95 % confidence intervals.

Cumulative probability for the output variables of MANIHOT obtained from the Sobol method at cool and warm environments rainfed and under unlimited conditions: (a) Time for appearance of first branch; (b) Time for appearance of second branch, (c) aboveground biomass at harvest (kg/ha); (d) Yield (kg/ha); (e) Maximum LAI; and (f) Number of leaves at harvest. The solid lines represent the cumulative probability of the 30-year means of the simulated values. The shaded areas are the cumulative probabilities of the 95 % confidence intervals. The time to branching had a coefficient of variation (CV) of 30% and 20% in the time to first and second branching, respectively (Table 3). The aboveground biomass had a CV between 74% and 88% due to parameter uncertainty, while the maximum LAI and yield had a higher CV with values above 120%.
Table 3

Standard deviation and coefficient of variation of six output variables of MANIHOT obtained from the global uncertainty analysis of the input GSPs using the Sobol method in the four test environments (see text). Estimates of the standard deviation used the mean values of the 30 years of simulations for each environment.

VariableStandard deviation
Coefficient of variation (CV, %)
warm unlimitedwarm rainfedcool unlimitedcool rainfedwarm unlimitedwarm rainfedcool unlimitedcool rainfed
Time for appearance 1 st branch (days)303151293026
Time for appearance 2nd branch (days)33352020
Aboveground biomass at harvest (kg/ha)2498198159855686748876
Yield (kg/ha)20372152481452163149227177
Maximum LAI (-)1.1630.6880.3320.273125117159129
Number of leaves at harvest (#)5145252125252525
Standard deviation and coefficient of variation of six output variables of MANIHOT obtained from the global uncertainty analysis of the input GSPs using the Sobol method in the four test environments (see text). Estimates of the standard deviation used the mean values of the 30 years of simulations for each environment.

Global sensitivity analysis: Sobol method

Individual node weight (NODWT), radiation use efficiency (PARUE) and maximum individual leaf size (LAXS) were the most important GSPs (Fig. 5) based on their direct effect or first order indices. Temperature modified the importance of some GSPs. Base temperature for leaf development (TBLSZ) accounted for the variation for the cool compared to the warmer environments. Leaf appearance slope (LNSLP) had higher direct effect under warm temperatures than under low temperatures.
Fig. 5

Proportion of variance of aboveground biomass, maximum LAI and yield explained by each GSP in the Sobol analysis. The first order sensitivity index (main effect) (top) and total order sensitivity index (main effect plus interactions) (bottom) for cool and warm temperatures under rainfed and water-unlimited conditions.

Proportion of variance of aboveground biomass, maximum LAI and yield explained by each GSP in the Sobol analysis. The first order sensitivity index (main effect) (top) and total order sensitivity index (main effect plus interactions) (bottom) for cool and warm temperatures under rainfed and water-unlimited conditions. The direct effect of individual GSPs explained 55–80% of the variance of the simulated output variables, depending on the temperature and water regime. Individual node weight (NODWT) contributed most to total uncertainty for yield for the cool and unlimited environment, explaining 30% of the variance. The base temperature for leaf development (TBLSZ) was the next most important GSP, explaining 27% of the variance for aboveground biomass for the same environment (Fig. 5). Interactions between GSPs account for 20–45% of the variance in the output variables. Both eSU and Sobol showed that GSPs with high direct effects also had high interactions (eSU) or high total order indices (Sobol). Sobol also agrees with eSU in attributing the variance of simulated maximum LAI to the influence of many GSPs and their interactions. Sobol showed that the GSP specific leaf area (SLAS) had low first and total order sensitivity for all simulated output variables for all four environments. There was a strong positive linear relationship between the total order sensitivity index (main effect plus interactions between parameters, STi in Sobol) and the elementary effects of each parameter (eSU) (Fig. 6). The relation was similar for all the output variables with the highest R2 for yield. The slopes of the relationship differ according to the temperature of each environment.
Fig. 6

Elementary effects from eSU (μ*) versus the total order indices of Sobol (STi) for cool and warm temperatures under rainfed and water-unlimited conditions. Note that the units of μ* are those of the simulated output variables.

Elementary effects from eSU (μ*) versus the total order indices of Sobol (STi) for cool and warm temperatures under rainfed and water-unlimited conditions. Note that the units of μ* are those of the simulated output variables.

Discussion

This study presents a new comprehensive approach to sensitivity and uncertainty analysis of a crop model for contrasting environments. We defined the probability distributions for the range of uncertainty of each GSP based on experimental measurements from a thorough literature search. Previous studies on the sensitivity and uncertainty analysis for crop modeling commonly used uniform distribution of input parameters based on limited references (e.g. Esmaeili et al., 2014; He et al., 2016; Makowski et al., 2006; Silvestro et al., 2017; Vazquez-Cruz et al., 2014; Xing et al., 2017). Knowing the actual distribution of the input parameters (GSPs) allowed us to specify a more realistic representation of the results of the sensitivity and uncertainty analysis. In contrast, an inadequate representation of the range of the parameters inevitably degrades the quality of the sensitivity analysis (Wang et al., 2013; Zhao et al., 2014). Plasticity of a simulation model is its tendency to change its sensitivity to the GSPs under different environments (e.g. Silvestro et al., 2017). The MANIHOT simulation model showed low plasticity (little variation) in the relevance of the GSPs between rainfed and water unlimited conditions. Both the WOFOST model for wheat (He et al., 2016) and CSM-CROPGRO-Cotton model (Pathak et al., 2007) showed similar plasticity in the response to the water regime. In contrast, the Aquacrop model for wheat (Xing et al., 2017) and the SALUS model for maize (Dzotsi et al., 2013) are more sensitive to some GSPs than to others when the crop is rainfed. Temperature had a significant effect on the plasticity of MANIHOT. The GSP base temperature for leaf development (TBLSZ) is much more important at low temperatures. The maximum value for TBLSZ is close to the mean temperature of the cool environment (18 °C). Thermal time for leaf growth therefore accumulates only slowly, resulting in a low LAI and hence less interception of solar radiation and thus a smaller production of assimilates. The result is less aboveground biomass and low yield. The SALUS model for maize, peanut and cotton showed a similar sensitivity to the GSPs for different locations (Dzotsi et al., 2013), although the ranking changed. The base temperature used to estimate photothermal time was therefore more important for cooler climates, which is similar to MANIHOT. Sensitivity analyses conducted with other crop models also found that the base temperature is commonly identified as an important GSP. Examples include the STAMINA model for wheat (Richter et al., 2010), the WOFOST model for maize (Wang et al., 2013), the GROSUB model for rye (Feyereisen et al., 2006), and the CropSyst and WARM models for rice (Confalonieri et al., 2006). Other studies have shown plasticity of their GSPs under different environmental conditions for the Aquacrop model for maize (Vanuytrecht et al., 2014) and the APSIM model for wheat (Zhao et al., 2014). The different conditions included varying temperature and rainfall distribution. Plasticity allows a model to reflect performance of a cultivar in different environments, which may require more GSPs. The disadvantage is the cost to estimate the extra GSPs, especially if there are many cultivars with different phenotypes (Silvestro et al., 2017). As discussed above, cassava grows slower under low temperatures. This is reflected in lower values of μ* and σ for all the GSPs, and is a common characteristic of crop models (e.g. Silvestro et al. (2017) for wheat for both the Aquacrop and SAFYE models). Overall, the most important GSPs were the individual node weight (NODWT), radiation use efficiency (PARUE), and maximum individual leaf area (LAXS). The base temperature for leaf development (TBLSZ) only became important at low temperatures. Radiation use efficiency is commonly one of the most important GSPs because it controls the carbohydrates available for daily growth (e.g. Confalonieri et al., 2010a, 2010b, 2006; Dzotsi et al., 2013; Feyereisen et al., 2006; Makowski et al., 2006; Wang et al., 2013). This version of MANIHOT does not include the detailed stomatal response of VPD but it has a GSP for the day-to-day effect of VPD on photosynthesis, which likely modifies the sensitivity of PARUE. It was not included in the sensitivity analysis because it is a species-wide parameter. Sensitivity analyses for short-season annual crops has shown that crop yield is very sensitive to GSPs associated with accumulation of thermal time and phenological development (He et al., 2015; Zhao et al., 2014). However, the GSPs in MANIHOT that define branching time, which are the analogs of phenological stages in other crops, accounted for less than 20 % of the variance in yield. Neither leaf-petiole weight fraction (LPEFR) nor the fourth branching level (BR4F) affected the simulation outputs of MANIHOT. The sensitivity analysis used over 300 values for LPEFR, but they varied little from 0.2. We suggest that LPERF is redundant as a GSP and can be replaced with a constant as part of the species description. Under cool temperatures, the MANIHOT model rarely simulates to four branches within 10 months so that the GSP for the fourth branching level (BR4F) has much less impact in the analyses. Nevertheless, under warm temperatures the fourth branching level (BR4F) interacts with GSPs that control leaf size and thus affects the plant’s leaf area. The simulation outputs maximum LAI, aboveground biomass and yield, showed low sensitivity to the GSP specific leaf area (SLAS), which we did not expect. SLAS does control leaf growth through its direct effect on LAI. Nevertheless, aboveground biomass is dominated by stem weight so that the effect of SLAS alone may not be detectable. The simulated outputs were sensitive to about 80% of the GSPs through direct effects and 20% through interactions. This emphasizes that it is important to account for the interactions when estimating the values of GSPs during model calibration. GSPs with higher interactions may require evaluation of multiple combinations of GSPs. The variability for simulated variables in MANIHOT due to parameter uncertainty was larger than for crop simulation models such as wheat in which the coefficient of variation for yield is about 13% (Asseng et al., 2013). The uncertainty of simulation outputs in MANIHOT was larger for warm and rainfed conditions than in either water regime in cool environments, in which temperature was the main limiting factor. In contrast, for a warm temperature environment, water-unlimited simulations produced higher values of maximum leaf area index, number of leaves and aboveground biomass. The water-unlimited regime for warm temperatures gave different cumulative probabilities and higher CVs for the simulated outputs compared with rainfed conditions. MANIHOT is based on the spill-over concept in which assimilate is allocated to satisfy demand to fulfil potential growth of aboveground organs. Only assimilate surplus above these demands are allocated to the storage roots. Under warm temperatures, water-unlimited conditions, therefore, resulted in a yield that was slightly lower than under rainfed conditions because proportionally more of the available assimilate was allocated to the aboveground organs, leaving less surplus. This contrasts with cereal crops where the harvested grain is an integral aboveground organ and well-watered crops produce a higher yield compared to rainfed (DeJonge et al., 2012). The elementary effects from eSU and the total order indices from Sobol were highly correlated. This shows the utility of eSU to screen GSPs for models with many input parameters when Sobol could challenge marginal computational infrastructure. Similar findings of high correlations between qualitative and quantitative methods have been reported (e.g. Confalonieri et al., 2010a; Stella et al., 2014). However, this study is the first instance where the qualitative eSU procedure has been used for analysis of a crop simulation model. Sensitivities to GSPs values in MANIHOT differed between growing environments giving a large range of model uncertainty. GSPs must, therefore, be estimated across contrasting growing environments to ensure reliable model simulations. We only considered the uncertainty of the GSPs but the same procedures could be applied to other input variables that contribute to model uncertainty such as soil characteristics (DeJonge et al., 2012; Jones et al., 2012).

Conclusions

The enhanced Sampling for Uniformity (eSU), showed that maximum LAI, yield, and aboveground biomass at harvest simulated by MANIHOT were sensitive to about 80% of the genotype specific parameters (GSPs). The importance of GSPs in the simulated variables did not change between water-unlimited and rainfed conditions but differed between warm and cool environments. The most important GSPs were individual node weight, radiation use efficiency and maximum individual leaf area. The base temperature for leaf development was more relevant at cool than at warm temperatures. About 20% of the variance in the output variables was due to GSP interactions, which is important when estimating parameter values. Further research should include sensitivity and uncertainty analysis of other input variables such as soil characteristics. The comprehensive global sensitivity analysis approaches that we developed here can readily be applied to other crop models. It also provides an objective way to identify processes included in simulation models that have little importance.

CRediT authorship contribution statement

Leidy Patricia Moreno-Cadena: Conceptualization, Methodology, Validation, Formal analysis, Writing - original draft. Gerrit Hoogenboom: Conceptualization, Methodology, Resources, Writing - review & editing, Supervision. Myles James Fisher: Writing - review & editing. Julian Ramirez-Villegas: Writing - review & editing. Steven Dean Prager: Writing - review & editing. Luis Augusto Becerra Lopez-Lavalle: Project administration, Supervision. Pieter Pypers: Funding acquisition, Project administration. Maria Sara Mejia de Tafur: Conceptualization. Daniel Wallach: Methodology, Writing - original draft. Rafael Muñoz-Carpena: Conceptualization, Methodology. Senthold Asseng: Conceptualization, Methodology, Writing - review & editing, Supervision.

Declaration of Competing Interest

None.
  4 in total

Review 1.  Rooting for cassava: insights into photosynthesis and associated physiology as a route to improve yield potential.

Authors:  Amanda P De Souza; Lynnicia N Massenburg; Deepak Jaiswal; Siyuan Cheng; Rachel Shekar; Stephen P Long
Journal:  New Phytol       Date:  2016-10-25       Impact factor: 10.151

2.  Response of cassava leaf area expansion to water deficit: cell proliferation, cell expansion and delayed development.

Authors:  Alfredo A C Alves; Tim L Setter
Journal:  Ann Bot       Date:  2004-08-19       Impact factor: 4.357

3.  Phenotypic approaches to drought in cassava: review.

Authors:  Emmanuel Okogbenin; Tim L Setter; Morag Ferguson; Rose Mutegi; Hernan Ceballos; Bunmi Olasanmi; Martin Fregene
Journal:  Front Physiol       Date:  2013-05-10       Impact factor: 4.566

4.  Sensitivity analysis of the Aquacrop and SAFYE crop models for the assessment of water limited winter wheat yield in regional scale applications.

Authors:  Paolo Cosmo Silvestro; Stefano Pignatti; Hao Yang; Guijun Yang; Simone Pascucci; Fabio Castaldi; Raffaele Casa
Journal:  PLoS One       Date:  2017-11-06       Impact factor: 3.240

  4 in total
  2 in total

1.  Yield Response of Spring Maize under Future Climate and the Effects of Adaptation Measures in Northeast China.

Authors:  Jackson K Koimbori; Shuai Wang; Jie Pan; Liping Guo; Kuo Li
Journal:  Plants (Basel)       Date:  2022-06-21

Review 2.  Modeling growth, development and yield of cassava: A review.

Authors:  Patricia Moreno-Cadena; Gerrit Hoogenboom; James H Cock; Julian Ramirez-Villegas; Pieter Pypers; Christine Kreye; Meklit Tariku; Kodjovi Senam Ezui; Luis Augusto Becerra Lopez-Lavalle; Senthold Asseng
Journal:  Field Crops Res       Date:  2021-06-15       Impact factor: 5.224

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.