Literature DB >> 32730284

Adaptability and stability evaluation of maize hybrids using Bayesian segmented regression models.

Tâmara Rebecca A Oliveira1, Hélio Wilson L Carvalho2, Moysés Nascimento3, Emiliano Fernandes N Costa2, Gustavo Hugo F Oliveira1, Geraldo A Gravina4, Antonio T Amaral Junior4, José Luiz S Carvalho Filho5.   

Abstract

The occurrence of genotype by environment interaction (G x E), which is defined as the differential response of genotypes to environmental variation, is frequently reported in maize cultures, making it challenging to recommend cultivars. Methods allowing to study the potential nonlinear pattern of genotype responses to environmental variation allied to prior beliefs on unknown parameters are interesting to evaluate the phenotypic adaptability and stability of genotypes. In this context, the present study aimed to assess the adaptability and stability of maize hybrids, by using the Bayesian segmented regression model, and evaluate the efficacy of using informative and minimally informative prior distributions for the selection of cultivars. Randomized complete-block design experiments were carried out to study the yield (kg/ha) of 25 maize hybrids, in 22 different environments, in Northeastern Brazil. The Bayesian segmented regression model fitted using informative prior distributions presented lower credibility intervals and Deviance Criterium of Information values, compared to those obtained by fitting using minimally informative distributions. Therefore, the model using informative prior distributions was considered for the adaptability and stability evaluation of maize genotypes. Once most northeastern farmers in Brazil have limited capital, the genotype P4285HX should be considered for planting, due to its high yield performance and adaptability to unfavorable environments.

Entities:  

Mesh:

Year:  2020        PMID: 32730284      PMCID: PMC7392259          DOI: 10.1371/journal.pone.0236571

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Maize (Zea mays L.) cultures are appreciated worldwide. Thus it has tremendous relevance due to its several uses and applications in areas ranging from animal feed to technological industries. Furthermore, because maize is grown under different environmental conditions, it interacts with various environments, resulting in varied genotype performances [1]. Such interactions hinder the genotype sealing works given that the best-suited genotype for a specific environment may not be best suited for another environment where such interactions take place. Thus, recommendations for the broad adaptability and stability of cultivars become costly [1]. The literature presents several methodologies to study phenotypic adaptability and stability. Examples of such methods are those based on simple linear regression [2], piecewise linear regression [3, 4], mixed models (REML/BLUP) [5], non-parametric methods [6, 7], multivariate analysis, for example, multiple and expanded centroid methods [8, 9] and Bayesian inference for simple linear regression [10]. Unlike the deterministic and frequentist methods, the Bayesian framework allows for the incorporation of additional information relating to the parameters through prior distributions, which are characterized by the probability distribution. According to [11], all information is useful and must be used in the Bayesian analysis. Additionally, owing to the large quantity of information available from previous studies, incorporating this information during modeling is reasonable [12]. Despite being interesting, the Bayesian approach to adaptability and stability studies is based on a simple regression model [10]. According to [13], the simple linear regression models are unable to fit a potential nonlinear pattern to genotype responses to environmental variations. Aiming to solve this deficiency, under a statistical “frequentist” framework, [3, 4] proposed the segmented regression model allowing the identification of the “ideal” genotype, which presents high yield performance, high stability and low sensitivity to adverse conditions. Nascimento et al. [12] proposed the Bayesian segmented regression model approach to analyze phenotypic adaptability and stability. This approach differs from the “frequentist” framework, allowing the addition of prior beliefs to unknown parameters, bringing new insights for plant breeders. Additionally, this method allows for the exploitation of potential nonlinear patterns of genotype responses to environmental variations, aiming to identify genotypes that present high yield performance, and high stability under adverse conditions. This genotype is denoted as “ideal,” according to [4]. In light of the above, the present study aimed to assess the adaptability and stability of maize hybrids by using the Bayesian segmented regression model and evaluate the efficacy of using informative and minimally informative prior distributions in the selection of cultivars.

Materials and methods

During the agricultural years 2012 and 2013, 25 maize hybrids from public and private companies from the states of Maranhão (Balsas, Brejo, Colinas, and São Raimundo das Mangabeiras counties), Piauí (Nova Santa Rosa, Teresina, and Uruçuí counties), and Sergipe (Nossa Senhora das Dores, Frei Paulo, and Umbaúba counties) were assessed. The assessments comprised 11 environments, where the Nossa Senhora das Dores County had two different fertilization and each one was assumed as a different environment (Table 1).
Table 1

List of hybrid maize cultivars and their respective origins, types, cycles, colors, grain textures and companies.

CultivarTransgenic/ConventionalType1Cycle2Grain color3Grain texture4Company
30A95HXTransgenicTHEORSMHARDMORGAN
30A68HXTransgenicSHEEORSMHARDMORGAN
BM820ConventionalSHERHARDBIOMATRIX
DKB330YGConventionalSHEER/ORSMDENTDEKALB
AS1596R2TransgenicSHERSMDENTAGROESTE
P4285HTransgenicSHEY/ORHARDDU PONT
2B710HXTransgenicSHEY/ORSMHARDDOW
30A16HXTransgenicSHEORSMHARDMORGAN
DKB370ConventionalSHmEY/ORSMHARDDEKALB
AG8041YGTransgenicSHEY/ORSMHARDSEMENTES
20A55HXTransgenicTHEORSMHARDMORGAN
30F53HRTransgenicSHEORSMHARDDU PONT
30A37HXTransgenicSHEEY/ORSMHARDMORGAN
30A91HXTransgenicSHmEY/ORSMHARDMORGAN
2B587HXTransgenicSHEY/ORSMDENTDOW
2B433HXTransgenicTHEEY/ORSMDENTDOW
AS1555YGTransgenicSHEORSMHARDAGROESTE
BRS2022ConventionalDHEORSMDENTEMBRAPA
StatusvipTransgenicSHEORHARDSYNGENTA
BRS2020ConventionalDHEORSMHARDEMBRAPA
2B707HXTransgenicSHEORSMHARDDOW
20A78HXTransgenicSHEORSMHARDDOW
2B604HXTransgenicSHmEORSMHARDDOW
30K73HTransgenicSHEY/ORSMHARDDU PONT
2B688HXTransgenicTHEORSMHARDDOW

1DH: Double hybrid; TH: Triple hybrid; SHm: Modified single hybrid.

2EE: Extra early; E: Early.

3OR: Orange; R: Reddish; Y:Yellow.

4SMDENT:Semi-dent; SMHARD:Semi-hard.

1DH: Double hybrid; TH: Triple hybrid; SHm: Modified single hybrid. 2EE: Extra early; E: Early. 3OR: Orange; R: Reddish; Y:Yellow. 4SMDENT:Semi-dent; SMHARD:Semi-hard. During the trials, samples considered to have high fertilization ranges were treated with a total of 180.00 kg ha-1 of N, 149.80 kg ha-1 of P2O5 and 85.60 kg ha-1 of K2O, whereas samples considered to have low fertilization ranges were treated with 45.00 kg ha-1 of N, 37.80 kg ha-1 of P2O5 and 21.60 kg ha-1 of K2O, in the form of 535 and 135 kg ha-1 of 8-28-16 Zn at the time of sowing, respectively. The experimental design was based on randomized blocks, with two repetitions, wherein each plot comprised four 5.0 m-long rows, with spacings of 0.70 m x 0.20 m, between rows, and between holes within the rows, respectively. Fertilization was performed according to the results of the soil analysis from each experimental area. Irrigation was not carried out, and weed and pest control was performed according to the crop’s requirement in each region. The maize yield data were subjected to variance analysis for each environment. A joint analysis was carried out by adopting the model: y = μ + r/e + e + g + ge + ε, where y is the phenotypic mean, μ is the overall mean, r/e is the effect of the kth repetition in the jth environment, g is the fixed effect of the ith genotype, e is the effect of the jth environment normally and independently distributed (NID) , ge is the effect of the interaction of the ith genotype in the jth environment NID , and ε is the experimental error

Model and Bayesian inference

The bi-segmented regression model is given by where y is the response of genotype i in environment j, β is the mean response of genotype i, β is the slope under the first regime (the linear regression coefficient related to the unfavorable environments), and β represents the change in slope from the first to the second regime (β + β is the slope after the change-point, that is, the linear response to the favorable environments). Further, I is the coded environmental index, T(I) = 0 if I ≤ 0, or if I > 0, and is mean of the coded environmental index considering only environments with positive indexes and e is the error term, NID (0, σ2). The Bayesian approach for the bi-segmented model is described in Nascimento et al. [12]. In summary, assuming , each observation yij has a distribution , and the likelihood function for each genotype is given by The prior distributions for the parameters (β, β, β, ) are given by and where , ,, , , , and α, β are the known parameters. This last prior distribution is the Gamma distribution with mean and variance equal to e , respectively. Additionally, the precision is equal to . The joint posterior distribution is proportional to the product of the likelihood function (Eq 1.1) and the prior distributions (Eqs 1.2–1.5). To make inferences regarding the parameters in Eq 2, the Markov chain Monte Carlo (MCMC) was used to obtain the posterior marginal distributions for each parameter. The marginal distribution samples of the stability parameter, , were obtained indirectly. This parameter is a function of . Therefore, using the values from each interaction, we obtain according to the following expression: , where MSR is the residual mean square obtained from the variance analysis and r is the number of repetitions of the experiment. The hypotheses of interest were tested by calculating the 95% credibility intervals for the parameters.

Priors distributions

Two models were fitted to assess the model’s goodness of fit. Model 1 (M1—minimally informative prior distributions) was characterized by minimally informative prior distributions, which were represented by distributions with large variances: ,, , and τ|α, β ~ Gamma(α = 0.001, β = 0.001). Model 2 (M2—informative prior distributions), similar to the method employed in [12], was characterized by the estimates obtained from the frequentist analysis of the bi-segmented model, used as information to define the hyperparameters.

Assessing the model’s goodness of fit

Models M1 (minimally informative priors) and M2 (informative priors) were compared by means of the Deviance Information Criterion (DIC) [14]: . Here, is a point estimate of the deviance obtained by replacing the parameters with their posterior mean estimates in the likelihood function and 2p is given by the effective number of parameters in the models. Models with lower DIC are preferred.

Bayesian analysis

We adopted MCMC chains considering 100,000 iterations of the Gibbs sampler algorithm. We set the burn-in to 10,000 iterations and thinned every five iterations. In each chain, we analyzed the posterior mean, standard deviation, 95% credibility intervals, and convergence criterion statistics [15, 16]. The methodology was implemented in software R [17], and the joint distribution samples were obtained using the rbugs function of the rbugs package [18], which was accomplished by fusing R and OpenBugs (a software application for the Bayesian analysis of complex statistical models using MCMC methods). The MCMC chain convergence was accessed by Geweke and Raftery-Lewis diagnostics using the package [19] provided in the R software [17].

Results and discussion

The analysis of variance of the maize yield (kg/ha) demonstrated that the genotypes, environments and the genotype x environment interaction (G×E) presented a significant effect (P < 0.05) (Table 2). The significance of G x E interaction indicates contrasts between environments and differential genotypic responses to environmental effects. The occurrence of G × E interaction, which can be defined as the differential response of genotypes to environmental variation, is frequently reported in maize cultures, making it challenging to recommend cultivars [20-24].
Table 2

Mean squares from analysis of variance (ANOVA) for yield of 25 hybrid maize genotypes assessed in 22 environments.

Sources of variationDFMean square (MS)
Blocks/Environments221989297
Genotypes (G)2416241624**
Environments (E)21114452906**
Genotypes x Environments (G x E)5042454148**
Error528708661
Mean (kg/ha)8682.99

**Significant at 0.01 probability levels by F test.

**Significant at 0.01 probability levels by F test. The posteriori means and their respective credibility intervals (CI) provided estimates for the adaptability and stability parameters. Considering the results provided by the model M1, which is characterized by the minimally informative prior distributions, most genotypes (14 genotypes) presented the linear regression coefficient related to the unfavorable environments equal to 1 (β = 1), except 30A16HX, 2B707HX, 2B587HX, 30A37HX, 2B604HX, 20A55HR, 20A78HX and DKB370, which presented values higher than 1 (β > 1), and the genotypes P4285HX and BRS2020, which presented values lower than 1 (β < 1) (Table 3). Among those genotypes that presented the linear regression coefficient related to the unfavorable environments equal to 1, only two (30A68HX and AS1555YG) exhibited the linear response to the favorable environments higher than 1 (β + β > 1) (Table 3). No genotype presented stability parameter () equal to zero. On the other hand, the genotype AS1555YG presented coefficient of determination higher than 80% (Table 3). However, only the genotype 30A68HX presented higher mean productivity ().
Table 3

Estimates of the a posteriori means () and of the credible intervals (95%) of the adaptability () and stability parameters (, R2), by taking into consideration informative and non-informative priors for maize hybrids.

GenotypesLIβ-0iβ-0iLSβ-0iLIβ-1iβ-1iLSβ-1iLIβ-1i+β-2iβ-1i+β-2iLSβ-1i+β-2iLIσ-di2σ-di2LSσ-di2R2
Minimally informative priors (M1)
30A68HX9316.009465.569911.000.790.901.241.531.792.58841275.001115511.902114100.0077.44
30A16HX9295.009431.879838.031.441.541.851.721.952.67701600.00930323.501763075.0089.85
2B707HX9134.009288.449748.031.051.171.521.171.442.25896975.001189343.802254100.0080.12
2B587HX9088.009200.569536.001.241.321.581.261.452.04477375.00632954.501199075.0090.03
30A37HX9072.009195.199560.031.141.231.510.750.961.61566600.00751357.901424050.0086.00
2B604HX9052.009194.079615.031.021.131.451.571.822.56752675.00998071.401891100.0083.58
2B710HX9067.009181.739524.000.981.071.330.801.001.60496300.00658055.001247050.0084.56
30A95HX8894.009075.289615.031.081.221.631.171.482.431237000.001640324.803108150.0076.18
P4285HX8915.009055.949474.030.520.620.940.791.031.77743475.00985850.501868100.0063.06
30F53HR8816.009009.929585.030.740.881.32-0.160.181.191404000.001862281.103529150.0056.08
2B433HX8871.758960.699226.000.971.041.240.790.951.42298500.00395843.90750037.5089.45
20A55HR8733.008834.609136.001.141.221.450.560.731.26385100.00510603.50967545.0089.34
20A78HX8652.008817.599311.031.061.181.560.590.881.751034000.001370711.402597125.0075.88
DKB3708610.008759.799184.001.111.211.531.241.492.27788600.001055663.802012125.0083.03
2B688HX8612.008757.449190.000.820.931.260.310.561.32792775.001051201.201992100.0071.56
AG8041YG8601.008744.849173.030.730.841.170.560.811.57779900.001034193.001960075.0069.57
30K73H8578.008732.159190.000.981.091.440.791.061.86888300.001177882.702232100.0076.85
DKB3308295.008475.949015.030.901.031.440.811.122.071233000.001634763.203098125.0069.38
AS1596YG8200.008363.828851.000.730.851.220.801.081.941006000.001333766.702527125.0066.98
BM8208140.008269.688654.000.901.001.29-0.27-0.040.63626600.00830832.001574075.0077.21
AS1555YG7877.007996.518365.030.650.741.021.671.882.54580050.00777663.901477075.0080.68
Statusvip7495.007780.318629.030.750.961.610.420.922.413056000.004052797.707680325.0046.09
BRS20227453.007576.317944.030.630.721.000.090.300.95574200.00761371.301443050.0067.12
30A91HX7293.007487.648067.000.420.561.00-0.40-0.070.961424000.001887726.003577175.0037.25
BRS20207346.007466.967826.030.400.490.76-0.170.040.67547575.00726058.601376050.0050.86
Informative priors (M2)
30A68HX9464.009464.009464.000.810.901.171.591.802.41679400.00863019.301503000.0076.90
30A16HX9430.009430.009430.001.461.541.801.771.962.53576100.00731857.101274025.0089.75
2B707HX9287.009287.009287.001.081.171.451.241.452.07720600.00915214.001593025.0079.60
2B587HX9199.009199.009199.001.251.321.541.291.461.95409675.00520498.60907202.5090.00
30A37HX9194.009194.009194.001.161.231.460.790.971.49475900.00604773.601054000.0085.78
2B604HX9193.009192.799193.001.041.131.391.631.822.41613900.00779879.201358000.0083.29
2B710HX9180.009180.339181.000.991.071.290.841.011.50423700.00538376.70937920.0084.36
30A95HX9073.009073.009073.001.111.221.541.251.492.19971375.001233075.602145000.0075.36
P4285HX9054.009054.019054.000.540.620.880.841.041.62607100.00771220.601343000.0061.97
30F53HR9008.009008.009008.000.770.881.22-0.060.190.921095000.001388961.002415000.0053.95
2B433HX8960.008960.008960.000.981.041.220.810.951.37276400.00351203.30612807.5089.50
20A55HR8833.008833.588834.001.151.221.410.580.741.19340900.00433232.50755505.0089.29
20A78HX8816.008816.008816.001.081.181.480.670.891.55821775.001043252.901815025.0075.06
DKB3708756.008756.008756.000.850.931.200.370.571.16643500.00817506.601424000.0070.66
2B688HX8755.008755.008755.001.121.211.481.291.492.09649075.00824492.501436000.0082.39
AG8041YG8743.008743.008743.000.760.841.110.620.821.41634000.00805464.401403000.0068.65
30K73H8730.008730.268731.001.001.101.370.861.071.69714275.00907119.601579025.0076.18
DKB3308474.008474.008474.000.931.041.350.891.131.83968275.001229178.302138000.0068.21
AS1596YG8362.008362.008362.000.760.861.150.871.091.74801200.001017188.201770000.0065.83
BM8208268.008268.008268.000.921.001.24-0.22-0.040.51520500.00661250.801152025.0076.53
AS1555YG7990.007990.447991.000.670.740.981.701.882.41496000.00630259.801099000.0080.08
Statusvip7777.007777.007777.000.820.971.410.610.931.862302000.002913389.605059000.0042.69
BRS20227575.007575.007575.000.640.720.950.130.310.84481600.00611894.501067000.0066.20
30A91HX7485.007485.657486.000.450.560.90-0.30-0.060.681109000.001406803.702445025.0033.92
BRS20207466.007466.007466.000.420.490.72-0.130.040.56461800.00586780.901022025.0049.27

1LI: Lower Bound; LS: Upper Bound.

2: is the linear regression coefficient related to the unfavorable environments; : is the linear response to the favorable environments.

3 : is the stability parameter; R2: is the coefficient of determination.

1LI: Lower Bound; LS: Upper Bound. 2: is the linear regression coefficient related to the unfavorable environments; : is the linear response to the favorable environments. 3 : is the stability parameter; R2: is the coefficient of determination. Considering the results provided by the model M1, which is characterized by the minimally informative prior distributions, most genotypes (14 genotypes) presented the linear regression coefficient related to the unfavorable environments equal to 1 (β = 1), except 30A16HX, 2B707HX, 2B587HX, 30A37HX, 2B604HX, 20A55HR, 20A78HX and DKB370, which presented values higher than 1 (β > 1) and the genotypes P4285HX and BRS2020, which presented values lower than 1 (β < 1) (Table 3). Among those genotypes that presented the linear regression coefficient related to the unfavorable environments equal to 1, only two (30A68HX and AS1555YG) presented the linear response to the favorable environments higher than 1 (β + β > 1) (Table 3). No genotype presented stability parameter () equal to zero. On the other hand, the genotype AS1555YG presented coefficient of determination higher than 80% (Table 3). However, only the genotype 30A68HX presented higher mean productivity (). According to the results obtained by Model 2 (M2), which is characterized by the informative prior distributions, out of the 25 genotypes, 11 (30A68HX, 2B710HX, 30F53HR, 2B433HX, DKB370, AG8041YG, 30K73H, DKB330, AS1596YG and BM820) presented the linear regression coefficient related to the unfavorable environments equal to 1 (β = 1). The nine genotypes observed in the previous analysis presented values higher than 1 (β > 1) and five (P4285HX, AS1555YG, BRS2022, 30A91HX and BRS2020) presented values lower than 1 (β < 1) (Table 3). Among those genotypes that presented the linear regression coefficient related to the unfavorable environments equal to 1, only the genotype 30A68HX showed the linear response to the favorable environments higher than 1 (β + β > 1) and high mean productivity () (Table 3). This genotype is suitable for growers who employ high-level technology, since it responds well to improved environments [13]. In line with the M1 results, the genotype 30A68HX presented stability parameter () higher than zero and coefficient of determination lower than 80% (Table 3). The analysis considering M2 was able to better discriminate the genotypes, since, out of 25 genotypes, 14 and 11 presented the linear regression coefficient related to the unfavorable environments equal to 1 for M1 and M2 fitted models, respectively. A comparative analysis of the limits of credibility intervals obtained by the two fitted models (M1 and M2) reveals that the use of informative prior distributions (M2) reduced the limits of credibility intervals, when compared to minimally informative prior distributions (M1). Similar results were observed by [11], who used the Bayesian segmented regression model for adaptability and stability evaluation of cotton genotypes. Nascimento et al. [10], Couto et al. [20] and Teodoro et al. [25] used the Eberhart and Russel’s Bayesian method to evaluate the phenotypic stability and adaptability of alfalfa and popcorn cultivars and obtained similar results. Additionally, the difference in DIC values between models using minimally informative and informative priors ranged between 1.59 and 2.01. Once smaller DIC values indicate better data fitting, these results demonstrate that M2 should be considered for the adaptability and stability evaluation of maize genotypes (Table 4).
Table 4

Deviance Information Criterion (DIC) values obtained through the difference of DIC values between models using minimally informative (i) and informative priors (j) for maize hybrids.

GenotypesDicij
30A68HX1.80
30A16HX1.78
2B707HX1.80
2B587HX1.75
30A37HX1.77
2B604HX1.79
2B710HX1.76
30A95HX1.84
P4285H1.79
30F53HR1.85
2B433HX1.65
20A55HX1.72
20A78HX1.82
DKB3702.00
2B688HX1.59
AG8041YG1.79
30K73H1.80
DKB330YG1.84
AS1596R21.81
BM8201.78
AS1555YG1.71
Statusvip2.01
BRS20221.77
30A91HX1.86
BRS20201.77
Overall, the Bayesian framework of the segmented regression model allowed the incorporation of additional information related to the parameters, through prior distributions, which reduced the ranges of the credibility intervals, increased the precision of parameter estimates, and, consequently, provided reliable genotype selection. In practice, this information can be obtained from previous studies, including [10, 20]. Due to the lack of prior information related to the evaluated maize hybrids in the literature, in this work, the estimates obtained from the frequentist analysis of the segmented model were used to define the hyperparameters. In practice, most northeastern farmers in Brazil have limited capital, which prevents them from investing in production technology. Therefore, genotypes adapted to unfavorable environments should be considered for low technology planting [26]. The recommendation of cultivars not adapted to regional conditions leads to low yield and other serious problems, such as the indiscriminate use of pesticides and excessive cultural treatment [27]. Considering the results provided by the model M2 (informative prior distributions), only the genotype P4285HX presented the linear regression coefficient related to the unfavorable environments lower than 1 (β < 1) and high mean productivity () (Table 3).

Conclusions

Incorporating additional information about the parameters through prior distributions decreases the credibility interval ranges. The difference in DIC values between models using minimally informative (M1) and informative priors (M2) was positive, which indicates a better data fitting, considering M2. Therefore, it should be an alternative for the adaptability and stability evaluation of maize genotypes. The genotype P4285HX presents high yield performance and adaptability to unfavorable environments and should be considered for low technology planting, which is practiced by northeastern Brazilians farmers. (XLSX) Click here for additional data file.
  1 in total

1.  Limitations of conventional regression analysis a proposed modification.

Authors:  M M Verma; G S Chahal; B R Murty
Journal:  Theor Appl Genet       Date:  1978-09       Impact factor: 5.699

  1 in total
  1 in total

1.  Adaptability and stability of Coffea canephora to dynamic environments using the Bayesian approach.

Authors:  Fabio Luiz Partelli; Flavia Alves da Silva; André Monzoli Covre; Gleison Oliosi; Caio Cezar Guedes Correa; Alexandre Pio Viana
Journal:  Sci Rep       Date:  2022-07-08       Impact factor: 4.996

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.