| Literature DB >> 32442213 |
Murilo Viotto Del Conte1, Pedro Crescêncio Souza Carneiro2, Marcos Deon Vilela de Resende3, Felipe Lopes da Silva1, Luiz Alexandre Peternelli4.
Abstract
Path analysis allows understanding the direct and indirect effects among traits. Multicollinearity in correlation matrices may cause a bias in path analysis estimates. This study aimed to: a) understand the correlation among soybean traits and estimate their direct and indirect effects on gain oil content; b) verify the efficiency of ridge path analysis and trait culling to overcome colinearity. Three different matrices with different levels of collinearity were obtained by trait culling. Ridge path analysis was performed on matrices with strong collinearity; otherwise, a traditional path analysis was performed. The same analyses were run on a simulated dataset. Trait culling was applied to matrix R originating the matrices R1 and R2. Path analysis for matrices R1 and R2 presented a high determination coefficient (0.856 and 0.832, respectively) and low effect of the residual variable (0.379 and 0.410 respectively). Ridge path analysis presented low determination coefficient (0.657) and no direct effects greater than the effects of the residual variable (0.585). Trait culling was more effective to overcome collinearity. Mass of grains, number of nodes, and number of pods are promising for indirect selection for oil content.Entities:
Year: 2020 PMID: 32442213 PMCID: PMC7244132 DOI: 10.1371/journal.pone.0233290
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Code, description and assessment methodology of traits evaluated in soybean plants.
| CODE | DESCRIPTION | ASSESSMENT METHODOLOGY |
|---|---|---|
| DF | Days to Flowering | from emergency stage (VE stage) until beginning bloom (R1 stage) |
| HF | Height of plants on Flowering | from the base until the last apical meristem at R1 stage |
| DM | Days to Maturity | measured from the VE stage until full maturity (R8 stage) |
| HD | Hypocotyl Diameter | measured 2 centimeters below the cotyledon node at R8 stage |
| HM | Height of plants on Maturity | measured from the base to the last apical meristem at R8 stage |
| NP | Number of Pods per plant | number of pods per plant measured at R8 stage |
| NN | Number of Nodes per plant | number of nodes in the main stem per plant measured at R8 stage |
| NB | Number of Branches per plant | number of lateral branches per plant measured at R8 stage |
| NG | Number of Grains per plant | measured at R8 stage |
| MG | Mass of Grains per plant | corrected to 13% of humidity |
| OC | Oil content of grains | measured in percentage of soybean intact grains at R8 stage and corrected to 13% of humidity |
| PC | Portein content of grains |
Fig 1Path analysis diagram for matrices R and Rs’s.
Oil content was considered the main trait and DF—days to flowering; HF—height of plants on flowering; DM—days to maturity; HD—hypocotyl diameter; HM—height of plants on maturity; NP—number of pods per plant; NN—number of nodes in the main stem per plant; NB—number of branches per plant; NG—number of grains per plant; MG—mass of grains per plant; PC—protein content in grains; OC—oil content in grains and ERV–effect of residual variable, the explanatory variables.
Fig 2Path analysis diagram for matrices R1 and R1s’.
Oil content was considered the main trait and DM—days to maturity; HD—hypocotyl diameter; HM—height of plants on maturity; NP—number of pods per plant; NN—number of nodes in the main stem per plant; NG—number of grains per plant; MG—mass of grains per plant; PC—protein content in grains; OC—oil content in grains and ERV–effect of residual variable the explanatory variables.
Fig 3Path analysis diagram for matrices R2 and R2s’s.
Oil content was considered the main trait DM—days to maturity; HD—hypocotyl diameter; HM—height of plants on maturity; NN—number of nodes in the main stem per plant; NG—number of grains per plant; MG—mass of grains per plant; PC—protein content in grains; OC—oil content in grains and ERV–effect of residual variable the explanatory variables.
Genetic and environmental parameters estimated for days to flowering (DF), height of plants on flowering (HF), days to maturity (DM), hypocotyl diameter (HD), height of plants on maturity (HM), number of pods per plant (NP), number of nodes in the main stem per plant (NN), number of branches per plant (NB), number of grains per plant (NG), mass of grains per plant (MG), oil content in grains (OC), and protein content in grains (PC) evaluated in 15 soybean genotypes.
| Parameters | DF | HF | DM | HD | HM | NP | NN | NB | NG | MG | OC | PC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| σ²g | 34.3 | 201 | 90.2 | 0.333 | 174 | 52.5 | 1.84 | 0.486 | 151 | 2.85 | 0.667 | 1.38 |
| σ²e | 2.81 | 24.4 | 28.5 | 0.228 | 71.9 | 10.6 | 0.544 | 0.832 | 65.4 | 1.79 | 1.24 | 1.21 |
| σ²f | 37.1 | 224 | 119 | 0.561 | 246 | 63.2 | 2.39 | 1.32 | 217 | 4.64 | 1.91 | 2.58 |
| h2g | 0.924 | 0.891 | 0.76 | 0.593 | 0.708 | 0.832 | 0.772 | 0.368 | 0.698 | 0.613 | 0.35 | 0.532 |
| râa | 0.992 | 0.988 | 0.97 | 0.938 | 0.961 | 0.98 | 0.972 | 0.863 | 0.959 | 0.942 | 0.854 | 0.922 |
| CVg% | 14.09 | 32.1 | 8.07 | 11.5 | 19.5 | 23.9 | 11.206 | 15.5 | 18.4 | 14.8 | 4.28 | 3 |
| CVe% | 4.03 | 11.2 | 4.53 | 9.54 | 12.5 | 10.8 | 6.091 | 20.3 | 12.1 | 11.7 | 5.84 | 2.81 |
| CVr | 3.49 | 2.86 | 1.78 | 1.21 | 1.56 | 2.22 | 1.84 | 0.764 | 1.52 | 1.26 | 0.733 | 1.07 |
| PEV | 0.552 | 4.76 | 5.36 | 0.04 | 13.3 | 2.04 | 0.103 | 0.124 | 12 | 0.319 | 0.181 | 0.206 |
| SEP | 0.743 | 2.18 | 2.31 | 0.2 | 3.64 | 1.43 | 0.321 | 0.352 | 3.47 | 0.565 | 0.425 | 0.453 |
| Mean | 41.5 | 44.1 | 118 | 5.01 | 67.8 | 30.3 | 12.1 | 4.49 | 66.8 | 11.4 | 19.1 | 39.1 |
* Significant at the 0.01 probability level by the LRT test (Likelihood Ratio Test) using chi-squared test with one degree of freedom;
σ²g = genotypic variance; σ²e = residual variance; σ²f = individual phenotypic variance; h2g = wide hereditability of individual plots; râa = selection accuracy of genotypes, assuming no plots lost; CVg% = genotypic coefficient of variation; CVe% = residual coefficient of variation; CVr = CVg/CVe = relative coefficient of variation; PEV = prediction error variance, assuming no plots lost; SEP = standard deviation of predicted genetic values, assuming no plots lost and Mean = general mean of three experiment.
Genotypic values (u + g) of the genotypes evaluated for days to flowering (DF), height of plants on flowering (HF), days to maturity (DM), hypocotyl diameter (HD), height of plants on maturity (HM), number of nodes in the main stem per plant (NN), number of pods per plant (NP), number of branches per plant (NB), number of grains per plant (NG), mass of grains per plant (MG), oil content in grains (OC), and protein content in grains (PC) evaluated in 15 soybean genotypes.
| Variety | Traits | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DF | HF | DM | HD | HM | NN | NP | NB | NG | MG | OC | PC | |
| u + g | ||||||||||||
| BMX Apolo RR | 36.0 | 21.4 | 106.0 | 4.5 | 45.4 | 20.9 | 10.1 | 3.5 | 54.2 | 9.0 | 17.9 | 41.2 |
| DM 5958 IPRO | 34.4 | 28.2 | 106.2 | 4.0 | 51.2 | 23.4 | 10.9 | 4.4 | 54.1 | 9.1 | 19.4 | 38.4 |
| NA 5909 RR | 35.2 | 42.7 | 107.4 | 4.6 | 72.4 | 26.1 | 10.5 | 4.9 | 60.0 | 10.7 | 18.8 | 38.6 |
| TMG 7062 IPRO | 36.7 | 35.6 | 103.5 | 4.7 | 59.5 | 22.1 | 10.9 | 4.1 | 53.0 | 10.9 | 18.9 | 38.8 |
| DM 6563 IPRO | 36.7 | 30.6 | 114.1 | 4.8 | 61.2 | 26.6 | 11.9 | 3.7 | 64.5 | 10.9 | 19.0 | 38.0 |
| BMX Potência RR | 37.1 | 29.7 | 111.3 | 4.7 | 59.9 | 24.0 | 11.3 | 4.2 | 58.6 | 9.7 | 18.3 | 39.6 |
| BMX Desafio RR | 34.3 | 30.8 | 117.3 | 5.6 | 62.7 | 23.8 | 11.7 | 3.9 | 58.2 | 10.8 | 18.1 | 40.2 |
| M 7739 IPRO | 45.4 | 50.5 | 119.4 | 5.5 | 66.5 | 33.1 | 12.3 | 5.0 | 76.4 | 12.5 | 19.5 | 37.6 |
| P98Y30 RR | 44.8 | 56.7 | 123.3 | 5.0 | 79.9 | 37.1 | 13.6 | 5.5 | 80.1 | 12.9 | 20.6 | 38.8 |
| M8349 IPRO | 45.4 | 46.0 | 124.3 | 5.1 | 63.7 | 35.5 | 12.6 | 5.0 | 82.6 | 13.9 | 19.8 | 37.8 |
| TMG 132 RR | 47.4 | 52.0 | 128.8 | 4.8 | 70.4 | 36.5 | 13.3 | 5.1 | 80.3 | 11.2 | 19.1 | 38.8 |
| M 9056 RR | 51.2 | 65.1 | 127.3 | 5.8 | 94.1 | 42.6 | 15.0 | 5.0 | 88.4 | 13.4 | 19.5 | 38.2 |
| M 8221 RR | 46.2 | 50.7 | 119.7 | 5.8 | 69.2 | 40.8 | 12.5 | 4.5 | 66.8 | 12.3 | 19.0 | 40.3 |
| P98N71 | 45.8 | 57.1 | 131.2 | 4.9 | 76.3 | 31.1 | 13.1 | 4.5 | 61.0 | 10.9 | 19.3 | 39.8 |
| Conquista | 46.3 | 64.0 | 125.7 | 5.5 | 84.6 | 31.0 | 12.1 | 4.1 | 63.6 | 13.4 | 19.3 | 40.1 |
u + g = general mean summed to genotypic effects for each genotype.
Estimates of genotypic correlation between the traits DF, HF, DM, HD, HM, NP, NN, NB, NG, MG, OC and PC for fifteen soybean genotypes from the original data set below the diagonal and for the simulated populations above the diagonal, including the standard deviation bellow.
| Traits | DF | HF | DM | HD | HM | NP | NN | NB | NG | MG | OC | PC |
| DF | 1 | 0.9 | 0.86 | 0.61 | 0.75 | 0.91 | 0.86 | 0.61 | 0.81 | 0.78 | 0.59 | -0.19 |
| 0.02 | 0.03 | 0.06 | 0.04 | 0.02 | 0.03 | 0.06 | 0.04 | 0.04 | 0.07 | 0.1 | ||
| HF | 0.9 | 1 | 0.82 | 0.6 | 0.92 | 0.83 | 0.8 | 0.65 | 0.68 | 0.8 | 0.65 | -0.2 |
| 0.03 | 0.06 | 0.01 | 0.03 | 0.03 | 0.06 | 0.05 | 0.04 | 0.07 | 0.1 | |||
| DM | 0.86 | 0.82 | 1 | 0.59 | 0.72 | 0.79 | 0.86 | 0.51 | 0.71 | 0.69 | 0.51 | -0.1 |
| 0.07 | 0.05 | 0.05 | 0.03 | 0.08 | 0.05 | 0.06 | 0.09 | 0.11 | ||||
| HD | 0.62 | 0.61 | 0.59 | 1 | 0.62 | 0.63 | 0.58 | 0.22 | 0.5 | 0.73 | 0.14 | 0.06 |
| 0.06 | 0.06 | 0.08 | 0.1 | 0.07 | 0.05 | 0.11 | 0.1 | |||||
| HM | 0.75 | 0.92 | 0.73 | 0.63 | 1 | 0.73 | 0.79 | 0.56 | 0.62 | 0.73 | 0.53 | -0.19 |
| 0.04 | 0.03 | 0.07 | 0.05 | 0.04 | 0.08 | 0.1 | ||||||
| NP | 0.91 | 0.83 | 0.78 | 0.65 | 0.73 | 1 | 0.88 | 0.72 | 0.87 | 0.78 | 0.65 | -0.29 |
| 0.03 | 0.05 | 0.02 | 0.04 | 0.06 | 0.09 | |||||||
| NN | 0.86 | 0.80 | 0.86 | 0.59 | 0.79 | 0.88 | 1 | 0.64 | 0.85 | 0.7 | 0.64 | -0.33 |
| 0.06 | 0.03 | 0.06 | 0.07 | 0.09 | ||||||||
| NB | 0.62 | 0.66 | 0.51 | 0.24 | 0.58 | 0.72 | 0.65 | 1 | 0.77 | 0.56 | 0.76 | -0.57 |
| 0.04 | 0.06 | 0.04 | 0.07 | |||||||||
| NG | 0.81 | 0.68 | 0.72 | 0.52 | 0.62 | 0.87 | 0.85 | 0.77 | 1 | 0.77 | 0.66 | -0.54 |
| 0.04 | 0.06 | 0.07 | ||||||||||
| MG | 0.78 | 0.81 | 0.69 | 0.73 | 0.73 | 0.79 | 0.70 | 0.57 | 0.77 | 1 | 0.67 | -0.36 |
| 0.06 | 0.09 | |||||||||||
| OC | 0.60 | 0.66 | 0.51 | 0.16 | 0.54 | 0.65 | 0.64 | 0.75 | 0.67 | 0.67 | 1 | -0.6 |
| 0.06 | ||||||||||||
| PC | -0.19 | -0.2 | -0.09 | 0.05 | -0.19 | -0.28 | -0.33 | -0.56 | -0.53 | -0.36 | -0.60 | 1 |
* Significant at the 0.05 probability level;
** Significant at the 0.01 probability level according to Mantel’s test.
Eigenvalues, condition number (CN) and its standard deviations between parenthesis of the correlation matrices R, R1, R2, Rs, R1s and R2s between explanatory variables.
| Rank | Eigenvalues | |||||
|---|---|---|---|---|---|---|
| R | R1 | R2 | Rs | R1s | R2s | |
| 1 | 7.67 | 5.13 | 4.72 | 7.65(0.28) | 5.46(0.21) | 4.61(0.20) |
| 2 | 1.40 | 1.36 | 1.11 | 1.44(0.16) | 1.19(0.11) | 1.19(0.11) |
| 3 | 0.590 | 0.550 | 0.388 | 0.618(0.09) | 0.534(0.09) | 0.523(0.09) |
| 4 | 0.454 | 0.338 | 0.336 | 0.444(0.06) | 0.336(0.05) | 0.317(0.05) |
| 5 | 0.314 | 0.282 | 0.240 | 0.302(0.05) | 0.203(0.03) | 0.185(0.03) |
| 6 | 0.226 | 0.183 | 0.128 | 0.215(0.03) | 0.150(0.03) | 0.121(0.02) |
| 7 | 0.147 | 0.111 | 0.077 | 0.139(0.02) | 0.075(0.01) | 0.050(0.01) |
| 8 | 0.104 | 0.048 | 0.096(0.01) | 0.047(0.01) | ||
| 9 | 0.058 | 0.054(0.01) | ||||
| 10 | 0.037 | 0.033(0.01) | ||||
| 11 | 0.004 | 0.003(0.00) | ||||
| CN | 2081 | 105 | 86.5 | 2337(559.91) | 119(22.53) | 95.3(18.22) |
| Multicollinearity level | Severe | Moderate | Weak | Severe | Moderate | Weak |
R and Rs = original set of the correlation matrix between explanatory variables from original and simulated dataset, respectively; R1 and R1s = correlation matrix with trait culling to three explanatory variables from original and simulated dataset, respectively; R2 and R2s = correlation matrix with trait culling to four explanatory variables from original and simulated dataset, respectively.
Estimates of direct and indirect effects for: DF, HF, DM, HD, HM, NP, NN, NB, NG, MG and PC on OC evaluated in 15 soybean genotypes onto the matrix R.
| Explanatory variables | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Effect | DF | HF | DM | HD | HM | NP | NN | NB | NG | MG | PC |
| Direct on OC | 0.035 | 0.141 | 0.019 | -0.207 | 0.020 | 0.086 | 0.105 | 0.196 | 0.022 | 0.216 | -0.205 |
| Indirect via DF | - | 0.032 | 0.031 | 0.022 | 0.027 | 0.032 | 0.031 | 0.022 | 0.029 | 0.028 | -0.007 |
| Indirect via HF | 0.127 | - | 0.115 | 0.086 | 0.131 | 0.117 | 0.113 | 0.094 | 0.096 | 0.114 | -0.029 |
| Indirect via DM | 0.017 | 0.016 | - | 0.011 | 0.014 | 0.015 | 0.016 | 0.010 | 0.014 | 0.013 | -0.002 |
| Indirect via HD | -0.128 | -0.126 | -0.122 | - | -0.129 | -0.134 | -0.121 | -0.049 | -0.106 | -0.151 | -0.010 |
| Indirect via HM | 0.015 | 0.018 | 0.014 | 0.012 | - | 0.014 | 0.016 | 0.011 | 0.012 | 0.014 | -0.004 |
| Indirect via NP | 0.079 | 0.071 | 0.068 | 0.056 | 0.063 | - | 0.076 | 0.063 | 0.075 | 0.068 | -0.024 |
| Indirect via NN | 0.090 | 0.084 | 0.090 | 0.061 | 0.083 | 0.092 | - | 0.068 | 0.089 | 0.073 | -0.034 |
| Indirect via NB | 0.121 | 0.130 | 0.101 | 0.046 | 0.113 | 0.142 | 0.126 | - | 0.151 | 0.111 | -0.109 |
| Indirect via NG | 0.018 | 0.015 | 0.016 | 0.012 | 0.014 | 0.019 | 0.019 | 0.017 | - | 0.017 | -0.012 |
| Indirect via MG | 0.169 | 0.174 | 0.148 | 0.158 | 0.158 | 0.170 | 0.152 | 0.122 | 0.167 | - | -0.077 |
| Indirect via PC | 0.039 | 0.041 | 0.018 | -0.010 | 0.040 | 0.057 | 0.067 | 0.114 | 0.108 | 0.073 | - |
| Total | 0.60 | 0.66 | 0.51 | 0.16 | 0.54 | 0.65 | 0.64 | 0.76 | 0.67 | 0.67 | -0.60 |
| R² | 0.657 | ||||||||||
| K | 0.445 | ||||||||||
| ERV | 0.585 | ||||||||||
Total = correlation coefficient between the trait and OC; R² = Determination coefficient; K = constant used in the analysis and; ERV = effect of the residual variable.
Estimates of direct and indirect effects for: DM, HD, HM, NP, NN, NG, MG and PC on OC evaluated in 15 soybean genotypes onto the matrix R1.
| Explanatory variables | ||||||||
|---|---|---|---|---|---|---|---|---|
| Effect | DM | HD | HM | NP | NN | NG | MG | PC |
| Direct on OC | -0.126 | -0.758 | -0.110 | 0.427 | 0.632 | -0.556 | 0.967 | -0.223 |
| Indirect via DM | - | -0.075 | -0.092 | -0.099 | -0.109 | -0.091 | -0.087 | 0.011 |
| Indirect via HD | -0.449 | - | -0.474 | -0.491 | -0.445 | -0.391 | -0.556 | -0.037 |
| Indirect via HM | -0.080 | -0.069 | - | -0.081 | -0.087 | -0.069 | -0.081 | 0.021 |
| Indirect via NP | 0.333 | 0.276 | 0.312 | - | 0.374 | 0.371 | 0.335 | -0.119 |
| Indirect via NN | 0.543 | 0.371 | 0.500 | 0.554 | - | 0.536 | 0.443 | -0.207 |
| Indirect via NG | -0.398 | -0.286 | -0.347 | -0.483 | -0.472 | - | -0.429 | 0.294 |
| Indirect via MG | 0.663 | 0.709 | 0.709 | 0.760 | 0.679 | 0.747 | - | -0.344 |
| Indirect via PC | 0.020 | -0.011 | 0.043 | 0.062 | 0.073 | 0.118 | 0.079 | - |
| Total | 0.51 | 0.16 | 0.54 | 0.65 | 0.64 | 0.67 | 0.67 | -0.60 |
| R² | 0.856 | |||||||
| ERV | 0.379 | |||||||
Total = correlation coefficient between the trait and OC; R² = determination coefficient; and ERV = effect of the residual variable.
The traits DF, HF and NB were eliminated from the matrix R in originating the matrix R1.
Estimates of direct and indirect effects for: DM, HD, HM, NN, NG, MG, PC on OC evaluated in 15 soybean genotypes onto the matrix R2.
| Explanatory variables | |||||||
|---|---|---|---|---|---|---|---|
| Effect | DM | HD | HM | NN | NG | MG | PC |
| Direct on OC | -0.158 | -0.744 | -0.095 | 0.804 | -0.347 | 1.05 | -0.148 |
| Indirect via DM | - | -0.094 | -0.115 | -0.136 | -0.113 | -0.018 | 0.014 |
| Indirect via HD | -0.440 | - | -0.466 | -0.437 | -0.384 | -0.546 | -0.036 |
| Indirect via HM | -0.069 | -0.059 | - | -0.075 | -0.059 | -0.070 | 0.018 |
| Indirect via NN | 0.690 | 0.472 | 0.636 | - | 0.682 | 0.564 | -0.260 |
| Indirect via NG | -0.249 | -0.179 | -0.217 | -0.295 | - | -0.268 | 0.184 |
| Indirect via MG | 0.718 | 0.768 | 0.768 | 0.735 | 0.809 | - | -0.372 |
| Indirect via PC | 0.013 | -0.007 | 0.029 | 0.048 | 0.079 | 0.054 | - |
| Total | 0.51 | 0.16 | 0.54 | 0.64 | 0.67 | 0.67 | -0.60 |
| R² | 0.832 | ||||||
| ERV | 0.410 | ||||||
Total = correlation coefficient between the trait and OC; R² = determination coefficient; and ERV = effect of the residual variable.
The traits DF, HF, NN and NB were eliminated from the matrix R in originating the matrix R2.
Mean direct effects and standard deviation between parentheses for explanatory traits, k-value (K), effect of residual variable (ERV) and R-squared (R²) of the path analysis for the simulated populations.
| DF | HF | DM | HD | HM | NP | NN | NB | NG | MG | PC | K | ERV | R² | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Rs’s | 0.011 | 0.169 | 0.016 | -0.265 | -0.013 | 0.099 | 0.140 | 0.211 | -0.024 | 0.283 | -0.203 | 0.334 | 0.544 | 0.701 |
| (0.04) | (0.05) | (0.04) | (0.09) | (0.07) | (0.04) | (0.07) | (0.04) | (0.08) | (0.1) | (0.04) | (0.14) | (0.05) | (0.04) | |
| R1s’s | - | - | -0.124 | -0.772 | -0.106 | 0.416 | 0.631 | - | -0.552 | 0.969 | -0.214 | - | 0.363 | 0.866 |
| - | - | (0.10) | (0.10) | (0.07) | (0.11) | (0.16) | - | (0.12) | (0.11) | (0.07) | - | (0.04) | (0.03) | |
| R2s’s | - | - | -0.146 | -0.762 | -0.091 | - | 0.803 | - | -0.357 | 1.048 | -0.143 | - | 0.394 | 0.844 |
| - | - | (0.10) | (0.10) | (0.08) | - | (0.16) | - | (0.12) | (0.11) | (0.07) | - | (0.04) | (0.03) |
Rs’s = matrices from simulated dataset composed of DF, HF, DM, HD, HM, NP, NN, NB, NG, MG and PC; R1s’s = matrices from simulated dataset composed of DM, HD, HM, NP, NN, NG, MG and PC; R2s’s = matrices from simulated dataset composed of DM, HD, HM, NN, NG, MG and PC.