| Literature DB >> 35614333 |
Geovane da Silva André1, Paulo Carteri Coradi2,3,4, Larissa Pereira Ribeiro Teodoro1, Paulo Eduardo Teodoro1.
Abstract
The monitoring and evaluating the physical and physiological quality of seeds throughout storage requires technical and financial resources and is subject to sampling and laboratory errors. Therefore, machine learning (ML) techniques could help optimize the processes and obtain accurate results for decision-making in the seed storage process. This study aimed to analyze the performance of ML algorithms from variables monitored during seed conditioning (temperature and packaging) and storage time to predict the physical and physiological quality of stored soybean seeds. Data analysis was performed using the Artificial Neural Networks, decision tree algorithms REPTree and M5P, Random Forest, and Linear Regression. In predicting seed quality, the combination of the input variables temperature and storage time for REPTree and Random Forest algorithms outperformed the linear regression, providing higher accuracy indices. Among the most important results, it was observed for apparent specific mass that T + P + ST, T + ST, P + ST, and ST had the highest r means and the lowest MAE means, however, Person's r coefficient for these inputs was 0.63 and the MAE between 9.59 to 10.47. The germination results for inputs T + P + ST and T + ST had the best results (r = 0.65 and r = 0.67, respectively) in the ANN, REPTree, M5P and RF models. Using computational intelligence algorithms is an excellent alternative to predict the quality of soybean seeds from the information of easy-to-measure variables.Entities:
Mesh:
Year: 2022 PMID: 35614333 PMCID: PMC9132987 DOI: 10.1038/s41598-022-12863-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Experimental scheme.
Experimental design and grouping of storage environments.
| Packaging | Storage temperature (°C) | Storage time (months) |
|---|---|---|
| With coating | 25 | 0 |
| With coating | 25 | 3 |
| With coating | 25 | 6 |
| With coating | 25 | 9 |
| With coating | 25 | 12 |
| With coating | 15 | 0 |
| With coating | 15 | 3 |
| With coating | 15 | 6 |
| With coating | 15 | 9 |
| With coating | 15 | 12 |
| With coating | 10 | 0 |
| With coating | 10 | 3 |
| With coating | 10 | 6 |
| With coating | 10 | 9 |
| With coating | 10 | 12 |
| Uncoating | 25 | 0 |
| Uncoating | 25 | 3 |
| Uncoating | 25 | 6 |
| Uncoating | 25 | 9 |
| Uncoating | 25 | 12 |
| Uncoating | 15 | 0 |
| Uncoating | 15 | 3 |
| Uncoating | 15 | 6 |
| Uncoating | 15 | 9 |
| Uncoating | 15 | 12 |
| Uncoating | 10 | 0 |
| Uncoating | 10 | 3 |
| Uncoating | 10 | 6 |
| Uncoating | 10 | 9 |
| Uncoating | 10 | 12 |
The P-value from the analysis of variance for Pearson's correlation coefficient (r) between observed and estimated values of moisture content (MC), apparent specific mass (ASM), electrical conductivity (EC), germination (G), and vigor (V) of soybean seeds by different machine learning models and inputs.
| Sources of variation | MC | ASM | EC | G | V | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| r | MAE | r | MAE | R | MAE | r | MAE | r | MAE | |
| Models (M) | < 0.00 | < 0.00 | 0.99 | 0.00 | 0.03 | < 0.00 | < 0.00 | < 0.00 | 0.02 | 0.00 |
| Inputs (I) | < 0.00 | < 0.00 | 0.00 | 0.00 | 0.00 | < 0.00 | < 0.00 | < 0.00 | 0.00 | 0.00 |
| MxI | < 0.00 | < 0.00 | 1.00 | 1.00 | 0.43 | < 0.00 | < 0.00 | < 0.00 | 0.40 | 0.64 |
Unfolding the significant interaction between model x input for Pearson's correlation coefficient (r) between the observed and estimated values of moisture content in soybean seeds by different machine learning models and inputs.
| Models | T | P + T | ST + P + T | ST + T | P | ST + P | ST |
|---|---|---|---|---|---|---|---|
| ANN | 0.36 aE | 0.43 aD | 0.94 aA | 0.86 aB | 0.10 aF | 0.63 aC | 0.63 aC |
| REPTree | 0.36 aE | 0.43 aD | 0.95 aA | 0.87 aB | 0.10 aF | 0.63 aC | 0.63 aC |
| LR | 0.36 aC | 0.37 aC | 0.72 bA | 0.72 bA | 0.10 aD | 0.63 aB | 0.63 aB |
| M5P | 0.36 aE | 0.43 aD | 0.94 aA | 0.87 aB | 0.10 aF | 0.63 aC | 0.63 aC |
| RF | 0.36 aE | 0.43 aD | 0.95 aA | 0.87 aB | 0.10 aF | 0.63 aC | 0.63 aC |
Means followed by equal lowercase letters in the same column and equal uppercase letters in the same row do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Unfolding the significant interaction between model x input for mean absolute error (MAE) between the observed and estimated values of moisture content in soybean seeds by different machine learning models and inputs.
| Models | T | P + T | ST + P + T | ST + T | P | ST + P | ST |
|---|---|---|---|---|---|---|---|
| ANN | 1.26 aA | 1.22 aA | 0.41 bD | 0.67 aC | 1.33 aA | 0.92 aB | 0.92 aB |
| REPTree | 1.07 bA | 1.09 bA | 0.30 bD | 0.53 bC | 1.11 bA | 0.80 aB | 0.81 aB |
| LR | 1.07 bA | 1.09 bA | 0.73 aB | 0.73 aB | 1.11 bA | 0.81 aB | 0.81 aB |
| M5P | 1.07 bA | 1.09 bA | 0.32 bC | 0.53 bD | 1.11 bA | 0.81 aB | 0.81 aB |
| RF | 1.07 bA | 1.09 bA | 0.30 bC | 0.53 bD | 1.11 bA | 0.81 aB | 0.81 aB |
Means followed by equal lowercase letters in the same column and equal uppercase letters in the same row do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Figure 2Mean values and scatter plot for the Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of moisture content in soybean seeds by different machine learning models and inputs.
Clustering of means for the Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of apparent specific mass in soybean seeds by different learning models.
| Models | r | MAE |
|---|---|---|
| ANN | 0.35 a | 17.12 a |
| REPTree | 0.35 a | 10.45 b |
| LR | 0.35 a | 10.45 b |
| M5P | 0.36 a | 10.45 b |
| RF | 0.35 a | 10.44 b |
Means followed by the same letters in the same column do not differ by the Scott-Knott test at 5% probability.
Clustering of means for the Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of apparent specific mass in soybean seeds by different inputs.
| Input | r | MAE |
|---|---|---|
| T | 0.00 b | 13.86 a |
| P + T | − 0.02 b | 13.88 a |
| ST + P + T | 0.63 a | 10.47 b |
| ST + T | 0.63 a | 10.31b |
| P | − 0.02 b | 13.88 a |
| ST + P | 0.63 a | 10.47 b |
| ST | 0.63 a | 9.59 b |
Means followed by the same letters in the same column do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Figure 3Mean values and scatter plot for the variables Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of apparent specific mass in soybean seeds by different machine learning models and inputs.
Clustering of means for the Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of electrical conductivity in soybean seeds by different learning models.
| Models | r |
|---|---|
| ANN | 0.41 a |
| REPTree | 0.42 a |
| LR | 0.38 b |
| M5P | 0.42 a |
| RF | 0.42 a |
Means followed by the same letters in the same column do not differ by the Scott-Knott test at 5% probability.
Clustering of means for the Pearson's correlation coefficient (r) between observed and estimated values of electrical conductivity in soybean seeds by different inputs.
| Input | r |
|---|---|
| T | 0.32 c |
| P + T | 0.34 c |
| ST + P + T | 0.65 a |
| ST + T | 0.63 a |
| P | 0.03 d |
| ST + P | 0.45 b |
| ST | 0.44 b |
Means followed by the same letters in the same column do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Unfolding the significant interaction between model x input for mean absolute error (MAE) between the observed and estimated values of electrical conductivity in soybean seeds by different machine learning models and inputs.
| Models | T | P + T | ST + P + T | ST + T | P | ST + P | ST |
|---|---|---|---|---|---|---|---|
| ANN | 29.91 aA | 30.43 aA | 25.13 aB | 25.24 aB | 31.12 aA | 30.23 aA | 30.35 aA |
| REPTree | 28.25 aA | 28.33 aA | 21.67 bB | 21.94 bB | 29.93 aA | 26.61 aA | 26.80 aA |
| LR | 28.25 aA | 28.37 aA | 25.47 aB | 25.39 aB | 29.95 aA | 26.81 aB | 26.77 aB |
| M5P | 28.25 aA | 1.03 bC | 21.60 bB | 22.01 bB | 29.95 aA | 26.78 aA | 26.77 aA |
| RF | 28.26 aA | 28.34 aA | 21.67 bB | 21.95 bB | 29.94 aA | 26.61 aA | 26.81 aA |
Means followed by equal lowercase letters in the same column and equal uppercase letters in the same row do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Figure 4Mean values and scatter plot for the variables Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of electrical conductivity in soybean seeds by different machine learning models and inputs.
Unfolding the significant interaction between model x input for Pearson's correlation coefficient (r) between the observed and estimated values of germination in soybean seeds by different machine learning models and inputs.
| Models | T | P + T | ST + P + T | ST + T | P | ST + P | ST |
|---|---|---|---|---|---|---|---|
| ANN | 0.33 aB | 0.33 Ab | 0.67 aA | 0.65 aA | 0.03 aC | 0.36 bB | 0.35 aB |
| REPTree | 0.33 aB | 0.33 aB | 0.67 aA | 0.65 aA | 0.03 aC | 0.65 aA | 0.35 aB |
| LR | 0.33 aB | 0.33 aB | 0.48 bA | 0.48 bA | -0.02 aC | 0.35 bB | 0.35 aB |
| M5P | 0.33 aB | 0.32 aB | 0.66 aA | 0.65 aA | -0.02 aC | 0.36 bB | 0.35 aB |
| RF | 0.33 aB | 0.33 aB | 0.67 aA | 0.65 aA | 0.03 aC | 0.36 bB | 0.35 aB |
Means followed by equal lowercase letters in the same column and equal uppercase letters in the same row do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Unfolding the significant interaction between model x input for mean absolute error (MAE) between the observed and estimated values of germination in soybean seeds by different machine learning models and inputs.
| Models | T | P + T | ST + P + T | ST + T | P | ST + P | ST |
|---|---|---|---|---|---|---|---|
| ANN | 13.33 Aa | 13.65 bA | 9.77 aC | 11.61B | 13.70 aA | 14.71 aA | 12.16 aB |
| REPTree | 11.75 aA | 11.76 bA | 8.95 aB | 9.10 bB | 12.67 aA | 9.10 cB | 11.89 aA |
| LR | 11.77 aA | 11.79 bA | 11.26 aA | 11.25 aA | 12.67 aA | 11.89 bA | 11.87 aA |
| M5P | 11.77 aA | 11.84 bA | 9.05 aB | 9.21 bB | 12.67 aA | 11.89 bA | 11.87 aA |
| RF | 11.76 aB | 17.01 aA | 8.95 aC | 9.10 bC | 12.68aB | 11.92 bB | 11.89 aB |
Means followed by equal lowercase letters in the same column and equal uppercase letters in the same row do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Figure 5Mean values and scatter plot for the variables Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of germination in soybean seeds by different machine learning models and inputs.
Clustering of means for the Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of vigor in soybean seeds by different learning models.
| Models | r | MAE |
|---|---|---|
| ANN | 0.44 a | 17.33 a |
| REPTree | 0.44 a | 15.46 c |
| LR | 0.39 b | 16.20 b |
| M5P | 0.44 a | 15.51 c |
| RF | 0.43 a | 15.46 c |
Means followed by the same letters in the same column do not differ by the Scott-Knott test at 5% probability.
Clustering of means for the Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of vigor in soybean seeds by different inputs.
| Input | r | MAE |
|---|---|---|
| T | 0.34 c | 17.23 b |
| P + T | 0.33 c | 17.27 b |
| ST + P + T | 0.68 a | 13.09 d |
| ST + T | 0.68 a | 13.24 d |
| P | 0.01 d | 18.56 a |
| ST + P | 0.47 a | 16.29 c |
| ST | 0.47 a | 26.28 c |
Means followed by the same letters in the same column do not differ by the Scott-Knott test at 5% probability.
T temperature, P packaging, ST storage time.
Figure 6Mean values and scatter plot for the variables Pearson's correlation coefficient (r) and mean absolute error (MAE) between observed and estimated values of vigor in soybean seeds by different machine learning models and inputs.