| Literature DB >> 35983145 |
Qiusi Zhang1,2, Bo Li3, Yong Zhang3, Shufeng Wang1.
Abstract
With the continuous growth of the global population, insufficient food production has become an urgent problem to be solved in most countries. At present, using artificial intelligence technology to improve suitability between land and crop varieties to increase crop yields has become a consensus among agricultural researchers. However, there are still many problems in existing works, such as limited crop phenotypic data and the poor performance of artificial intelligence models. In this regard, we take maize as an example to collect a large amount of environmental climate and crop phenotypic traits data at multiple experimental sites and construct an extensive dataset. Then, we introduce a graph neural network model to learn crop suitability evaluation and finally achieve a good evaluation effect. The evaluation results of the model can not only provide a reference for expert evaluation but also judge the suitability of the variety to other test trial sites according to the data of the current one, so as to guide future breeding experiments.Entities:
Mesh:
Year: 2022 PMID: 35983145 PMCID: PMC9381238 DOI: 10.1155/2022/5614974
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Visualization of numerical distribution of relative change of yield (up) and plant height (down); the left column is the original data, and the right column is after normalization.
Accuracy comparison of the following networks with different numbers of training samples.
| Number of training samples | 50 | 100 | 400 | 700 | 1000 | 2000 |
|---|---|---|---|---|---|---|
| Raw data | 0.651 | 0.660 | 0.666 | 0.674 | 0.673 | 0.675 |
| Standardized data |
|
|
|
|
|
|
| Normalized data | 0.551 | 0.556 | 0.562 | 0.555 | 0.564 | 0.576 |
The performance comparison of traditional machine learning methods and neural network under two data initialization situations. Important data are mar—d in bold.
| KNN | LR | SVM | NB | RF | DT | MLP | RBFNN | GAT | GCN | |
|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 0.647 | 0.676 | 0.671 | 0.567 | 0.670 | 0.553 | 0.684 | 0.681 | 0.731 |
|
| Precision score | 0.628 | 0.651 | 0.644 | 0.562 | 0.662 | 0.563 | 0.670 | 0.657 |
|
|
| Recall score | 0.870 | 0.880 | 0.892 | 0.946 | 0.812 | 0.583 | 0.834 | 0.872 | 0.708 |
|
|
| 0.729 | 0.748 | 0.748 | 0.705 | 0.730 | 0.545 | 0.749 | 0.750 | 0.725 |
|
| AUC score | 0.623 | 0.654 | 0.647 | 0.526 | 0.655 | 0.483 | 0.673 | 0.661 | 0.732 |
|
Data correlation between 39 types of data and proposed label. The trait data are replaced by abbreviations. Climate data are prefixed with A and V, which represent the mean and variance, respectively.
| LB-0.041 | LR-0.059 | IR-0.052 | GSD-0.015 | PH 0.014 | EH 0.016 | ESR-0.062 | DP-0.017 |
|---|---|---|---|---|---|---|---|
| ER 0.013 | HGW 0.047 | EL 0.038 | BTL-0.011 | FEF 0.033 | AY 0.045 |
| Label 1.000 |
| AMaxT 0.046 | VMaxA 0.044 | AAT-0.053 | VAT-0.041 |
| VMinT-0.046 | ATD-0.054 | VAT -0.058 |
| AGP-0.023 | VGP -0.018 |
|
| AP 0.025 | VP 0.010 | AMWS-0.046 | VMVS -0.062 |
| AAWS-0.012 | VAWS-0.012 | ADWA 0.048 | VWDA 0.024 |
| VST-0.042 | AWR-0.048 | VWL -0.027 |
Figure 2Overall flowchart of the project. The whole project flowchart can be divided into 3 parts: data analysis, correlation analysis, and construction of graph structure. The data analysis part shows the source and numerical distribution of the data; the correlation analysis part gives the relationship between the suitability evaluation indicators and the climate, environment, and crop phenotype data; the graph construction part uses each piece of data as a node to construct a graph and input it into GNN.
Figure 3Graph neural network framework. The network consists of an input layer, 4 hidden layers, and an output layer, and the ReLU activation function is used in the middle to increase the nonlinear fitting ability.
Comparison of the original performance of the model and the performance after removing the relative change of yield (RCY) index.
| KNN | LR | SVM | NB | RF | DT | MLP | RBFNN | GAT | GCN | |
|---|---|---|---|---|---|---|---|---|---|---|
| All data | 0.647 | 0.676 | 0.671 | 0.567 | 0.658 | 0.553 | 0.684 | 0.681 | 0.731 |
|
| No RCY | 0.588 | 0.571 | 0.580 | 0.552 | 0.580 | 0.551 | 0.615 | 0.581 | 0.698 |
|