| Literature DB >> 35893629 |
Dania Batool1, Muhammad Shahbaz1, Hafiz Shahzad Asif2, Kamran Shaukat3,4, Talha Mahboob Alam5, Ibrahim A Hameed6, Zeeshan Ramzan2, Abdul Waheed7, Hanan Aljuaid8, Suhuai Luo3.
Abstract
Tea (Camellia sinensis L.) is one of the most highly consumed beverages globally after water. Several countries import large quantities of tea from other countries to meet domestic needs. Therefore, accurate and timely prediction of tea yield is critical. The previous studies used statistical, deep learning, and machine learning techniques for tea yield prediction, but crop simulation models have not yet been used. However, the calibration of a simulation model for tea yield prediction and the comparison of these approaches is needed regarding the different data types. This research study aims to provide a comparative study of the methods for tea yield prediction using the Food and Agriculture Organization (FAO) of the United Nations AquaCrop simulation model and machine learning techniques. We employed weather, soil, crop, and agro-management data from 2016 to 2019 acquired from tea fields of the National Tea and High-Value Crop Research Institute (NTHRI), Pakistan, to calibrate the AquaCrop simulation model and to train regression algorithms. We achieved a mean absolute error (MAE) of 0.45 t/ha, a mean squared error (MSE) of 0.23 t/ha, and a root mean square error (RMSE) of 0.48 t/ha in the calibration of the AquaCrop model and, out of the ten regression models, we achieved the lowest MAE of 0.093 t/ha, MSE of 0.015 t/ha, and RMSE of 0.120 t/ha using 10-fold cross-validation and MAE of 0.123 t/ha, MSE of 0.024 t/ha, and RMSE of 0.154 t/ha using the XGBoost regressor with train test split. We concluded that the machine learning regression algorithm performed better in yield prediction using fewer data than the simulation model. This study provides a technique to improve tea yield prediction by combining different data sources using a crop simulation model and machine learning algorithms.Entities:
Keywords: AquaCrop; crop simulation models; crop yield prediction; machine learning; tea yield
Year: 2022 PMID: 35893629 PMCID: PMC9332224 DOI: 10.3390/plants11151925
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Crop yield estimation methods.
Summary of related work for different crops.
| Paper | Data | Techniques | Crop | Performance |
|---|---|---|---|---|
| [ | Weather and crop management | Agroecological Zone simulation model & DM (RF, SVM & GBM) | Sugarcane | |
| [ | Climate and satellite | SVM, RF, ANN | Wheat | R2 = 0.75 |
| [ | Environment and management | A hybrid approach using CNN and RNN | Corn and soybean | |
| [ | Agro-ecological data | AquaCrop | Amaranthus | |
| [ | Soil, weather, crop, and agro-management | AquaCrop | Coffee | R2 ≈ 0.71 |
| [ | Climate, crop, and soil data | Developed a simulation model | Tea | R2 > 0.58 |
| [ | Weather data | Multiple linear models | Tea | Accuracy = 70% |
Figure 2(a) Satellite view of Shinkiari, (b) satellite view of NTHRI.
Weather input data for calibration.
| Year | Avg Min | Avg Max | Avg Humidity (%) | Avg Rainfall (mm) | Solar Radiation (MJ/m2) |
|---|---|---|---|---|---|
| 2016 | 17.4 | 32.4 | 28.5 | 92.03 | 19.06 |
| 2017 | 16.7 | 32.8 | 33.8 | 167 | 19.30 |
| 2018 | 17 | 32.5 | 32.64 | 149.7 | 19.22 |
| 2019 | 16.8 | 31.2 | 55.2 | 112 | 19.45 |
Soil input data for calibration.
| Soil Characteristic | Value |
|---|---|
| Soil type | Clay loam |
| Soil depth | 2.0 m |
| Soil water content at soil saturation (θSAT) | 50 vol % |
| Soil water content at field capacity (θFC) | 39 vol % |
| Soil water content at permanent wilting point (θPWP) | 23 vol % |
| Total available soil water (TAW) | 160 mm/m |
| Saturated hydraulic conductivity | 500 mm/day |
| tau (τ) drainage coefficient | 0.76 |
Crop input data for calibration.
| Symbol | Description | Value |
|---|---|---|
| Planting Density (plants/ha.) | 12,000 | |
| CCin | Initial canopy cover after pruning | 20% |
| CCx | Maximum canopy cover | 95% |
| CGC | Canopy growth coefficient | |
| Zx | Maximum effective rooting depth (m) | 2 m |
| Zn | Minimum effective rooting depth (m) | 2 m |
| WP* | ET water productivity | 0.18, 0.18, 0.20, 0.17 |
|
| Reference harvest index | 14% |
| Time between pruning | 5 years |
Figure 3Methodology for the comparative study of tea yield prediction.
Figure 4Observed vs. simulated yield for the years 2016–2019.
Figure 5Simulation errors for the years 2016–2019.
Figure 6Errors in training and testing of regression algorithms.
Figure 7Scatter plots showing actual and predicted yields of all regression algorithms.
Figure 8Errors in 10-fold cross-validation of regression algorithms.
An overview of the test data.
| Min. Air Temperature | Max. Air Temperature | Average Humidity | Rainfall | Tea Type | Actual Yield (Kg) | Predicted Yield (Kg) |
|---|---|---|---|---|---|---|
| 15.9 | 29.3 | 54.5 | 39 | Black | 275 | 255.968,9 |
| 12.6 | 34.1 | 47 | 39 | Black | 316 | 323.608,1 |
| 12.6 | 34.1 | 47 | 39.00 | Green | 186 | 186.335,9 |
| 12.7 | 33.7 | 42 | 39.00 | Green | 203 | 184.490,1 |
| 12.6 | 33.9 | 39 | 39.00 | Green | 165 | 173.613,7 |
| 16.6 | 36 | 64 | 35.50 | Green | 102 | 99.674,55 |
| 20 | 26 | 78.5 | 203.30 | Green | 120 | 133.068,1 |
| 19.9 | 33.3 | 62.5 | 170.00 | Green | 143 | 157.643,5 |
| 17.2 | 33.6 | 52.5 | 170.00 | Green | 164 | 152.560,5 |
Figure 9Difference between simulation errors and ML errors.