| Literature DB >> 34178002 |
Aikaterini Kasimati1, Borja Espejo-Garcia1, Eleanna Vali1, Ioannis Malounas1, Spyros Fountas1.
Abstract
The most common method for determining wine grape quality characteristics is to perform sample-based laboratory analysis, which can be time-consuming and expensive. In this article, we investigate an alternative approach to predict wine grape quality characteristics by combining machine learning techniques and normalized difference vegetation index (NDVI) data collected at different growth stages with non-destructive methods, such as proximal and remote sensing, that are currently used in precision viticulture (PV). The study involved several sets of high-resolution multispectral data derived from four sources, including two vehicle-mounted crop reflectance sensors, unmanned aerial vehicle (UAV)-acquired data, and Sentinel-2 (S2) archived imagery to estimate grapevine canopy properties at different growth stages. Several data pre-processing techniques were employed, including data quality assessment, data interpolation onto a 100-cell grid (10 × 20 m), and data normalization. By calculating Pearson's correlation matrix between all variables, initial descriptive statistical analysis was carried out to investigate the relationships between NDVI data from all proximal and remote sensors and the grape quality characteristics in all growth stages. The transformed dataset was then ready and applied to statistical and machine learning algorithms, firstly trained on the data distribution available and then validated and tested, using linear and nonlinear regression models, including ordinary least square (OLS), Theil-Sen, and the Huber regression models and Ensemble Methods based on Decision Trees. Proximal sensors performed better in wine grapes quality parameters prediction in the early season, while remote sensors during later growth stages. The strongest correlations with the sugar content were observed for NDVI data collected with the UAV, Spectrosense+GPS (SS), and the CropCircle (CC), during Berries pea-sized and the Veraison stage, mid-late season with full canopy growth, for both years. UAV and SS data proved to be more accurate in predicting the sugars out of all wine grape quality characteristics, especially during a mid-late season with full canopy growth, in Berries pea-sized and the Veraison growth stages. The best-fitted regressions presented a maximum coefficient of determination (R 2) of 0.61.Entities:
Keywords: correlation; ensemble methods; linear regression; normalized difference vegetation index; precision viticulture; quality prediction; remote sensing; wine grape quality
Year: 2021 PMID: 34178002 PMCID: PMC8226266 DOI: 10.3389/fpls.2021.683078
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Two vehicle-mounted crop reflectance sensors, UAV-acquired data, and Sentinel-2 archived imagery were used to estimate grapevine canopy properties. UAV, unmanned aerial vehicle.
Grapevine seasonal EL growth stages of proximal and remote sensing data acquisition.
| Stage | EL No | Description | Date range | Data acquisition date |
|---|---|---|---|---|
| Flowering | 23 | 16–20 leaves separated; 50% caps off | June 1–June 20 | CC; SS; UAV: 040619, 050620 |
| Setting | 27 | Young berries enlarging, bunch at right angles to stem | June 20–July 20 | CC; SS; UAV: 040719, 140720 |
| Berries pea-sized | 31 | About 7 mm in diameter | July 20–Aug 15 | CC; SS; UAV: 010819, 110820 |
| Veraison | 35 | Berries begin to color and enlarge | Aug 15–Sept 10 | CC; SS; UAV: 280819, 260820 |
CC, CropCircle; SS, Spectrosense+GPS; UAV, unmanned aerial vehicle; and S2, Sentinel-2.
Figure 2Satellite images of the wine grapes commercial vineyard and the 100-cell grid developed parallel to the trellis lines (Google Earth Pro, 2021).
Hyperparameters evaluated for optimizing the ensemble learning models.
| Hyperparameter | Description | Values |
|---|---|---|
| Number of trees | The number of trees used in the ensemble. High values could reduce overfitting. | 10, 50, 100, 200 |
| Max. depth | The maximum depth of the trees. Too high could lead to overfitting. | No maximum, 1, 5, 7 |
| Split criteria | The function to measure the quality of a node split. The calculation of the Gini Index is computationally faster, but it could lead to more poor results. | Information gain, Gini impurity |
Figure 3Workflow for investigating a selection of methods for predicting wine grape quality characteristics using normalized difference vegetation index (NDVI) data from proximal and remote sensing.
Selected best performed Pearson’s correlation coefficients comparisons between NDVI data from all four proximal and remote sensors and total soluble solids (i) for a given sensor (rows 1 and 2) and (ii) for given growth stages of the season (rows 3 and 4; CC, CropCircle; SS, Spectrosense+GPS; UAV; and S2, Sentinel-2).
| Per sensor | CropCircle | Spectrosense+GPS | UAV | Sentinel-2 |
|---|---|---|---|---|
| 2019 | 0.69 (Veraison) | 0.74 (Veraison) | 0.63 (Veraison) | 0.57 (Berries pea-sized) |
| 2020 | 0.54 (Setting) | 0.70 (Setting) | 0.79 (Veraison) | 0.33 (Veraison) |
| Per growth stage | Flowering | Setting | Berries pea-sized | Veraison |
| 2019 | 0.62 (UAV) | 0.58 (UAV) | 0.69 (SS) | 0.74 (SS) |
| 2020 | 0.76 (UAV) | 0.70 (SS) | 0.77 (UAV) | 0.79 (UAV) |
Figure 4Pearson’s correlation coefficients evolution throughout the growing seasons 2019 and 2020 (legend as for Table 3).
Selected best performed linear regression models performed using the highly correlated NDVI data from all four proximal and remote sensors to evaluate their performance in assessing the wine grapes quality characteristics (legend as for Table 3).
| Sensor_growth stage | Method | RMSE | ||
|---|---|---|---|---|
| 2019 | SS_Veraison | OLS | 0.51 ± 0.09 | 1.45 ± 0.19 |
| CC_Veraison | OLS | 0.42 ± 0.10 | 1.67 ± 0.35 | |
| SS_Berries pea-sized | Huber | 0.41 ± 0.11 | 1.71 ± 0.24 | |
| UAV_Veraison | Theil-Sen | 0.38 ± 0.10 | 1.95 ± 0.55 | |
| 2020 | UAV_Veraison | Theil-Sen | 0.61 ± 0.03 | 1.37 ± 0.19 |
| UAV_Berries pea-sized | Huber | 0.57 ± 0.04 | 1.55 ± 0.32 | |
| UAV_Flowering | Theil-Sen | 0.56 ± 0.06 | 1.75 ± 0.19 | |
| SS_Setting | OLS | 0.44 ± 0.07 | 1.73 ± 0.23 | |
| UAV_Setting | OLS | 0.44 ± 0.04 | 2.09 ± 0.45 |
Selected best performed nonlinear regression models performed using the highly correlated NDVI data from all four proximal and remote sensors to evaluate their performance in assessing the wine grapes quality characteristics (legend as for Table 3).
| Sensor_growth stage | Method | RMSE | ||
|---|---|---|---|---|
| 2019 | SS_Veraison | Extra Trees | 0.46 ± 0.03 | 1.64 ± 0.14 |
| CC_Veraison | Random Forest | 0.43 ± 0.5 | 1.68 ± 0.32 | |
| SS_Berries pea-sized | Random Forest | 0.43 ± 0.06 | 1.82 ± 0.29 | |
| UAV_Veraison | Extra Trees | 0.42 ± 0.05 | 2.05 ± 0.51 | |
| UAV_Flowering | Random Forest | 0.39 ± 0.07 | 2.11 ± 0.33 | |
| 2020 | UAV_Veraison | AdaBoost | 0.59 ± 0.05 | 1.41 ± 0.25 |
| UAV_Berries pea-sized | Extra Trees | 0.53 ± 0.03 | 1.65 ± 0.22 | |
| UAV_Flowering | Extra Trees | 0.56 ± 0.05 | 1.72 ± 0.29 | |
| SS_Setting | Extra Trees | 0.43 ± 0.02 | 1.83 ± 0.33 | |
| UAV_Setting | Extra Trees | 0.41 ± 0.06 | 1.95 ± 0.35 |