| Literature DB >> 34069181 |
Martín Solís1, Vanessa Rojas-Herrera2.
Abstract
The prediction of leaf wetness duration (LWD) is an issue of interest for disease prevention in coffee plantations, forests, and other crops. This study analyzed different LWD prediction approaches using machine learning and meteorological and temporal variables as the models' input. The information was collected through meteorological stations placed in coffee plantations in six different regions of Costa Rica, and the leaf wetness duration was measured by sensors installed in the same regions. The best prediction models had a mean absolute error of around 60 min per day. Our results demonstrate that for LWD modeling, it is not convenient to aggregate records at a daily level. The model performance was better when the records were collected at intervals of 15 min instead of 30 min.Entities:
Keywords: Leaf wetness duration; coffee leaf; machine learning
Year: 2021 PMID: 34069181 PMCID: PMC8161455 DOI: 10.3390/biomimetics6020029
Source DB: PubMed Journal: Biomimetics (Basel) ISSN: 2313-7673
Figure 1Sensor used by ICAFE and sensor from another brand. Note: Images taken from [8,9].
Figure 2Costa Rica. Regional locations of the meteorological stations and sensors.
Mean and standard deviation of the variables.
| Variables | Unit | 1. Barva | 2. San Vito | 3. San Lor.. | 4. Naranjo | 5. San Ped.. | 6. Páramo | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| m | std | m | std | m | std | m | std | m | std | m | std | ||
| Temp. out_station | °C | 21 | 4 | 23 | 3 | 19 | 3 | 22 | 4 | 23 | 3 | 20 | 3 |
| High temperature | °C | 21 | 4 | 23 | 3 | 19 | 3 | 22 | 4 | 23 | 3 | 20 | 3 |
| Low temperature | °C | 21 | 4 | 23 | 3 | 18 | 3 | 22 | 4 | 23 | 3 | 20 | 3 |
| Temp. in station | °C | 24 | 2 | 27 | 4 | 24 | 2 | 26 | 1 | 26 | 3 | 24 | 5 |
| Humidity out_station | % | 80 | 14 | 89 | 10 | 86 | 14 | 77 | 13 | 85 | 8 | 91 | 8 |
| Humidity in_station | % | 55 | 9 | 60 | 9 | 64 | 7 | 58 | 8 | 65 | 8 | 65 | 9 |
| Solar radiation | W/m² | 193 | 294 | 172 | 272 | 196 | 298 | 189 | 278 | 141 | 235 | 145 | 233 |
| High solar rad | W/m² | 228 | 338 | 218 | 337 | 233 | 346 | 252 | 355 | 208 | 328 | 207 | 323 |
| Wind speed | km/h | 1 | 2 | 1 | 2 | 2 | 4 | 2 | 2 | 1 | 1 | 2 | 3 |
| High speed | km/h | 8 | 8 | 4 | 5 | 7 | 9 | 7 | 7 | 5 | 5 | 7 | 6 |
| Barometer | hPa | 782 | 1 | 757 | 2 | 758 | 2 | 755 | 37 | 741 | 27 | 760 | 1 |
| Rain | mm | 0.1 | 0.6 | 0.1 | 0.7 | 0.1 | 0.4 | 0.1 | 0.8 | 0.2 | 1.4 | 0.2 | 1.0 |
| Soil moisture | cB | 193 | 20 | 120 | 71 | 67 | 69 | 116 | 69 | 7 | 15 | 37 | 50 |
| Wet leaf (%) | % | 0.43 | 0.49 | 0.62 | 0.49 | 0.57 | 0.50 | 0.44 | 0.50 | 0.43 | 0.49 | 0.38 | 0.49 |
Note: m = mean, std = standard deviation. Temp. out_station = Average temperature outside the meteorological station within a given time interval (15 or 30 min); High temperature = Maximum temperature outside the meteorological station within a given time interval; Low temperature = Minimum temperature outside the meteorological station within a given time interval; Temp. in station = Average temperature inside the meteorological station within a given time interval; Humidity out_station = Average humidity outside the meteorological station within a given time interval; Humidity in_station = Average humidity inside the meteorological station within a given time interval; Solar radiation = Average solar radiation within a given time interval; High solar rad = Maximum solar radiation within a given time interval; Wind speed = Average wind speed within a given time interval; High speed = Maximum high speed within a given time interval; Barometer = Average air pressure within a given time interval; Rain = Rain within a given time interval; Soil moisture = Soil moisture within a given time interval; Wet leaf = Percentage of time intervals where the leaf wetness threshold was greater than zero.
Daily mean absolute error and root mean squared error in minutes, according to regions and approaches.
| Station | DMR | DOR | HOR | NMC | NOC | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| MAE | RMSE | MAE | RMSE | MAE | RMSE | MAE | RMSE | MAE | RMSE | |
| 15_min | ||||||||||
| 1 | 96 b | 139 | 96 b | 135 | 65 a | 91 | 65 a | 96 | 62 a | 90 |
| 2 | 82 b | 105 | 80 b | 106 | 54 a | 72 | 53 a | 75 | 57 a | 79 |
| 3 | 92 b | 123 | 91 b | 123 | 64 a | 98 | 64 a | 92 | 65 a | 93 |
| 30_min | ||||||||||
| 4 | 123 c | 175 | 124 c | 181 | 102 b | 140 | 96 a | 136 | 99 b | 146 |
| 5 | 125 b | 161 | 126 b | 162 | 95 a | 128 | 86 a | 124 | 88 a | 129 |
| 6 | 113 b | 145 | 119 b | 150 | 83 a | 107 | 81 a | 113 | 84 a | 111 |
Note: a = the smallest averages between approaches, according to the post hoc multiple paired t-test at 5% significance and Bonferroni correction; b = the second smallest averages; c = the third smallest averages.
Figure 3Residual distribution of the NOC approach for each region.
Daily mean absolute error in minutes with excluded regions in training.
| Test Sample | Train Sample | ||
|---|---|---|---|
| Without 1 | Without 2 | Without 3 | |
| 1 | 127 * | 61 | 63 |
| 2 | 54 | 135 * | 55 |
| 3 | 66 | 64 | 168 * |
|
|
|
| |
| 4 | 351 * | 97 | 96 |
| 5 | 87 | 387 * | 86 |
| 6 | 83 | 85 | 363 * |
* p < 0.05, difference between excluding and not excluding, after applying paired t-test for comparison of means.
Figure 4NOC feature importance for records collected each 15 min and records collected each 30 min, respectively.
Daily mean absolute error in minutes for NOC models with data reduction.
| Region | NOC_All Variables | NOC_1 | NOC_2 | NOC_3 |
|---|---|---|---|---|
| 15_min | ||||
| 1 | 63 | 67 * | 71 * | 69 * |
| 2 | 55 | 61 * | 63 * | 68 * |
| 3 | 66 | 72 * | 75 * | 79 * |
| 30_min | ||||
| 4 | 99 | 104 | 123 * | 132 * |
| 5 | 88 | 94 | 131 * | 137 * |
| 6 | 84 | 103 * | 101 * | 102 * |
NOC_all variables = model with all the variables, NOC_1 = excluded high temperature, low temperature, high speed, high solar radiation, rain, wind speed, and month; NOC_2 = excluded the same features as NOC_1 and humidity; NOC_3 = excluded the same features as NOC_2 and solar radiation. * p < 0.05, the difference between NOC reduced and NOC original after applying paired t-test for comparison of means.
Daily mean absolute error in minutes for the test sample of 2019 and 2020, using NOC models trained with different records.
| Region | NOC_All | NOC_>2017 | NOC_>2018 |
|---|---|---|---|
| 15_min | |||
| 1 | 57 | 56 | 55 |
| 2 | 45 | 46 | 43 |
| 3 | 65 | 64 | 64 |
| 30_min | |||
| 4 | 131 | 117 | 100 |
| 5 | 76 | 76 | 80 |
| 6 | 90 | 94 | 89 |
NOC_all = NOC model trained with all records; NOC_>2017 = NOC model trained with the years 2018, 2019, and 2020; NOC_>2018 = NOC model trained with the years 2019 and 2020.