| Literature DB >> 34804091 |
Qiyu Zhou1, Douglas J Soldat1.
Abstract
Nitrogen is the most limiting nutrient for turfgrass growth. Instead of pursuing the maximum yield, most turfgrass managers use nitrogen (N) to maintain a sub-maximal growth rate. Few tools or soil tests exist to help managers guide N fertilizer decisions. Turf growth prediction models have the potential to be useful, but the currently existing turf growth prediction model only takes temperature into account, limiting its accuracy. This study developed machine-learning-based turf growth models using the random forest (RF) algorithm to estimate short-term turfgrass clipping yield. To build the RF model, a large set of variables were extracted as predictors including the 7-day weather, traffic intensity, soil moisture content, N fertilization rate, and the normalized difference red edge (NDRE) vegetation index. In this study, the data were collected from two putting greens where the turfgrass received 0 to 1,800 round/week traffic rates, various irrigation rates to maintain the soil moisture content between 9 and 29%, and N fertilization rates of 0 to 17.5 kg ha-1 applied biweekly. The RF model agreed with the actual clipping yield collected from the experimental results. The temperature and relative humidity were the most important weather factors. Including NDRE improved the prediction accuracy of the model. The highest coefficient of determination (R2) of the RF model was 0.64 for the training dataset and was 0.47 for the testing data set upon the evaluation of the model. This represented a large improvement over the existing growth prediction model (R 2 = 0.01). However, the machine-learning models created were not able to accurately predict the clipping production at other locations. Individual golf courses can create customized growth prediction models using clipping volume to eliminate the deviation caused by temporal and spatial variability. Overall, this study demonstrated the feasibility of creating machine-learning-based yield prediction models that may be able to guide N fertilization decisions on golf course putting greens and presumably other turfgrass areas.Entities:
Keywords: machine learning; nitrogen management; random forest; turfgrass; yield prediction
Year: 2021 PMID: 34804091 PMCID: PMC8600360 DOI: 10.3389/fpls.2021.749854
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Soil chemical properties of two putting green root zones used for creating or evaluating the growth prediction models.
| Depth (cm) | SOM | P | K | Ca | Mg | CEC | pH | |
|
| ||||||||
| (mg kg–1) | ||||||||
| Research green 1 | 0 – 5 | 1.23 | 64.2 | 91.6 | 1210 | 295 | 8 | 7.5 |
| 5 – 10 | 0.55 | 17.0 | 25.5 | 579 | 144 | 4 | 7.3 | |
| Research green 2 | 0 – 5 | 0.67 | 25.9 | 40.7 | 487 | 133 | 3 | 7.7 |
| 5 – 10 | 0.51 | 24.1 | 17.2 | 430 | 102 | 3 | 7.5 | |
Variables used in the random forest (RF) models.
| RF variables | Variables names |
| Biweekly N rate | N rate (kg/ha/2wk) |
| Daily NDRE | NDRE |
| Three-day average soil moisture content | Moist avg (3 days) |
| Weekly walking traffic | Traffic (round/week) |
| Accumulative days turf grows | Days grow |
| Research green soil | Rootzone |
| Maximum, minimum, and average temperature/relative humidity on the clipping collection day | Tmax; RHmax Tmin; RHmin Tavg; RHmin |
| Maximum, minimum, and average temperature/relative humidity of x (1,2,3,4,5,6) days before the clipping collection day | Tmax (pre x days) Tmin (pre x days) Tavg (pre x days) RHmax (pre x days) RHmin (pre x days) RHavg (pre x days) |
| Accumulative maximum, minimum, and average temperature/relative humidity of x (2,3,4,5,6,7) days | Tmax (x days accu) Tmin (x days accu) Tavg (x days accu) RHmax (x days accu) RHmin (x days accu) RHavg (x days accu) |
| Accumulative precipitation/evapotranspiration of x (2,3,4,5,6,7) days | Precip (x days accu) ET (x days accu) |
| Accumulative difference between precipitation and evapotranspiration of x (2.3.4.5.6.7) days | Precip-ET (x days accu) |
| Average wind speed of x (1,2,3,4,5,6) days before the clipping collection day | Wind avg (pre x day) |
FIGURE 1Flow chart of the four-fold cross-validation.
Comparison of the performance of five machine learning models for training data set (n = 1897).
| Machine learning method | R2 with SD | RMSE with SD |
| Random forest (RF) | 0.64 (0.08) | 0.339 (0.06) |
| Extreme gradient boosting | 0.57 (0.15) | 0.366 (0.07) |
| Gradient boosting model | 0.43 (0.13) | 0.422 (0.07) |
| Decision tree | 0.36 (0.17) | 0.450 (0.09) |
| Support vector regression | −0.15 (0.15) | 0.604 (0.08) |
Full variable inputs were used when developing models.
Model performance on the training and validation datasets of the complete RF model, simplified RF model, and weather-only RF model.
| Variables input | Training RMSE | Training R2 with SD | Evaluation RMSE | Evaluation R2 | |
| Complete RF model | N fertilization Traffic intensity Categorized root zone Weather Days grow NDRE | 0.339 (0.06) | 0.64 (0.08) | 0.489 | 0.47 |
| Simplified RF model | N fertilization Traffic intensity Categorized root zone Weather Days grow | 0.367 (0.06) | 0.57 (0.09) | 0.515 | 0.42 |
| Weather-only RF model | Weather Days grow | 0.406 (0.09) | 0.46 (0.20) | 0.567 | 0.30 |
| PACE Turf GP model | Temperature | N/A | 0.01 | N/A | N/A |
FIGURE 2Scatter plot of the model performances with (A) complete random forest (RF) model with all variables inputs on the training dataset; (B) complete RF model with all variables inputs on the validation dataset; (C) simplified RF model with the historical nitrogen (N) rate record, traffic intensity, and weather data on the training dataset; (D) simplified RF model with the historical N rate record, traffic intensity, and weather data on the validation dataset; (E) simplified RF model with only weather data input on the training dataset; (F) simplified RF model with only the weather data input on the validation dataset; (G) PACE Turf GP model.
FIGURE 3Top important variables for the (A) complete RF model; (B) simplified RF model with N rates, traffic intensities, and weather inputs; (C) weather-only RF model with all the weather variable inputs.
FIGURE 4A decision tree with three depths. The nodes with dark colors have a higher estimated clipping yield.
FIGURE 5The partial dependence of creeping bentgrass clipping production on the N application rate (A); average 3-day soil moisture content (B); the average temperature on the previous 3 days of clipping collection event (C); accumulative days that turf grew between mowing (D); normalized difference red edge (NDRE) (E); average relative humidity on the previous 4 days of clipping collection event (F).
FIGURE 6Scatter plot with (A) random forest (RF) model performance from the MN golf course that was built with on-site clipping data; (B) RF model built with clipping data collected from Madison, Wisconsin, USA; (C) PACE Turf GP model.