| Literature DB >> 35937316 |
Ying Yuan1,2, Xuefeng Wang1,2, Mengmeng Shi1,2, Peng Wang1,2.
Abstract
Reasonable cultivation is an important part of the protection work of endangered species. The timely and nondestructive monitoring of chlorophyll can provide a basis for the accurate management and intelligent development of cultivation. The image analysis method has been applied in the nutrient estimation of many economic crops, but information on endangered tree species is seldom reported. Moreover, shade control, as the common seedling management measure, has a significant impact on chlorophyll, but shade levels are rarely discussed in chlorophyll estimation and are used as variables to improve model accuracy. In this study, 2-year-old seedlings of tropical and endangered Hopea hainanensis were taken as the research object, and the SPAD value was used to represent the relative chlorophyll content. Based on the performance comparison of RGB and multispectral (MS) images using different algorithms, a low-cost SPAD estimation method combined with a machine learning algorithm that is adaptable to different shade conditions was proposed. The SPAD values changed significantly at different shade levels (p < 0.01), and 50% shade in the orthographic direction was conducive to chlorophyll accumulation in seedling leaves. The coefficient of determination (R 2), root mean square error (RMSE), and average absolute percent error (MAPE) were used as indicators, and the models with dummy variables or random effects of shade greatly improved the goodness of fit, allowing better adaption to monitoring under different shade conditions. Most of the RGB and MS vegetation indices (VIs) were significantly correlated with the SPAD values, but some VIs exhibited multicollinearity (variance inflation factor (VIF) > 10). Among RGB VIs, RGRI had the strongest correlation, but multiple VIs filtered by the Lasso algorithm had a stronger ability to interpret the SPAD data, and there was no multicollinearity (VIF < 10). A comparison of the use of multiple VIs to estimate SPAD indicated that Random forest (RF) had the highest fitting ability, followed by Support vector regression (SVR), linear mixed effect model (LMM), and ordinary least squares regression (OLR). In addition, the performance of MS VIs was superior to that of RGB VIs. The R 2 of the optimal model reached 0.9389 for the modeling samples and 0.8013 for the test samples. These findings reinforce the effectiveness of using VIs to estimate the SPAD value of H. hainanensis under different shade conditions based on machine learning and provide a reference for the selection of image data sources.Entities:
Keywords: Hopea hainanensis; chlorophyll; machine learning; shade; vegetation indices
Year: 2022 PMID: 35937316 PMCID: PMC9355326 DOI: 10.3389/fpls.2022.928953
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Figure 1Location of study area.
Specific parameters of the spectral sensor.
| Spectral band | Center wavelength (nm) | Bandwidth FWHM (nm) |
|---|---|---|
| Blue | 475 | 20 |
| Green | 460 | 20 |
| Red | 668 | 10 |
| Near IR | 840 | 40 |
| Red Edge | 717 | 10 |
Figure 2Flow chart of data measurement and processing. (A) Hopea hainanensis seedling. (B) Portable plant nutrition meter (TYS-4N). (C) Digital camera (Cannon EOS 4000D). (D) MS camera (MicaSense Edge 3). (E) Combination device of digital camera and MS camera. (C1) Region of interest extracted from RGB image. (C2) Gray image based on ExG conversion. (C3) Binary image after Kapur segmentation. (C4) Mask result of C3–C1. (D1) Region of interest extracted from MS image. (D2) Gray image based on NDVI conversion. (D3) Binary image after Kapur segmentation. (D4) Mask result of D3–D1.
The information of RGB VIs.
| Index | Abbreviation | Formula | References |
|---|---|---|---|
| Excess green | ExG | 2 * G − R − B |
|
| Excess green minus excess red | ExGR | ExG − (1.4 * R − G) |
|
| Normalized green–red difference index | NGRDI | (G − R)/(G + R) |
|
| Normalized blue–red difference index | NGBDI | (G − B)/(G + B) |
|
| Red green ratio index | RGRI | R/G |
|
| Green blue ratio index | GBRI | B/G |
|
| Color index of vegetation extraction | CIVE | 0.441 * R − 0.811 * G + 0.385 * B + 18.78745 |
|
| Vegetative index | VEG | G/(RA * B(1−A)), a = 0.667 |
|
| Red green blue vegetation indices | RGBVI | (G2 – B * R2)/(G2 + B*R2) |
|
| Modified green red vegetation indices | MGRVI | (G2 − R2)/(G2 + R2) |
|
The information of MS VIs.
| Index | Abbreviation | Formula | References |
|---|---|---|---|
| Normalized difference vegetation indices | NDVI | (NIR − Red)/(NIR + Red) |
|
| Ratio vegetation indices | RVI | NIR/Red |
|
| Difference vegetation indices | DVI | NIR − Red |
|
| Enhanced vegetation indices | EVI | 2.5 * (NIR − Red)/(NIR + 6 * Red − 7.5 * Blue + 1) |
|
| Renormalized difference vegetation indices | RDVI | (NDVI/DVI)0.5 |
|
| Red-edge vegetation indices | REVI | NIR/RE − 1 |
|
| Normalized difference red-edge index | NDRE | (NIR – RE)/(NIR + RE) |
|
| Red-edge ratio vegetation indices | RERVI | NIR/RE |
|
| Red-edge difference vegetation indices | REDVI | NIR − RE |
|
| Simplified canopy chlorophyll content index | sCCCI | NDRE/NDVI |
|
Figure 3Chlorophyll content (SPAD) data distribution of seedlings under different shade levels. ***p < 0.001.
VIF and correlation with SPAD in RGB VIs.
| RGB VIs | Mean ± standard deviation |
| VIF | |
|---|---|---|---|---|
| ExG | 0.69 ± 0.08 | 0.1859 | 0.0228 | 3.94E+04 |
| ExGR | 0.57 ± 0.10 | 0.4106 | 0.0000 | 820.7527 |
| NGRDI | 0.21 ± 0.04 | 0.4256 | 0.0000 | 431.3603 |
| NGBDI | 0.50 ± 0.08 | −0.2469 | 0.0023 | 305.4540 |
| RGRI | 1.01 ± 0.06 | −0.4371 | 0.0000 | 65.9226 |
| GBRI | 0.91 ± 0.10 | 0.2322 | 0.0042 | 15.0777 |
| CIVE | 18.70 ± 0.03 | −0.2213 | 0.0065 | 4.85E+04 |
| VEG | 1.57 ± 0.07 | 0.1162 | 0.1568 | 19.7474 |
| RGBVI | 0.52 ± 0.06 | −0.0781 | 0.3420 | 63.4681 |
| MGRVI | 0.41 ± 0.07 | 0.4297 | 0.0000 | 580.1231 |
p < 0.01;
p < 0.001.
VIF and correlation with SPAD in MS VIs.
| MS VIs | Mean ± standard deviation |
| VIF | |
|---|---|---|---|---|
| NDVI | 0.96 ± 0.07 | 0.3458 | 0.0000 | 1.6832 |
| RVI | 5.07 ± 0.76 | 0.3045 | 0.0002 | 0.1818 |
| DVI | 0.64 ± 0.11 | 0.3551 | 0.0000 | 1.7496 |
| EVI | 1.28 ± 0.17 | 0.2617 | 0.0012 | 0.0373 |
| RDVI | 2.01 ± 0.16 | −0.3511 | 0.0000 | 0.9178 |
| REVI | 1.08 ± 0.23 | 0.2418 | 0.0029 | −1.19E+13 |
| NDRE | 0.89 ± 0.13 | 0.3738 | 0.0000 | 13.5474 |
| RERVI | 2.08 ± 0.23 | 0.2418 | 0.0029 | −1.19E+13 |
| REDVI | 0.32 ± 0.09 | 0.3696 | 0.0000 | 0.1521 |
| sCCCI | 0.92 ± 0.11 | 0.3579 | 0.0000 | 4.6747 |
p < 0.001.
Figure 4λ-value iterative process of Lasso algorithm with error bar. (A) RGB-based Lasso. (B) MS-based Lasso.
Selection results of VIs using Lasso algorithm.
| VIs |
| VIF | ||
|---|---|---|---|---|
| RGB VIs | ExG | 0.5396 | 0.0000 | 0.2563 |
| ExGR | 0.6378 | |||
| NGRDI | 0.3589 | |||
| NGBDI | 0.0362 | |||
| MS VIs | NDVI | 0.4835 | 0.0000 | 0.4331 |
| EVI | 0.0305 | |||
| RDVI | 0.3752 | |||
| REVI | 0.0986 | |||
| REDVI | 0.1148 | |||
p < 0.001.
Figure 5Estimated residuals of modeling samples using different images and algorithms without shade variables. (A) RGB-based OLR. (B) RGB-based RF. (C) RGB-based SVR. (D) MS-based OLR. (E) MS-based RF. (F) MS-based SVR.
Evaluation of the OLR, RF, and SVR models without shade variables.
| Model | RGB VIs | MS VIs | ||||
|---|---|---|---|---|---|---|
|
| RMSE (SPAD) | MAPE |
| RMSE (SPAD) | MAPE | |
| OLR | 0.2912 | 5.8242 | 20.97% | 0.2337 | 6.0558 | 22.57% |
| RF | 0.4108 | 5.4523 | 20.18% | 0.3007 | 5.8475 | 21.99% |
| SVR | 0.4899 | 4.9465 | 15.56% | 0.5310 | 4.7830 | 15.35% |
Figure 6Estimated residuals of modeling samples using different images and algorithms with shade variables. (A) RGB-based LMM. (B) RGB-based OLR. (C) RGB-based RF. (D) RGB-based SVR. (E) MS-based LMM. (F) MS-based OLR. (G) MS-based RF. (H) MS-based SVR.
Evaluation of the LMM, OLR, RF, and SVR models with shade variables.
| Model | RGB VIs | MS VIs | ||||
|---|---|---|---|---|---|---|
|
| RMSE (SPAD) | MAPE |
| RMSE (SPAD) | MAPE | |
| LMM | 0.7231 | 3.6410 | 12.24% | 0.7278 | 3.6099 | 11.90% |
| OLR | 0.7138 | 3.7008 | 12.39% | 0.7168 | 3.6817 | 12.21% |
| RF | 0.9273 | 1.9844 | 6.73% | 0.9389 | 1.8109 | 6.10% |
| SVR | 0.8651 | 2.6019 | 7.53% | 0.8595 | 2.6364 | 8.16% |
Evaluation of RF and SVR models for testing samples.
| Model | RGB VIs | MS VIs | |||||
|---|---|---|---|---|---|---|---|
|
| RMSE (SPAD) | MAPE |
| RMSE (SPAD) | MAPE | ||
| Without shade variables | RF | 0.3774 | 5.8646 | 19.30% | 0.3173 | 5.8545 | 19.11% |
| SVR | 0.3059 | 6.0529 | 19.18% | 0.1300 | 6.5385 | 20.61% | |
| With shade variables | RF | 0.7854 | 3.1582 | 9.38% | 0.8013 | 2.9616 | 9.04% |
| SVR | 0.5769 | 4.3624 | 13.49% | 0.6771 | 3.7816 | 10.86% | |