| Literature DB >> 32514286 |
Monica Herrero-Huerta1, Pablo Rodriguez-Gonzalvez2, Katy M Rainey1.
Abstract
BACKGROUND: Nowadays, automated phenotyping of plants is essential for precise and cost-effective improvement in the efficiency of crop genetics. In recent years, machine learning (ML) techniques have shown great success in the classification and modelling of crop parameters. In this research, we consider the capability of ML to perform grain yield prediction in soybeans by combining data from different optical sensors via RF (Random Forest) and XGBoost (eXtreme Gradient Boosting). During the 2018 growing season, a panel of 382 soybean recombinant inbred lines were evaluated in a yield trial at the Agronomy Center for Research and Education (ACRE) in West Lafayette (Indiana, USA). Images were acquired by the Parrot Sequoia Multispectral Sensor and the S.O.D.A. compact digital camera on board a senseFly eBee UAS (Unnamed Aircraft System) solution at R4 and early R5 growth stages. Next, a standard photogrammetric pipeline was carried out by SfM (Structure from Motion). Multispectral imagery serves to analyse the spectral response of the soybean end-member in 2D. In addition, RGB images were used to reconstruct the study area in 3D, evaluating the physiological growth dynamics per plot via height variations and crop volume estimations. As ground truth, destructive grain yield measurements were taken at the end of the growing season.Entities:
Keywords: High throughput phenotyping; Machine learning (ML); Point clouds; Soybean; Structure from Motion (SfM); Unmanned aircraft system (UAS); Yield
Year: 2020 PMID: 32514286 PMCID: PMC7268475 DOI: 10.1186/s13007-020-00620-6
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Technical specifications of the senseFly’s S.O.D.A. Digital Camera
| Parameter | Value |
|---|---|
| Optical sensor size | 116.2 mm2 |
| Image size | 5742*3648 pixels |
| Focal length | 10.6 mm |
| Pixel size | 3 µm |
Technical specifications of the Parrot Sequoia Multispectral sensor
| Parameter | Value |
|---|---|
| Spectral range | 350–2500 nm |
| Shooting time | 0.1 s |
| Spectral resolution | 1 nm |
| Field of view | 25º |
| Pixel size | 3.75 µm |
| Focal length | 3.98 mm |
| Image size | 1280*960 pixels |
Fig. 2Colour composite MSI mapping over the study area on July 23rd 2018 (DAP 61) (DL (500047.3, 4480849.5); UR (500364.0, 4480968.0); EPSG 32616) a, a detail of field-map based plot extraction b and Triple S software run over a random plot for July 23rd, 2018 (DAP 61) c
Fig. 1Workflow
VI used as inputs from the model
| VI | Equation | Proposed by |
|---|---|---|
| NDVI | (NIR-R)/(NIR + R) | [ |
| SAVI | (1 + L)*(NIR-R)/(NIR + R+L) | [ |
| MSAVI | (2*NIR + 1-((2*NIR + 1)2− 8*(NIR-R)*(NIR-R))0.5)/2 | [ |
| GESAVI | (NIR-a)*(R-b)/(R + z) | [ |
| CIre | (NIR/RE)-1 | [ |
| CIg | (NIR-G)-1 | [ |
| VARI | (G-R)/(G + R) | [ |
| RVI | (NIR/R) | [ |
| DVI | (NIR-R) | [ |
| RDVI | (NIR-R)/(NIR + R)0.5 | [ |
| TVI | 0.5*(120*(NIR-G)-200*(R-G)) | [ |
Statistics of canopy cover and soybean reflectance by band of soybean class per plot from MSI analysis at DAP 61 and 70: mean, standard deviation (Std), median, normalized median absolute deviation (NMAD), square root of the biweight midvariance (BwMv), percentiles at 2.5% (P2.5%), 25% (Q25%), 75% (Q75%) and 97.5% (P97.5%), interquartile range (IQR) and interpercentile range at 90% (IPR90%) and 99% (IPR99%) confidence interval
| Parameter (%) | Mean | Std | Median | NMAD | BwMv | P2.5% | Q25% | Q75% | P97.5% | IQR | IPR90% | IPR99% | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7/23/2018 (DAP 61) | Canopy cover | 79.54 | 20.29 | 85.45 | 6.66 | 7.60 | 3.37 | 80.09 | 89.47 | 98.32 | 9.37 | 70.54 | 99.85 |
| green | 6.21 | 0.72 | 6.19 | 0.79 | 0.73 | 4.93 | 5.65 | 6.71 | 7.60 | 1.06 | 2.34 | 3.31 | |
| Red | 2.53 | 0.27 | 2.51 | 0.23 | 0.26 | 2.02 | 2.37 | 2.68 | 3.08 | 0.31 | 0.88 | 1.51 | |
| Red edge | 31.84 | 2.61 | 32.03 | 2.26 | 2.44 | 26.29 | 30.42 | 33.45 | 36.91 | 3.02 | 8.25 | 16.49 | |
| Near infrared | 55.15 | 7.02 | 55.42 | 5.80 | 6.83 | 39.45 | 51.70 | 59.48 | 69.23 | 7.78 | 24.46 | 35.83 | |
| 8/01/2018 (DAP 70) | Canopy cover | 86.90 | 5.48 | 87.77 | 3.62 | 3.79 | 69.42 | 85.22 | 90.06 | 94.35 | 4.83 | 19.35 | 31.97 |
| green | 5.90 | 0.39 | 5.85 | 0.39 | 0.39 | 5.25 | 5.61 | 6.15 | 6.76 | 0.54 | 1.25 | 1.96 | |
| Red | 2.62 | 0.18 | 2.60 | 0.18 | 0.18 | 2.33 | 2.48 | 2.73 | 3.03 | 0.24 | 0.57 | 1.00 | |
| Red edge | 32.22 | 1.34 | 32.22 | 1.28 | 1.35 | 29.63 | 31.31 | 33.06 | 34.96 | 1.75 | 4.52 | 6.89 | |
| Near infrared | 55.59 | 2.27 | 55.58 | 2.19 | 2.22 | 50.99 | 54.12 | 57.08 | 60.08 | 2.96 | 7.59 | 13.14 |
Fig. 3Deviation point cloud over Soybean from July 23rd (DAP 61) using June 7th (DAP 15) as reference in metersa and canopy volume calculation of two random plots from the same date b
Statistics of CV and H max by band per plot from RGB analysis at DAP 61 and 70: mean, standard deviation (Std), median, normalized median absolute deviation (NMAD), square root of the biweight midvariance (BwMv), percentiles at 2.5% (P2.5%), 25% (Q25%), 75% (Q75%) and 97.5% (P97.5%), interquartile range (IQR) and interpercentile range at 90% (IPR90%) and 99% (IPR99%) confidence interval
| Parameter | Mean | Std. | Median | NMAD | BwMv | P 2.5% | Q 25% | Q 75% | P 97.5% | IQR | IPR 90% | IPR 99% | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7/23/2018 (DAP 61) | CV (dm3) | 1282.78 | 218.37 | 1253.85 | 199.04 | 215.52 | 917.37 | 1135.57 | 1413.23 | 1754.78 | 277.66 | 729.35 | 1119.80 |
| Hmax (cm) | 92.33 | 16.64 | 87.58 | 10.89 | 12.36 | 73.02 | 81.70 | 97.16 | 139.61 | 15.46 | 56.37 | 84.95 | |
| 19.65 | 4.98 | 19.18 | 4.96 | 4.77 | 11.78 | 15.91 | 22.57 | 31.18 | 6.66 | 16.03 | 27.06 | ||
| 19.14 | 4.84 | 18.66 | 4.63 | 4.61 | 11.44 | 15.54 | 21.73 | 30.78 | 6.20 | 15.66 | 25.57 | ||
| 8/01/2018 (DAP 70) | CV (dm3) | 1496.79 | 242.17 | 1487.83 | 219.58 | 233.32 | 1065.43 | 1327.34 | 1630.02 | 2079.86 | 302.69 | 819.91 | 1388.60 |
| Hmax (cm) | 107.57 | 16.95 | 104.02 | 11.76 | 13.62 | 84.53 | 96.76 | 113.45 | 154.90 | 16.70 | 55.02 | 88.09 | |
| 21.28 | 5.95 | 20.92 | 6.18 | 5.95 | 11.65 | 16.74 | 25.03 | 34.52 | 8.29 | 19.13 | 29.36 | ||
| 20.84 | 5.74 | 20.49 | 5.95 | 5.68 | 11.55 | 16.48 | 24.54 | 33.87 | 8.06 | 18.92 | 30.03 |
Error metrics of both models in (kg/ha) at 95% confidence interval evaluated in training and testing dataset: MBE (Mean Bias Error), AMBE (Absolute Mean Bias Error), RMSE (Root Mean Square Error), NMAD (normalized median absolute deviation), RE (Relative Error), AE (Absolute Error) and η (the Nash and Sutcliffe index)
| Dataset | Model | MBE | AMBE | RMSE | NMAD | RE | AE | η |
|---|---|---|---|---|---|---|---|---|
| Training | RF | 13.61 | 140.25 | 181.19 | 167.48 | 1.14% | 4.03% | 0.80 |
| XGBoost | 30.39 | 240.45 | 303.99 | 292.12 | 1.98% | 6.87% | 0.21 | |
| Testing | RF | − 4.17 | 325.33 | 410.24 | 384.62 | 1.37% | 9.06% | − 2.46 |
| XGBoost | − 7.15 | 306.76 | 394.66 | 353.04 | 1.18% | 8.55% | − 1.52 |
Fig. 4Features importance for more than 71% by RF a and XGBoost b
Fig. 5Prediction errors and actual Grain Yield (GY) sorted smallest to largest per plot along the training dataset a and testing dataset b (please note that the errors and GY are ploted in the primary and secondary axis, respectively); scatter plots of the measured against the predicted grain yield (kg*ha − 1) by RF and XGBoost from training c and testing dataset d. In both cases is drawn the line corresponding to the robust linear fit at 95% of confidence
Robust linear fit coefficient, R2 value, highest studentized residuals mad RMSE & NMAD values of the fitting
| Dataset | Model | a | b | R2 | Max studentized residual | RMSE | NMAD |
|---|---|---|---|---|---|---|---|
| Training | RF | 1.372 | − 1420.5 | 0.9728 | 1.95 | 94.06 | 102.40 |
| XGBoost | 1.429 | − 1638.8 | 0.7787 | 1.95 | 262.03 | 263.05 | |
| Testing | RF | 1.433 | − 1614.7 | 0.3828 | 1.88 | 399.87 | 360.08 |
| XGBoost | 1.290 | − 1069.1 | 0.4183 | 1.89 | 387.11 | 370.03 |
Fig. 6Errors from the testing dataset grouped by family: RF a and XGBoost b