| Literature DB >> 31737009 |
Brooke Bruning1, Huajian Liu1, Chris Brien1, Bettina Berger1, Megan Lewis2, Trevor Garnett1.
Abstract
Quantifying plant water content and nitrogen levels and determining water and nitrogen phenotypes is important for crop management and achieving optimal yield and quality. Hyperspectral methods have the potential to advance high throughput phenotyping efforts by providing a rapid, accurate, and nondestructive alternative for estimating biochemical and physiological plant traits. Our study (i) acquired hyperspectral images of wheat plants using a high throughput phenotyping system, (ii) developed regression models capable of predicting water and nitrogen levels of wheat plants, and (iii) applied the regression coefficients from the best-performing models to hyperspectral images in order to develop prediction maps to visualize nitrogen and water distribution within plants. Hyperspectral images were collected of four wheat (Triticum aestivum) genotypes grown in nine soil nutrient conditions and under two water treatments. Five multivariate regression methods in combination with 10 spectral preprocessing techniques were employed to find a model with strong predictive performance. Visible and near infrared wavelengths (VNIR: 400-1,000nm) alone were not sufficient to accurately predict water and nitrogen content (validation R2 = 0.56 and R2 = 0.59, respectively) but model accuracy was improved when shortwave-infrared wavelengths (SWIR: 1,000-2,500nm) were incorporated (validation R2 = 0.63 and R2 = 0.66, respectively). Wavelength reduction produced equivalent model accuracies while reducing model size and complexity (validation R2 = 0.69 and R2 = 0.66 for water and nitrogen, respectively). Developed distribution maps provided a visual representation of the concentration and distribution of water within plants while nitrogen maps seemed to suffer from noise. The findings and methods from this study demonstrate the high potential of high-throughput hyperspectral imagery for estimating and visualizing the distribution of plant chemical properties.Entities:
Keywords: PLSR; hyperspectral; nitrogen; plant phenotyping; water; wheat
Year: 2019 PMID: 31737009 PMCID: PMC6831646 DOI: 10.3389/fpls.2019.01380
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Nutrient levels of the different soils used in this study. Half of the full 24(= 16) combinations of the soil nutrient treatments were used in addition to a control treatment, where no nutrients were added.
| Soil | N (mg/kg) | P (mg/kg) | K (mg/kg) | secondary nutrients (g/150kg) |
|---|---|---|---|---|
| 1 | 0 | 0 | 0 | 0 |
| 2 | 25 | 15 | 20 | 5 |
| 3 | 100 | 15 | 20 | 10 |
| 4 | 25 | 40 | 20 | 10 |
| 5 | 100 | 40 | 20 | 5 |
| 6 | 25 | 15 | 60 | 10 |
| 7 | 100 | 15 | 60 | 5 |
| 8 | 25 | 40 | 60 | 5 |
| 9 | 100 | 40 | 60 | 10 |
Wavelengths known to be associated with water content in vegetation. A broad range (40 nm) centered on these values were included in the feature-reduced models to ensure that each feature was captured.
| Wavelength (nm) | Range Included (nm) | Assignation | Reference |
|---|---|---|---|
| 600 | 580–620 | O-H Hydrogen Bonding | |
| 680 | 660–700 | Electron transition |
|
| 810 | 790–830 | C-H |
|
| 820 | 800–840 | C-H |
|
| 860 | 840–880 | C-H |
|
| 900 | 880–920 | C-H |
|
| 970 | 950–990 | O-H bend |
|
| 1240 | 1,220–1,260 | C-H |
|
| 1530 | 1,510–1,550 | N-H secondary amines |
|
| 1550 | 1,530–1,570 | N-H secondary amines |
|
| 1720 | 1,700–1,740 | C-H |
|
| 1750 | 1,730–1,770 | C-H secondary overtones |
|
| 2080 | 2,060–2,100 | N-H and C-H, O-H stretch and deformation |
|
| 2350 | 2,330–2,370 | C-H combinations |
|
| 1400-1450 | 1,400–1,450 | O-H bend and stretch |
|
From Ecarnot et al. (2013). Wavelengths known to be associated with nitrogen content in vegetation. A broad range (40 nm) centered on these values were included in the feature-reduced models to ensure that each feature was captured.
| Wavelength (nm) | Range Included (nm) | Assignation |
|---|---|---|
| 460 | 440–480 | Electron transition, chlorophyll a,b |
| 530 | 510–550 | Electron transition, carotenoids |
| 670 | 650–690 | Electron transition, chlorophyll a,b |
| 1440 | 1,420–1,460 | O-H bend, first overtone, starch |
| 1500 | 1,480–1,520 | N-H stretch |
| 1680 | 1,660–1,700 | C-H stretch, aromatic |
| 1712 | 1,692–1,732 | C-H stretch, CH3 |
| 1770 | 1,750–1,790 | C-H stretch, CH2 |
| 1900 | 1,880–1,920 | O-H stretch, C = O, starch, CO2H |
| 1960 | 1,940–1,980 | N-H, CONH2 |
| 2080 | 2,060–2,100 | N-H stretch, proteins |
| 2115 | 2,095–2,135 | N-H stretch, CONH2, CONHR |
| 2140 | 2,120–2,160 | Amide, proteins |
| 2230 | 2,210–2,250 | N-H stretch, C = H stretch, amino acid |
| 2300 | 2,280–2,320 | N-H stretch, C = O stretch, amino acid |
| 2400 | 2,380–2,420 | CH2 bend, C-H deformation, cellulose |
Figure 1Mean spectra extracted from the vegetation pixels for each image using the FX10 visible and near infrared wavelength (VNIR) camera for wavelengths from 400–1,000nm and the shortwave-infrared wavelength (SWIR) camera for the 1,000–2,500nm wavelengths.
Performance results for the prediction of water content in wheat using VNIR (400–1,000 nm) spectra.
| Calibration | ||||||
|---|---|---|---|---|---|---|
| PLSR | PCR | MLR | RF | SVM | ||
| ASGD1 | R2 | 0.63 | 0.42 | 0.50 | 0.47 | 0.84 |
| RMSE | 2.92 | 3.61 | 3.42 | 3.46 | 1.96 | |
| RPD | 1.63 | 1.32 | 1.39 | 1.38 | 2.43 | |
| SMO | R2 | 0.64 | 0.60 | 0.61 | 0.44 | 0.67 |
| RMSE | 2.90 | 3.04 | 2.96 | 3.57 | 2.76 | |
| RPD | 1.64 | 1.57 | 1.61 | 1.33 | 1.73 | |
| Raw | R2 | 0.67 | 0.60 | 0.57 | 0.44 | 0.69 |
| RMSE | 2.79 | 3.04 | 3.13 | 3.55 | 2.70 | |
| RPD | 1.71 | 1.56 | 1.52 | 1.34 | 1.77 | |
| Validation | ||||||
| PLSR | PCR | MLR | RF | SVM | ||
| ASGD1 | R2 | 0.51 | 0.41 | 0.54 | 0.49 | 0.52 |
| RMSE | 3.01 | 3.30 | 2.94 | 3.06 | 2.96 | |
| RPD | 1.42 | 1.29 | 1.45 | 1.39 | 1.44 | |
| SMO | R2 | 0.56 | 0.54 | 0.62 | 0.47 | 0.55 |
| RMSE | 2.90 | 2.88 | 2.70 | 3.11 | 2.89 | |
| RPD | 1.47 | 1.48 | 1.58 | 1.37 | 1.48 | |
| Raw | R2 | 0.56 | 0.54 | 0.57 | 0.49 | 0.56 |
| RMSE | 2.85 | 2.88 | 2.82 | 3.05 | 2.86 | |
| RPD | 1.50 | 1.48 | 1.51 | 1.40 | 1.49 | |
R2, coefficient of determination, RMSE, root mean square error; RPD, ratio of performance to deviation. PLSR, partial least-squares regression; PCR, principal components regression; MLR, multiple linear regression; RF, random forest; SVM, support vector machine; SMO = smoothed.
Performance results (validation R2≥0.5) of trialled preprocessing and multivariate methods for the prediction of nitrogen in wheat.
| Calibration | ||||||
|---|---|---|---|---|---|---|
| PLSR | PCR | MLR | RF | SVM | ||
| SMO | R2 | 0.56 | 0.50 | 0.53 | 0.42 | 0.60 |
| RMSE | 0.42 | 0.45 | 0.43 | 0.48 | 0.40 | |
| RPD | 1.49 | 1.40 | 1.46 | 1.31 | 1.58 | |
| Raw | R2 | 0.59 | 0.49 | 0.48 | 0.42 | 0.62 |
| RMSE | 0.41 | 0.45 | 0.45 | 0.47 | 0.39 | |
| RPD | 1.53 | 1.38 | 1.39 | 1.32 | 1.60 | |
|
| ||||||
| PLSR | PCR | MLR | RF | SVM | ||
| SMO | R2 | 0.59 | 0.54 | 0.57 | 0.33 | 0.43 |
| RMSE | 0.41 | 0.44 | 0.42 | 0.52 | 0.48 | |
| RPD | 1.56 | 1.47 | 1.53 | 1.22 | 1.33 | |
| Raw | R2 | 0.57 | 0.52 | 0.58 | 0.33 | 0.43 |
| RMSE | 0.42 | 0.44 | 0.42 | 0.52 | 0.48 | |
| RPD | 1.53 | 1.44 | 1.54 | 1.23 | 1.33 | |
R2 = coefficient of determination, RMSE; root mean square error; RPD; ratio of performance to deviation. PLSR; partial least-squares regression; PCR; principal components regression; MLR; multiple linear regression; RF; random forest; SVM; support vector machine; SMO; smoothed.
Validation prediction accuracies for full-spectra (VNIR+SWIR: 400-2500nm) regression models for predicting water content and nitrogen.
| Water full-spectra (VNIR+SWIR) validation | ||||||
|---|---|---|---|---|---|---|
| PLSR | PCR | MLR | RF | SVM | ||
| ASGD1 | R2 | 0.56 | 0.59 | 0.56 |
|
|
| RMSE | 3.11 | 2.95 | 3.25 |
|
| |
| RPD | 1.48 | 1.56 | 1.41 |
|
| |
| ASGD2 | R2 | 0.58 | 0.50 | 0.34 | 0.59 | 0.55 |
| RMSE | 3.07 | 3.24 | 5.22 | 3.00 | 3.10 | |
| RPD | 1.50 | 1.42 | 0.88 | 1.53 | 1.48 | |
| EMSC | R2 | 0.57 | 0.59 | 0.54 | 0.55 | 0.59 |
| RMSE | 3.10 | 3.01 | 3.27 | 3.08 | 2.94 | |
| RPD | 1.49 | 1.53 | 1.41 | 1.49 | 1.57 | |
| MSC | R2 | 0.57 | 0.56 | 0.54 | 0.49 | 0.55 |
| RMSE | 3.09 | 3.10 | 3.19 | 3.37 | 3.09 | |
| RPD | 1.49 | 1.49 | 1.44 | 1.37 | 1.49 | |
| SGD1 | R2 | 0.58 | 0.58 | 0.54 |
| 0.59 |
| RMSE | 3.01 | 3.00 | 3.25 |
| 2.94 | |
| RPD | 1.53 | 1.54 | 1.42 |
| 1.57 | |
| SGD2 | R2 | 0.52 | 0.57 | 0.37 | 0.57 | 0.58 |
| RMSE | 3.19 | 3.03 | 4.42 | 3.07 | 3.00 | |
| RPD | 1.44 | 1.52 | 1.04 | 1.50 | 1.54 | |
| SNV | R2 | 0.57 | 0.57 | 0.54 | 0.48 | 0.55 |
| RMSE | 3.10 | 3.08 | 3.19 | 3.37 | 3.10 | |
| RPD | 1.48 | 1.49 | 1.44 | 1.37 | 1.48 | |
| SMO | R2 |
|
|
|
| 0.59 |
| RMSE |
|
|
|
| 2.97 | |
| RPD |
|
|
|
| 1.55 | |
| Raw | R2 |
|
| 0.58 |
|
|
| RMSE |
|
| 3.01 |
|
| |
| RPD |
|
| 1.53 |
|
| |
|
| ||||||
| PLSR | PCR | MLR | RF | SVM | ||
| ASGD1 | R2 | 0.58 | 0.57 | 0.55 | 0.56 | 0.55 |
| RMSE | 0.45 | 0.49 | 0.46 | 0.45 | 0.47 | |
| RPD | 1.46 | 1.36 | 1.43 | 1.46 | 1.42 | |
| ASGD2 | R2 | 0.51 | 0.14 | 0.29 | 0.54 | 0.59 |
| RMSE | 0.48 | 0.63 | 0.63 | 0.47 | 0.45 | |
| RPD | 1.39 | 1.06 | 1.04 | 1.41 | 1.48 | |
| EMSC | R2 | 0.56 |
| 0.54 | 0.48 | 0.53 |
| RMSE | 0.45 |
| 0.46 | 0.49 | 0.47 | |
| RPD | 1.47 |
| 1.45 | 1.36 | 1.40 | |
| MSC | R2 |
| 0.53 | 0.54 | 0.39 | 0.43 |
| RMSE |
| 0.49 | 0.45 | 0.52 | 0.52 | |
| RPD |
| 1.35 | 1.48 | 1.26 | 1.28 | |
| SGD1 | R2 | 0.57 | 0.58 | 0.55 | 0.48 | 0.55 |
| RMSE | 0.46 | 0.47 | 0.45 | 0.52 | 0.47 | |
| RPD | 1.45 | 1.41 | 1.46 | 1.27 | 1.40 | |
| SGD2 | R2 | 0.48 | 0.43 | 0.26 | 0.51 | 0.56 |
| RMSE | 0.48 | 0.51 | 0.63 | 0.49 | 0.45 | |
| RPD | 1.38 | 1.31 | 1.05 | 1.36 | 1.46 | |
| SNV | R2 | 0.57 | 0.50 | 0.55 | 0.36 | 0.42 |
| RMSE | 0.45 | 0.51 | 0.45 | 0.54 | 0.52 | |
| RPD | 1.46 | 1.30 | 1.48 | 1.23 | 1.27 | |
| SMO | R2 |
|
|
| 0.37 | 0.43 |
| RMSE |
|
|
| 0.52 | 0.50 | |
| RPD |
|
|
| 1.26 | 1.32 | |
| Raw | R2 |
|
| 0.59 | 0.36 | 0.43 |
| RMSE |
|
| 0.43 | 0.53 | 0.50 | |
| RPD |
|
| 1.56 | 1.25 | 1.33 | |
R2 = coefficient of determination; RMSE, root mean square error; RPD, ratio of performance to deviation. PLSR, partial least-squares regression; PCR, principal components regression; MLR, multiple linear regression; RF, random forest; SVM, support vector machine; ASGD1, absorbance transformation then Savitzky-Golay first derivative; ASGD2, absorbance transformation then Savitzky-Golay second derivative; EMSC, extended multiplicative scatter-correction; MSC, multiplicative scatter-correction; SGD1, Savitzky-Golay first derivative; SGD2, Savitzky-Golay second derivative; SNV, standard normal variate; SMO, smoothed. Models with validation R2≥0.6 are in bold.
Figure 2Water (top) and nitrogen (bottom) regression graphs for the models showing the best validation predictive performance from all trialled methods (regression coefficient reduction method with raw spectra and partial least square regression (PLSR)).
Predictive performances for the wavelength refined models. Cal = calibration. Val = validation. Results are shown for the two different approaches used for variable selection: using the top 30% of wavelengths based on the regression coefficients from the full-spectra models and using wavelengths previously determined to be associated with nitrogen and water.
| Water wavelength selection models | Nitrogen wavelength selection models | ||||||
|---|---|---|---|---|---|---|---|
| PLSR | Cal | Val | PLSR | Cal | Val | ||
| Regression Coefficients | R2 | 0.81 | 0.69 | Regression Coefficients | R2 | 0.74 | 0.66 |
| RMSE | 2.05 | 2.53 | RMSE | 0.32 | 0.41 | ||
| RPD | 2.31 | 1.78 | RPD | 1.89 | 1.66 | ||
| Known Absorption Features | R2 | 0.71 | 0.64 | Known Absorption Features | R2 | 0.54 | 0.52 |
| RMSE | 2.57 | 2.74 | RMSE | 0.42 | 0.47 | ||
| RPD | 1.85 | 1.76 | RPD | 1.45 | 1.39 | ||
Validation results for the strongest performing models for water and nitrogen prediction across the different pre-processing and regression methods trialled- PLSR in combination with raw input data. Wavelength selection using the regression coefficient method produced the strongest models.
| VNIR | Full spectra | Wavelength selection regression coefficients | Wavelength selection feature positions | ||
|---|---|---|---|---|---|
|
| R2 | 0.56 | 0.63 | 0.69 | 0.64 |
| RMSE | 2.85 | 2.81 | 2.53 | 2.74 | |
| RPD | 1.50 | 1.64 | 1.78 | 1.76 | |
|
| R2 | 0.57 | 0.60 | 0.66 | 0.52 |
| RMSE | 0.42 | 0.43 | 0.41 | 0.47 | |
| RPD | 1.53 | 1.54 | 1.66 | 1.39 | |
Figure 3The procedure for the development of water and nitrogen distribution maps. Mean spectra were used as input to generate a PLSR model. The coefficients of the partial least square regression (PLSR) model were then applied to the unfolded datacube at the individual pixel level providing a spatial visualisation of water and nitrogen distribution within the plants.
Figure 4Distribution maps showing the prediction of water content in a watered (A) and drought (C) plant and nitrogen levels in a low (B) and high (D) nitrogen soil plant.