| Literature DB >> 31015553 |
Oliver Elle1,2, Ronny Richter3,4,5, Michael Vohland6,7, Alexandra Weigelt1,7.
Abstract
1. Root lignin is a key driver of root decomposition, which in turn is a fundamental component of the terrestrial carbon cycle and increasingly in the focus of ecologists and global climate change research. However, measuring lignin content is labor-intensive and therefore not well-suited to handle the large sample sizes of most ecological studies. To overcome this bottleneck, we explored the applicability of high-throughput near infrared spectroscopy (NIRS) measurements to predict fine root lignin content. 2. We measured fine root lignin content in 73 plots of a field biodiversity experiment containing a pool of 60 grassland species using the Acetylbromid (AcBr) method. To predict lignin content, we established NIRS calibration and prediction models based on partial least square regression (PLSR) resulting in moderate prediction accuracies (RPD = 1.96, R2 = 0.74, RMSE = 3.79). 3. In a second step, we combined PLSR with spectral variable selection. This considerably improved model performance (RPD = 2.67, R2 = 0.86, RMSE = 2.78) and enabled us to identify chemically meaningful wavelength regions for lignin prediction. 4. We identified 38 case studies in a literature survey and quantified median model performance parameters from these studies as a benchmark for our results. Our results show that the combination Acetylbromid extracted lignin and NIR spectroscopy is well suited for the rapid analysis of root lignin contents in herbaceous plant species even if the amount of sample is limited.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31015553 PMCID: PMC6479063 DOI: 10.1038/s41598-019-42837-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic depiction of the workflow of chemical analysis (left box with steps 1–4 indicated in square brackets), spectral analysis (outlined in black boxes and step 5) and statistical analysis. The latter is subdivided in spectral pre-processing (block 1 with steps 6–9) and final analysis (block 2 with steps 10–12). See text for further details.
Accuracies for different pre-processing methods achieved in model validation (mean and standard deviation from 100 runs).
| Method | nLV | RPD | RP2 | RMSEP [%] | SEP [%] | BIAS [%] |
|---|---|---|---|---|---|---|
| Raw | 8.17 ± 0.75 | 1.83 ± 0.32 | 0.71 ± 0.08 | 4.11 ± 0.62 | 4.05 ± 0.59 | −0.43 ± 0.95 |
| B.als | 6.62 ± 1.48 | 1.73 ± 0.27 | 0.69 ± 0.07 | 4.34 ± 0.64 | 4.27 ± 0.59 | −0.44 ± 1.05 |
| B.irls | 7.96 ± 1.40 | 1.69 ± 0.30 | 0.68 ± 0.09 | 4.47 ± 0.69 | 4.40 ± 0.65 | −0.49 ± 1.04 |
| MSC | 7.34 ± 1.61 | 1.95 ± 0.37 | 0.74 ± 0.09 | 11.64 ± 8.74 | 3.84 ± 0.64 | −1.83 ± 13.97 |
|
|
|
|
|
|
|
|
| D1 | 7.95 ± 2.08 | 1.84 ± 0.39 | 0.71 ± 0.10 | 4.14 ± 0.73 | 4.09 ± 0.70 | −0.35 ± 0.98 |
| D2 | 5.04 ± 2.31 | 1.56 ± 0.17 | 0.58 ± 0.09 | 4.77 ± 0.64 | 4.71 ± 0.59 | −0.02 ± 1.18 |
Given are measures of accuracies as number of latent variables used in PLSR (nLV), residual predictive deviation (RPD), measure of determination between predicted and observed lignin contents (Rp²), root mean squared error of prediction (RMSEP), standard error of prediction (SEP) and deviation from the line of equality of linear regression between predicted and observed values (BIAS). We used raw data (Raw) and six different pre-processing methods: asymmetric least squares baseline offset correction (B.als), iterative restricted least squares baseline offset correction (B.irls), multiplicative scatter correction (MSC), standard normal variate (SNV), first derivative (D1) and second derivative (D2). The best model is highlighted in bold.
Comparison of accuracies (mean ± SD) achieved for full spectrum PLSR versus best model from variable selection using CARS-PLS resulting from 100 repetitions for the calibration and validation.
| Models (S = 100) | WL-Selection | Full spectrum PLSR | CARS-PLSR |
|---|---|---|---|
| Calibration | Variables | 662 | 16 |
| nLV | 7.37 ± 1.28 | 6.99 ± 0.58 | |
| RMSE [%] | 3.58 ± 0.54 | 2.75 ± 0.30 | |
| Validation | RPD | 1.96 ± 0.35 | 2.67 ± 0.46 |
| RP2 | 0.74 ± 0.08 | 0.86 ± 0.05 | |
| RMSEP [%] | 3.83 ± 0.57 | 2.82 ± 0.40 | |
| SEP [%] | 3.79 ± 0.56 | 2.78 ± 0.40 | |
| BIAS | −0.22 ± 0.86 | −0.19 ± 0.65 |
Abbreviations are as given in Table 1. In addition we use root mean square error of calibration (RMSE).
Figure 2(a) Regression coefficients from full spectrum PLS (red line) with the selected wavelengths from the best CARS-PLS model as overlay (black arrows with wavelength indicated). (b) Mean absorbance spectrum of fine roots (red line) and its range (light grey) together with the relative importance (in %) of wavelength selection during 100 runs (dark grey bars) and the most important variables from the best model (black bars). Black arrows and number refer to the wavelength clusters indicated by the most important selected variables.
Figure 3Predicted versus observed lignin contents (mean over 100 runs) from model calibration (left) and model validation (right) given as linear regression (dashed line) against the 1:1 line (solid line). Symbols represent the gradient of species richness (squares = 1, circles = 2, triangle = 4, rhombus = 8, triangle (upside down) = 16). n indicates the number of samples used in calibration and validation.
Summary statistics of a literature survey of studies predicting lignin contents from different plant tissues using spectroscopic methods.
| Species38 | Lignin [mg]36 | Val/Cal35 | RPD28 |
| RMSEP [%]19 | SEP [%]21 | |
|---|---|---|---|---|---|---|---|
| Min | 1 | 100 | 0.12 | 0.5 | 0.05 | 0.19 | 0.28 |
| Q25 | 1 | 275 | 0.29 | 1.61 | 0.61 | 0.68 | 0.65 |
| Q75 | 1 | 500 | 0.50 | 3.56 | 0.90 | 1.88 | 2.27 |
| Max | 32 | 2500 | 3.55 | 9.5 | 0.99 | 3.48 | 5.00 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Given are minimum, maximum and interquartile ranges of the number of species examined per study (Species), sample volume used for chemical lignin extraction (Lignin) and proportion of samples used for model validation and calibration (Val/Cal). Additional abbreviations as in Table 1. Superscript numbers indicate the number of individual datasets that entered the analysis. See Table S1 for details on individual studies.