| Literature DB >> 24955114 |
Jason S Lupoi1, Seema Singh2, Mark Davis3, David J Lee4, Merv Shepherd5, Blake A Simmons6, Robert J Henry7.
Abstract
BACKGROUND: In order to rapidly and efficiently screen potential biofuel feedstock candidates for quintessential traits, robust high-throughput analytical techniques must be developed and honed. The traditional methods of measuring lignin syringyl/guaiacyl (S/G) ratio can be laborious, involve hazardous reagents, and/or be destructive. Vibrational spectroscopy can furnish high-throughput instrumentation without the limitations of the traditional techniques. Spectral data from mid-infrared, near-infrared, and Raman spectroscopies was combined with S/G ratios, obtained using pyrolysis molecular beam mass spectrometry, from 245 different eucalypt and Acacia trees across 17 species. Iterations of spectral processing allowed the assembly of robust predictive models using partial least squares (PLS).Entities:
Keywords: Biomass; Fourier-transform infrared spectroscopy; High-throughput; Lignin S/G; Multivariate analysis; Near-infrared spectroscopy; Raman spectroscopy
Year: 2014 PMID: 24955114 PMCID: PMC4064109 DOI: 10.1186/1754-6834-7-93
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Evaluation of literature multivariate models for lignin S/G prediction
| 10 | - | - | 0.02 (S) 0.04 (G) | - | N | 0.04 (S) 0.04 (G) | 0.996 (S) 0.997 (G) | 0.985 (S) 0.986 (G) | - | [ | ||
| 55 (Cal) 25 (Val) | 0.07 | 0.32 | - | - | Y | - | 0.935 | 5 | [ | |||
| 63 (Cal) 30 (Val) | 0.32 | 0.35 | - | - | Y | - | 0.919 | 2 | [ | |||
| 9 | - | - | - | - | N | - | - | - | 0.983 | - | [ | |
| 5 | - | - | - | - | - | - | - | - | - | - | [ | |
| 5 | - | - | - | - | - | - | - | - | - | - | [ | |
| 65 | - | - | - | - | - | - | - | - | 0.91-0.98 | - | [ | |
| 15 | - | - | - | - | - | - | - | - | - | - | [ | |
| 42 (Cal) 36 (Val) | - | - | - | 0.025-0.033 | Y | 0.025-0.036 | 0.95-0.97 | 0.92-0.96 | 3-5 | [ | ||
| 267 (Cal) | - | - | 0.176 (S) 0.202 (G) 0.005 (H) | 0.201 (S) 0.202 (G) 0.005 (H) | N | - | 0.979 (S) 0.979 (G) 0.843 (H) | 0.968 (S) 0.957 (G) 0.731 (H) | 0.958 (S) 0.957 (G) 0.710 (H) | 8 | [ | |
| 135 (Cal) 45 (Val) | - | 0.124 | - | 0.121 | Y | - | 0.583 | 0.47 | 5-7 | [ | ||
| 26 (Cal) 8 (Val) | 0.26 | 0.3 | - | - | Y | - | 0.96 | 0.88 | 6 | [ |
1Standard Error of Calibration.
2Standard Error of Prediction.
3Root Mean Standard Error of Calibration.
4Root Mean Standard Error of Cross-Validation.
5Root Mean Standard Error of Prediction.
6Coefficient of Correlation for Validation Set.
7Coefficient of Determination for Calibration Set.
8Coefficient of Determination for Validation Set.
9#of Factors.
Italicized values calculated by the authors.
MIR = mid-infrared spectroscopy, NIR = near-infrared spectroscopy, Cal = calibration set, Val = validation set, S = syringyl, G = guaiacyl, H = -hydroxyphenol, Y = yes, N = no.
Figure 1Comparison of raw and pretreated mid-infrared spectral data. Mid-infrared spectra of Acacia microbotrya (green), Corymbia hybrid (blue), and Eucalyptus globulus subspecies maidenii (red). The upper panel (A) shows the untreated spectral data, while the middle (B) and bottom (C) panels show the second derivative, and second derivative + standard normal variate (SNV) spectral transformations, respectively. The x-axis is in wavenumbers while the y-axis is the absorbance.
Figure 2Comparison of raw and pretreated near-infrared spectral data. Near-infrared spectra of Acacia microbotrya (green), Corymbia hybrid (blue), and Eucalyptus globulus subspecies maidenii (red). The upper panel (A) shows the untreated spectral data, while the middle (B) and bottom (C) panels show the second derivative, and second derivative + standard normal variate (SNV) spectral transformations, respectively. The x-axis is in wavenumbers while the y-axis is the absorbance.
Figure 3Comparison of raw and pretreated Raman spectral data. Raman spectra of Acacia microbotrya (green), Corymbia hybrid (blue), and Eucalyptus globulus subspecies maidenii (red). The upper panel (A) shows the untreated spectral data, while the middle (B) and bottom (C) panels show the second derivative, and second derivative + standard normal variate (SNV) spectral transformations, respectively. The x-axis is in wavenumbers, while the y-axis shows the Raman intensity.
Comparison of PLS calibration models using vibrational spectroscopy and pyrolysis molecular beam mass spectrometry
| 0.05 | 0.13 | 0.14 | 0.83 ± 0.02 | 0.81 ± 0.02 | 4-5 | 2 | |
| 0.05 | 0.13 | 0.14 | 0.845 ± 0.003 | 0.82 ± 0.01 | 4-5 | 3 | |
| 0.05 | 0.13 | 0.14 | 0.83 ± 0.01 | 0.81 ± 0.01 | 5-6 | 2-3 | |
| 0.05 | 0.13 | 0.13 | 0.84 ± 0.01 | 0.82 ± 0.01 | 4-5 | 4-5 | |
| 0.05 | 0.13 | 0.14 | 0.84 ± 0.03 | 0.81 ± 0.03 | 3-4 | 1-2 | |
| 0.05 | 0.13 | 0.14 | 0.82 ± 0.01 | 0.78 ± 0.01 | 3-4 | 1-2 | |
| 0.05 | 0.13 | 0.14 | 0.85 ± 0.02 | 0.82 ± 0.03 | 3-4 | 2-3 | |
| 0.05 | 0.17 | 0.18 | 0.73 ± 0.01 | 0.681 ± 0.004 | 4-5 | 4-7 | |
| 0.05 | 0.17 | 0.18 | 0.72 ± 0.02 | 0.68 ± 0.02 | 4-6 | 1-5 | |
| 0.05 | 0.16 | 0.17 | 0.74 ± 0.01 | 0.70 ± 0.02 | 4-5 | 2-3 |
1Standard Error of the Laboratory for the calibration data.
2Root Mean Standard Error of Calibration.
3Root Mean Standard Error of Cross-Validation.
4Coefficient of determination for calibration set.
5Coefficient of determination for full cross-validation.
6Average number of factors used in model construction.
7Number of outliers removed from calibration models.
aAverage errors of 3 randomly generated models using data provided. Models were not statistically different.
The numbers listed parenthetically reflect the degree of Savitzky-Golay spectral smoothing.
Statistical values are the average of 3 independent models.
MIR = mid-infrared spectroscopy, NIR = near-infrared spectroscopy, EMSC = extended multiplicative scatter correction, MSC = multiplicative scatter correction, SNV = standard normal variate.
Comparison of PLS predictive models using vibrational spectroscopy and pyrolysis molecular beam mass spectrometry
| 0.05 | 0.14 | 0.13 | 0.89 ± 0.04 | 0.79 ± 0.08 | 1 | |
| 0.05 | 0.13 | 0.13 | 0.91 ± 0.02 | 0.83 ± 0.04 | 1 | |
| 0.05 | 0.14 | 0.15 | 0.90 ± 0.02 | 0.81 ± 0.04 | 0 | |
| 0.06 | 0.17 | 0.16 | 0.86 ± 0.02 | 0.74 ± 0.04 | 0 | |
| 0.05 | 0.14 | 0.13 | 0.87 ± 0.06 | 0.8 ± 0.1 | 1 | |
| 0.05 | 0.14 | 0.14 | 0.91 ± 0.01 | 0.83 ± 0.01 | 1 | |
| 0.05 | 0.15 | 0.15 | 0.87 ± 0.02 | 0.76 ± 0.03 | 1 | |
| 0.06 | 0.19 | 0.20 | 0.79 ± 0.01 | 0.62 ± 0.01 | 0 | |
| 0.06 | 0.18 | 0.18 | 0.82 ± 0.04 | 0.67 ± 0.07 | 1 | |
| 0.06 | 0.22 | 0.21 | 0.80 ± 0.04 | 0.65 ± 0.07 | 1 |
1Standard Error of the Laboratory for the validation data.
2Standard Error of Prediction.
3Root Mean Standard Error Prediction.
4Correlation coefficient for the validation set.
5Pearson coefficient of determination for validation.
6Number of outliers removed from validation models.
aAverage errors of three randomly generated models using data provided. Models were not statistically different.
The numbers listed parenthetically reflect the degree of Savitzky-Golay spectral smoothing.
Statistical values are the average of 3 independent models.
MIR = mid-infrared spectroscopy, NIR = near-infrared spectroscopy, EMSC = extended multiplicative scatter correction, MSC = multiplicative scatter correction, SNV = standard normal variate.
Figure 4Reference versus predicted plot for validation set using mid-infrared, near-infrared, and Raman spectral data. (A) Plot of the predicted lignin S/G ratio using a model built from second derivative + MSC-transformed mid-infrared spectra and the reference pyMBMS data. The black line indicates the target line of optimal fit and the blue line represents the experimental fit of the data to the model. (B) Plot of the predicted lignin S/G ratio using a model built from second derivative + MSC-transformed near-infrared spectra and the reference pyMBMS data. The black line indicates the target line of optimal fit and the blue line represents the experimental fit of the data to the model. (C) Plot of the predicted lignin S/G ratio using a model built from first derivative + EMSC-transformed Raman spectra and the reference pyMBMS data. The black line indicates the target line of optimal fit and the blue line represents the experimental fit of the data to the model. The x-axis shows the pyMBMS measured lignin S/G ratio, and the y-axis reveals the predicted lignin S/G ratios. S/G = syringyl-to-guaiacyl ratio, MSC = multiplicative scatter correction, pyMBMS = pyrolysis molecular beam mass spectrometry, EMSC = extended multiplicative scatter correction.
MIR vibrational mode regions identified from regression coefficient plots and spectral assignments corresponding to lignin and/or lignin monomers
| 788-790 | 784 (G) [ |
| 808-836 | 813 (G) [ |
| 827 (S) [ | |
| 854-883 | 863, 878 (G) [ |
| 912-917 | 914 (G) [ |
| 1137-1168 | 1142 (G) [ |
| 1151 (G) [ | |
| 1205-1263 | 1215 (Lignin) [ |
| 1226 (G), 1252 [ | |
| 1270-1299 | 1270 (G) [ |
| 1269 (G) [ | |
| 1319-1425 | 1425 (S) [ |
| 1330, 1425 (S), 1379, 1428 (Lignin) [ | |
| 1327 (G), 1425, 1427 [ | |
| 1442-1502 | 1500 (S) [ |
| 1465 (Lignin) [ | |
| 1462, 1463 [ | |
| 1508-1521 | 1506-1513 [ |
| 1513, 1514 [ | |
| 1585-1606 | 1589 (S) [ |
| 1596–1600 (Lignin) [ | |
| 1594, 1603 [ | |
| 1610-1612 | 1610 [ |
| 1698-1714 | 1704 [ |
| 1745-1756 | 1733-1753 [ |
G = guaiacyl, S = syringyl.
Raman vibrational modes identified from regression coefficient plots and spectral assignments corresponding to lignin and/or lignin monomers
| 351-376 | 369 (S), 357, 370 (G) [ |
| 378-401 | 370-399 (S) [ |
| 474-623 | 529, 564, 582 (S), 541, 559, 590 (G) [ |
| 665-725 | 711 (S) [ |
| 712 (G), 701 (H) [ | |
| 736-756 | 741 (S) [ |
| 741 (H) [ | |
| 748-765 | 761 (G) [ |
| 765-796 | 781-820 (S) [ |
| 784 (G) [ | |
| 793 (G) [ | |
| 800-835 | 819-864 (H) [ |
| 810 (S) [ | |
| 799 (S), 823 (H) [ | |
| 875-939 | 920 (G) [ |
| 907 (S), 921 (G) [ | |
| 991-1051 | 1024 (G) [ |
| 1043 (S), 1036 (G) [ | |
| 1091-1131 | 1108 (S), 1124 (G), 1094 (H) [ |
| 1116 (S), 1122 (G), 1105 (H) [ | |
| 1135-1195 | 1154 (S), 1158 (G), 1168 (H), 1170 (Lignin) [ |
| 1138–1160 (S), 1162–1188 (G), 1163–1179 (H) [ | |
| 1148 (S), 1186 (G), 1164 (H) [ | |
| 1152, 1187 (S), 1155, 1186 (G), 1173, 1199 (H) [ | |
| 1205-1242 | 1200 (H) [ |
| 1213–1218 (H) [ | |
| 1228 (S), 1215 (H) [ | |
| 1214, 1241 (S), 1208, 1241 (G), 1216 (H) [ | |
| 1261-1346 | 1337 (S), 1263 (H), 1270 (Lignin) [ |
| 1262–1275 (G), 1318–1332, 1331–1338 (S), 1286–1299 (H) [ | |
| 1331 (S), 1270–1285 (G), 1338 H [ | |
| 1331 (S), 1272, 1288 (G), 1298, 1331 (H) [ | |
| 1434-1448 | 1454-1460 (S), 1452–1465 (G), 1452–1459 (H) [ |
| 1452 (S), 1455 (G), 1455 (H) [ | |
| 1587-1606 | 1594 (S), 1589, 1604 (G), 1588, 1606 (H), 1591, 1604 (Lignin) [ |
| 1588 (S) [ | |
| 1609 (S), 1609 (G), 1599 (H) [ | |
| 1623-1629 | 1634 (S), 1633 (G), 1632 (H), 1634 (Lignin) [ |
| 1653-1672 | coniferyl (G) and sinapyl (G) alcohol [ |
G = guaiacyl, S = syringyl, H = p-coumaryl.
NIR vibrational modes identified from regression coefficient plots and spectral assignments corresponding to lignin and/or lignin monomers
| 4400-4586 | 4411 [ |
| 4546 [ | |
| 4686 [ | |
| 5581-5600 | 5583 [ |
| 5959-6009 | 5963, 5978 [ |
| 5974, 5978, 5980 [ | |
| 7081-7197 | 7092 [ |
| 8459-8674 | 8547 [ |
| 8720-8801 | 8749 [ |