| Literature DB >> 35056650 |
Ning Ai1,2, Yibo Jiang2,3, Sainab Omar4, Jiawei Wang4, Luyue Xia2,3, Jie Ren1.
Abstract
Near-infrared (NIR) spectroscopy and characteristic variables selection methods were used to develop a quick method for the determination of cellulose, hemicellulose, and lignin contents in Sargassum horneri. Calibration models for cellulose, hemicellulose, and lignin in Sargassum horneri were established using partial least square regression methods with full variables (full-PLSR). The PLSR calibration models were established by four characteristic variables selection methods, including interval partial least square (iPLS), competitive adaptive reweighted sampling (CARS), correlation coefficient (CC), and genetic algorithm (GA). The results showed that the performance of the four calibration models, namely iPLS-PLSR, CARS-PLSR, CC-PLSR, and GA-PLSR, was better than the full-PLSR calibration model. The iPLS method was best in the performance of the models. For iPLS-PLSR, the determination coefficient (R2), root mean square error (RMSE), and residual predictive deviation (RPD) of the prediction set were as follows: 0.8955, 0.8232%, and 3.0934 for cellulose, 0.8669, 0.4697%, and 2.7406 for hemicellulose, and 0.7307, 0.7533%, and 1.9272 for lignin, respectively. These findings indicate that the NIR calibration models can be used to predict cellulose, hemicellulose, and lignin contents in Sargassum horneri quickly and accurately.Entities:
Keywords: lignocelluloses; macroalgae; near-infrared spectroscopy; rapid measurement; variables selection
Mesh:
Substances:
Year: 2022 PMID: 35056650 PMCID: PMC8780011 DOI: 10.3390/molecules27020335
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Raw NIR spectra of Sargassum horneri samples.
Reference measurement results of samples in the calibration and prediction sets (lignocellulose in Sargassum horneri).
| Range | Mean | SD 1 | CV 2 (%) | |
|---|---|---|---|---|
| Total sets (n = 74) | ||||
| Cellulose (%) | 28.29–39.88 | 31.37 | 2.3604 | 7.5237 |
| Hemicellulose (%) | 16.75–22.66 | 19.28 | 1.3157 | 6.8229 |
| Lignin (%) | 22.10–27.20 | 27.04 | 1.2092 | 4.4725 |
| Calibration sets (n = 48) | ||||
| Cellulose (%) | 28.29–39.88 | 31.39 | 2.4720 | 7.8760 |
| Hemicellulose (%) | 16.75–22.64 | 19.14 | 1.3463 | 7.0349 |
| Lignin (%) | 22.10–26.98 | 25.14 | 1.2348 | 4.9126 |
| Prediction sets (n = 26) | ||||
| Cellulose (%) | 28.46–37.40 | 31.35 | 2.1860 | 6.9738 |
| Hemicellulose (%) | 17.14–21.39 | 19.55 | 1.2371 | 6.3266 |
| Lignin (%) | 22.41–27.20 | 24.70 | 1.1416 | 4.6220 |
1 Standard deviation; 2 coefficient of variation ((SD/mean) × 100).
Cellulose, hemicellulose, and lignin contents in Sargassum horneri and other plant fibers.
| Cellulose (%) | Hemicellulose (%) | Lignin (%) | |
|---|---|---|---|
|
| 28.29–39.88% | 16.75–22.64% | 22.10–27.20% |
| Eucalyptus [ | 37–46.9% | / | / |
| Corn fiber [ | 2.26–9.1% | 36.4–46.4% | / |
| Corn stalk [ | 30.6–33.1% | 25.8–27.65% | 14.6–15.9% |
| Miscanthus sinensis [ | 40–60% | 20–40% | 10–25% |
| Big bluestem [ | 29.59–43.02 | 20.73–30.84 | / |
| Moso bamboo [ | 37.98–53.76% | 17.7–28.18% | 13.82–23.86% |
Performance of the full-PLSR models with different pretreatment methods.
| Method | Calibration | Prediction | |||||
|---|---|---|---|---|---|---|---|
|
| RMSEC 2 |
| RMSECV 4 |
| RMSEP 6 | RPD 7 | |
| Cellulose | |||||||
| SG | 0.9825 | 0.3274 | 0.5347 | 1.3672 | 0.6161 | 1.3833 | 1.6139 |
| SG+1st | 0.9998 | 0.0287 | 0.4955 | 1.4034 | 0.3407 | 1.8667 | 1.2316 |
| SG+2nd | 0.9942 | 0.1872 | 0.4440 | 1.4093 | 0.4802 | 1.6271 | 1.3871 |
| SNV | 1.0000 | 0.0033 | 0.2490 | 1.5854 | 0.3459 | 1.7723 | 1.2364 |
| MSC | 1.0000 | 0.0030 | 0.2495 | 1.5803 | 0.3479 | 1.7693 | 1.2383 |
| Hemicellulose | |||||||
| SG | 0.8843 | 0.4579 | 0.6163 | 0.9491 | 0.5515 | 1.0277 | 1.4931 |
| SG+1st | 0.9998 | 0.0163 | 0.6132 | 0.9238 | 0.5558 | 0.9735 | 1.5004 |
| SG+2nd | 0.9696 | 0.2348 | 0.5681 | 0.9721 | 0.4724 | 0.9888 | 1.3767 |
| SNV | 1.0000 | 0.0024 | 0.5526 | 1.0135 | 0.3894 | 0.9823 | 1.2797 |
| MSC | 1.0000 | 0.0026 | 0.5483 | 1.0131 | 0.3787 | 0.9988 | 1.2687 |
| Lignin | |||||||
| SG | 0.9467 | 0.2849 | 0.2382 | 1.5341 | 0.1935 | 1.5654 | 1.1135 |
| SG+1st | 0.9561 | 0.2589 | 0.1892 | 1.5265 | 0.1413 | 1.5676 | 1.0791 |
| SG+2nd | 0.8163 | 0.5292 | 0.2113 | 1.3426 | 0.2058 | 1.3833 | 1.1221 |
| SNV | 0.9759 | 0.1918 | 0.1259 | 1.5004 | 0.1025 | 1.5259 | 1.0556 |
| MSC | 0.9598 | 0.2478 | 0.1932 | 1.4760 | 0.1385 | 1.5947 | 1.0774 |
1 Coefficient of determination for calibration set; 2 root mean square error for calibration set; 3 coefficient of determination for cross-validation set; 4 root mean square error for leave-one-out cross-validation set; 5 coefficient of determination for prediction set; 6 root mean square error for prediction set; 7 residual predictive deviation.
Performance of the full-PLSR models with different pretreatment methods.
| Model | Calibration | Prediction | ||||||
|---|---|---|---|---|---|---|---|---|
| CVs 1 |
| RMSEC |
| RMSECV |
| RMSEP | RPD | |
| Cellulose | ||||||||
| Full-PLSR | 8298 | 0.9825 | 0.3274 | 0.5347 | 1.3672 | 0.6161 | 1.3833 | 1.6139 |
| iPLS-PLSR | 1540 | 0.9827 | 0.3247 | 0.8511 | 0.7624 | 0.8955 | 0.8232 | 3.0934 |
| CARS-PLSR | 3261 | 0.9736 | 0.4017 | 0.6910 | 1.1304 | 0.7742 | 1.0637 | 2.1043 |
| CC-PLSR | 2485 | 0.9990 | 0.0783 | 0.7405 | 1.1877 | 0.7353 | 1.2988 | 1.9437 |
| GA-PLSR | 421 | 0.9339 | 0.6350 | 0.6808 | 1.1267 | 0.7418 | 1.1506 | 1.9680 |
| Hemicellulose | ||||||||
| Full-PLSR | 8298 | 0.9998 | 0.0163 | 0.6132 | 0.9238 | 0.5558 | 0.9735 | 1.5004 |
| iPLS-PLSR | 1935 | 0.9209 | 0.3786 | 0.8947 | 0.4983 | 0.8669 | 0.4697 | 2.7406 |
| CARS-PLSR | 6461 | 0.9998 | 0.0170 | 0.7581 | 0.8095 | 0.6962 | 0.9624 | 1.8143 |
| CC-PLSR | 1705 | 0.9723 | 0.2242 | 0.7661 | 0.7351 | 0.6746 | 0.7689 | 1.7532 |
| GA-PLSR | 731 | 0.9904 | 0.1320 | 0.7639 | 0.8181 | 0.7201 | 0.8029 | 1.8902 |
| Lignin | ||||||||
| Full-PLSR | 8298 | 0.8163 | 0.5292 | 0.2113 | 1.3426 | 0.2058 | 1.3833 | 1.1221 |
| iPLS-PLSR | 1665 | 0.9315 | 0.3232 | 0.8261 | 0.5172 | 0.7307 | 0.7533 | 1.9272 |
| CARS-PLSR | 3328 | 0.9726 | 0.2043 | 0.4423 | 0.9033 | 0.4119 | 0.9015 | 1.3040 |
| CC-PLSR | 2264 | 0.9411 | 0.2996 | 0.4139 | 1.3251 | 0.4460 | 1.1685 | 1.3435 |
| GA-PLSR | 899 | 0.8495 | 0.4789 | 0.5992 | 1.3164 | 0.3660 | 1.3635 | 1.5796 |
1 Characteristic variables.
Figure 2Plots of characteristic variable selection based on iPLS for cellulose in Sargassum horneri.
Figure 3Plots of characteristic variable selection based on CARS for cellulose in Sargassum horneri. Plot (a), ((b) upper), and ((b) lower) show the regression coefficient, number of sampled variables, and RMSECV value, respectively.
Figure 4Plots of characteristic variable selection based on CC for cellulose in Sargassum horneri. Plots (a,b) show the correlation coefficient and RMSECV value, respectively.
Figure 5Plots of characteristic variable selection based on GA for cellulose in Sargassum horneri. Plots (a,b) show the variable frequency and RMSECV value, respectively.
Figure 6Scatter plots of reference measurements and NIR predictions using iPLS-PLSR models for cellulose (a), hemicellulose (b), and lignin (c) in Sargassum horneri.
The results of the near-infrared spectroscopy measurement of terrestrial biomass and Sargassum horneri.
| Content | R2 | RMSE | SEP | |
|---|---|---|---|---|
|
| Cellulose, hemicellulose and lignin | 0.8955, 0.8669, and 0.7307 | 0.8232, 0.4697, and 0.7533 | 3.0934, 2.7406, and 1.9272 |
| Eucalyptus [ | Cellulose | 0.82–0.94 | 0.7–1.07 | / |
| Corn fiber [ | Cellulose and hemicellulose | 0.81–0.96 and 0.31–0.81 | 0.30–0.68 and 0.79–1.04 | / |
| Miscanthus sinensis [ | Cellulose, hemicellulose and lignin | 0.943, 0.938, and 0.864 | 0.678, 0.707, and 0.562 | / |
| Big bluestem [ | Cellulose and hemicellulose | 0.92 and 0.91 | 0.67 and 0.72 | 4.52 and 3.12 |
| Moso bamboo [ | Cellulose, hemicellulose and lignin | 0.909. 0.921, and 0.892 | 0.81, 1.05, and 0.65 | 5.42, 3.18, and 1.62 |