| Literature DB >> 28878821 |
Meng Li1,2, Jun Wang1,2, Fu Du1,2, Boubacar Diallo1,2, Guang Hui Xie1,2.
Abstract
BACKGROUND: Due to its chemical composition and abundance, lignocellulosic biomass is an attractive feedstock source for global bioenergy production. However, chemical composition variations interfere with the success of any single methodology for efficient bioenergy extraction from diverse lignocellulosic biomass sources. Although chemical component distributions could guide process design, they are difficult to obtain and vary widely among lignocellulosic biomass types. Therefore, expensive and laborious "one-size-fits-all" processes are still widely used. Here, a non-destructive and rapid analytical technology, near-infrared spectroscopy (NIRS) coupled with multivariate calibration, shows promise forEntities:
Keywords: Bioenergy sorghum; Chemical components; Linear discriminant analysis model; Near infrared spectroscopy; Optimal sample subset partitioning; Optimal variable selection; Partial least squares model; Theoretical ethanol yield
Year: 2017 PMID: 28878821 PMCID: PMC5584014 DOI: 10.1186/s13068-017-0892-z
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Fig. 1Diversity of chemical components and theoretical ethanol yield, and their correlations in 147 bioenergy sorghums. S soluble sugar, C cellulose, H hemicellulose, L lignin, A ash, TEY theoretical ethanol yield, TEY-C6 TEY from hexose, and TEY-C5 TEY from pentose. a Chemical components of 147 bioenergy sorghum samples, b TEY of 147 bioenergy sorghum samples, c correlations among chemical components and TEY of 70 sweet sorghum samples, d correlations among chemical components and TEY of 77 biomass sorghum samples
Fig. 2Flowchart of NIRS qualitative and quantitative analyses. PCA principal component analysis, LDA linear discriminant analysis, SPXY sample set partitioning based on joint X–Y distances, CARS competitive adaptive reweighted sampling, SR selectivity ratio, VIP variable importance for projection, UVE uninformative variable elimination, MC-UVE UVE couple with the principle of Monte Carlo, PLS partial least squares
Fig. 3Description and qualitative classification of 147 bioenergy sorghum samples. a Original spectra of 147 bioenergy sorghum samples, b 3D plot of the principal component analysis scores of 147 bioenergy sorghum samples, c the correct classification rate and variance obtained by linear discriminant analysis, d Mahalanobis distance of 147 bioenergy sorghum samples (principal components = 20)
The main absorption band peak location of chemical components in bioenergy sorghum
| Wavenumber (cm−1) | Component | Bond vibration |
|---|---|---|
| 4015–4022 | C, L | C–Hstr, C–Cstr |
| 4285–4296 | H | C–Hstr, C–Hdef |
| 4392–4412 | C, H, L | C–Hstr, C–H2str, O–Hstr, C–Ostr, C–Hdef, C–H2def |
| 4760–4780 | C | O–Hdef, C–Hdef, O–Hstr |
| 5150–5195 | W, S | O–Hasym, O–Hstr, O–Hdef |
| 5776–5796 | S, C, L | 1st OT C–Hstr |
| 6329–6336 | C | 1st OT C–Hstr |
| 6775–6822 | S, C, H | 1st OT C–Hstr |
| 7305–7328 | C | 1st OT C–Hstr, C–Hdef |
C cellulose, H hemicellulose, L lignin, S soluble sugar, W water
Fig. 4Histograms of chemical components and theoretical ethanol yield. Soluble sugar (a), cellulose (b), hemicellulose (c), lignin (d), ash (e), and theoretical ethanol yield (f) for calibration and external validation subsets that were partitioned by sample set partitioning based on joint X–Y distances
Fig. 5Principal component analysis plots distribution of chemical components and theoretical ethanol yield. Soluble sugar (a), cellulose (b), hemicellulose (c), lignin (d), ash (e), and theoretical ethanol yield (f) for calibration and external validation subsets that partitioned by sample set partitioning based on joint X–Y distances
Summary statistics of partial-optimized partial least squares models for chemical components and theoretical ethanol yield (TEY) of bioenergy sorghum
| Parameter | Calibration | Cross validation | External validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Number | PCs | RMSEC |
| RMSECV |
| Number | RMSEP |
| RPD | RER | |
| Soluble sugar | 108 | 7 | 2.27 | 0.95 | 2.66 | 0.93 | 37 | 2.57 | 0.91 | 3.39 | 16.42 |
| Cellulose | 108 | 9 | 0.93 | 0.95 | 1.26 | 0.91 | 37 | 1.18 | 0.91 | 3.46 | 16.30 |
| Hemicellulose | 108 | 7 | 0.75 | 0.95 | 0.85 | 0.94 | 37 | 0.64 | 0.95 | 4.33 | 25.15 |
| Lignin | 107 | 8 | 0.65 | 0.95 | 0.74 | 0.94 | 37 | 0.65 | 0.92 | 3.66 | 19.72 |
| Ash | 108 | 7 | 0.68 | 0.72 | 0.82 | 0.61 | 37 | 0.49 | 0.90 | 3.25 | 12.58 |
| TEY | 108 | 6 | 12.95 | 0.76 | 14.18 | 0.72 | 37 | 8.08 | 0.84 | 3.42 | 17.61 |
PCs principal components
Fig. 6Plots of predicted versus reference values of chemical components and theoretical ethanol yield. Soluble sugar (a), cellulose (b), hemicellulose (c), lignin (d), ash (e), and theoretical ethanol yield (f) for the external validation subsets based on partial-optimized partial least squares calibration models. The represents the square of the correlation coefficients of the external validation subsets
Fig. 7The optimal variable selection in calibration subset by CARS, SR, VIP, MC-UVE, and UVE. CARS competitive adaptive reweighted sampling, SR selectivity ratio, VIP variable importance for projection, UVE uninformative variable elimination, and MC-UVE UVE couple with the principle of Monte Carlo. Soluble sugar (a), cellulose (b), hemicellulose (c), lignin (d), ash (e), and theoretical ethanol yield (f). Asterisk refers to the number of variables selected by each method
The statistics of characteristic variables for chemical components and theoretical ethanol yield (TEY)
| Parameter | MIV | VIV | IV | NV | LIV | UIV | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| Soluble sugar | 4 | 0.3 | 64 | 4.1 | 230 | 14.8 | 908 | 58.3 | 215 | 13.8 | 136 | 8.7 |
| Cellulose | 5 | 0.3 | 56 | 3.6 | 240 | 15.4 | 824 | 52.9 | 248 | 15.9 | 184 | 11.8 |
| Hemicellulose | 4 | 0.3 | 37 | 2.4 | 251 | 16.1 | 918 | 59.0 | 166 | 10.7 | 181 | 11.6 |
| Lignin | 7 | 0.5 | 62 | 4.0 | 276 | 17.7 | 826 | 53.1 | 233 | 15.0 | 153 | 9.8 |
| Ash | 1 | 0.1 | 14 | 1.0 | 269 | 17.3 | 796 | 51.1 | 208 | 13.4 | 269 | 17.3 |
| TEY | 4 | 0.3 | 55 | 3.5 | 234 | 15.0 | 921 | 59.2 | 211 | 13.6 | 132 | 8.5 |
MIV most important variable, VIV very important variable, IV important variable, NV normal variable, LIV less important variable, UIV uninformative variable, N number of selected variables, P the proportion of selected variables in total variables
Fig. 8The performance of 30 dual-optimized PLS models and 6 partial-optimized PLS models. FS full spectra, CARS competitive adaptive reweighted sampling, SR selectivity ratio, VIP variable importance for projection, UVE uninformative variable elimination, MC-UVE UVE couple with the principle of Monte Carlo. Soluble sugar (a), cellulose (b), hemicellulose (c), lignin (d), ash (e), theoretical ethanol yield (f)