| Literature DB >> 19547705 |
Xiao-Ping Yu1, Lu Xu, Ru-Qin Yu.
Abstract
In our recent work, Monte Carlo Cross Validation Stacked Regression (MCCVSR) is proposed to achieve automatic optimization of spectral interval selection in multivariate calibration. Though MCCVSR performs well in normal conditions, it is still necessary to improve it for more general applications. According to the well-known principle of "garbage in, garbage out (GIGO)", as a precise ensemble method, MCCVSR might be influenced by outlying and very bad submodels. In this paper, a statistical test is designed to exclude the ruinous submodels from the ensemble learning process, therefore, the combination process becomes more reliable. Though completely automated, the proposed method is adjustable according to the nature of the data analyzed, including the size of training samples, resolution of spectra and quantitative potentials of the submodels. The effectiveness of the submodel refining is demonstrated by the investigation of a real standard data.Entities:
Year: 2009 PMID: 19547705 PMCID: PMC2696029 DOI: 10.1155/2009/291820
Source DB: PubMed Journal: J Autom Methods Manag Chem ISSN: 1463-9246
Figure 1Some of the original spectra in the temperature data set.
The results of PLS model with total spectral range for the temperature data.
| Component | LVna | RMSEMCCV | RMSEC | RMSEP |
|---|---|---|---|---|
| Ethanol | 12 | 0.0098 | 0.0287 | 0.0257 |
| Water | 13 | 0.0038 | 0.0094 | 0.0136 |
| Isopropanol | 12 | 0.0082 | 0.0225 | 0.0212 |
aThe number of PLS latent variables.
Figure 2(a) Model complexity and (b) RMSEMCCV values of interval models for prediction of ethanol.
Figure 3The combination coefficients of (a) MCCVSR and (b) MCCVSR with submodel refining for predicting ethanol.
The calibration results of the three components obtained by combination models.
| Component | Nma | RMSEMCCV | RMSEC | RMSEP | ||||
|---|---|---|---|---|---|---|---|---|
| 1b | 2c | 1 | 2 | 1 | 2 | 1 | 2 | |
| Ethanol | 97 | 67 | 0.0137 | 0.0138 | 0.0103 | 0.0104 | 0.0183 | 0.0185 |
| Water | 97 | 81 | 0.0094 | 0.0094 | 0.0077 | 0.0078 | 0.0121 | 0.0120 |
| Isopropanol | 97 | 72 | 0.0136 | 0.0137 | 0.0109 | 0.0109 | 0.0170 | 0.0164 |
aThe number of submodels for combination.
bResults obtained by MCCVSR.
cResults obtained by MCCVSR with submodel refining.