| Literature DB >> 29346328 |
Hong Men1, Songlin Fu2, Jialin Yang3, Meiqi Cheng4, Yan Shi5, Jingjing Liu6.
Abstract
Paraffin odor intensity is an important quality indicator when a paraffin inspection is performed. Currently, paraffin odor level assessment is mainly dependent on an artificial sensory evaluation. In this paper, we developed a paraffin odor analysis system to classify and grade four kinds of paraffin samples. The original feature set was optimized using Principal Component Analysis (PCA) and Partial Least Squares (PLS). Support Vector Machine (SVM), Random Forest (RF), and Extreme Learning Machine (ELM) were applied to three different feature data sets for classification and level assessment of paraffin. For classification, the model based on SVM, with an accuracy rate of 100%, was superior to that based on RF, with an accuracy rate of 98.33-100%, and ELM, with an accuracy rate of 98.01-100%. For level assessment, the R² related to the training set was above 0.97 and the R² related to the test set was above 0.87. Through comprehensive comparison, the generalization of the model based on ELM was superior to those based on SVM and RF. The scoring errors for the three models were 0.0016-0.3494, lower than the error of 0.5-1.0 measured by industry standard experts, meaning these methods have a higher prediction accuracy for scoring paraffin level.Entities:
Keywords: classify; grade; level; paraffin; paraffin odor analysis system
Year: 2018 PMID: 29346328 PMCID: PMC5795501 DOI: 10.3390/s18010285
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1A schematic diagram of the paraffin odor analysis system.
Figure 2Paraffin odor analysis system.
Figure 3Sensor responses of four different paraffin samples: (a) Paraffin score 0.6; (b) Paraffin score 1.0; (c) Paraffin score 1.2; (d) Paraffin score 1.5.
Figure 4The PCA processing result.
The feature screening process.
| Time | Selected Variable | Cross-Validation Discrimination Function |
|---|---|---|
| 1 | t1 | 1 |
| 2 | t2 | 0.7896 |
| 3 | t3 | 0.2248 |
| 4 | t4 | 0.1760 |
| 5 | t5 | −0.0638 |
Figure 5The Grid Search for the best parameter for constructing the LIBSVM model: (a) Based on the PCA-optimized feature set; (b) Based on the PLS-optimized feature set; (c) Based on the original feature set.
Comparison of classification results of #PCA, #PLS and #Complete combined with SVM model.
| Feature Set | Best Parameter | Accuracy Rate for Training Set (%) | Accuracy Rate for 3-Fold Cross-Validation (%) | Accuracy Rate for Test Set (%) | |
|---|---|---|---|---|---|
| Penalty Factor | Kernel Parameter | ||||
| #PCA | 1 | 1.4142 | 100 | 100 | 100 |
| #PLS | 0.00097656 | 0.00087656 | 100 | 100 | 100 |
| #Complete | 1.4142 | 0.35355 | 100 | 100 | 100 |
Figure 6The influence of the number of decision trees on RF performance: (a) Based on the PCA-optimized feature set; (b) Based on the PLS-optimized feature set; (c) Based on the original feature set.
Figure 7The influence of the number of hidden layer neurons on ELM performance: (a) Based on the PCA-optimized feature set; (b) Based on the PLS-optimized feature set; (c) Based on the original feature set.
Comparison of the #PCA, #PLS, and #Complete parameters combined with the SVM model.
| Feature Set | The Best Parameter | Training Set | Test Set | |||
|---|---|---|---|---|---|---|
| R2 | RMSE | R2 | RMSE | |||
| #PCA | 32 | 0.1767 | 0.9829 | 00481 | 0.9502 | 0.1376 |
| #PLS | 5.6596 | 0.125 | 0.9894 | 0.0491 | 0.9639 | 0.1968 |
| #Complete | 2.8284 | 0.0883 | 0.9974 | 0.0289 | 0.8913 | 0.1317 |
Prediction error of different paraffin odor levels based on SVM.
| Feature Set | Maximum Error | Minimum Error |
|---|---|---|
| #PCA | 0.1448 | 0.0041 |
| #PLS | 0.2163 | 0.0044 |
| #Complete | 0.1690 | 0.0016 |
Comparison of the #PCA, #PLS, and #Complete parameters combined with the RF model.
| Feature Set | Training Set | Test Set | ||
|---|---|---|---|---|
| R2 | RMSE | R2 | RMSE | |
| #PCA | 0.9767 | 0.1951 | 0.8717 | 0.3707 |
| #PLS | 0.9869 | 0.1197 | 0.9645 | 0.2022 |
| #Complete | 0.9865 | 0.1089 | 0.9896 | 0.1537 |
Prediction error of different paraffin odor levels based on RF.
| Feature Set | Maximum Error | Minimum Error |
|---|---|---|
| #PCA | 0.3494 | 0.0121 |
| #PLS | 0.1793 | 0.0024 |
| #Complete | 0.1266 | 0.0045 |
Comparison of the #PCA, #PLS, and #Complete parameters with the ELM model.
| Feature Set | Training Set | TEST SET | ||
|---|---|---|---|---|
| R2 | RMSE | R2 | RMSE | |
| #PCA | 0.9730 | 0.0727 | 0.9438 | 0.1437 |
| #PLS | 0.9972 | 0.0208 | 0.9675 | 0.1793 |
| #Complete | 0.9878 | 0.0472 | 0.9341 | 0.1741 |
Figure 8Score prediction of the paraffin odor based on the ELM model: (a) Based on the PCA-optimized feature set; (b) Based on the PLS-optimized feature set; (c) Based on the original feature set.
Prediction error of different paraffin odor levels based on RF.
| Feature Set | Maximum Error | Minimum Error |
|---|---|---|
| #PCA | 0.1487 | 0.0016 |
| #PLS | 0.1239 | 0.0061 |
| #Complete | 0.1804 | 0.0033 |