Literature DB >> 21501713

Variable selection in near-infrared spectroscopy: benchmarking of feature selection methods on biodiesel data.

Roman M Balabin1, Sergey V Smirnov.   

Abstract

During the past several years, near-infrared (near-IR/NIR) spectroscopy has increasingly been adopted as an analytical tool in various fields from petroleum to biomedical sectors. The NIR spectrum (above 4000 cm(-1)) of a sample is typically measured by modern instruments at a few hundred of wavelengths. Recently, considerable effort has been directed towards developing procedures to identify variables (wavelengths) that contribute useful information. Variable selection (VS) or feature selection, also called frequency selection or wavelength selection, is a critical step in data analysis for vibrational spectroscopy (infrared, Raman, or NIRS). In this paper, we compare the performance of 16 different feature selection methods for the prediction of properties of biodiesel fuel, including density, viscosity, methanol content, and water concentration. The feature selection algorithms tested include stepwise multiple linear regression (MLR-step), interval partial least squares regression (iPLS), backward iPLS (BiPLS), forward iPLS (FiPLS), moving window partial least squares regression (MWPLS), (modified) changeable size moving window partial least squares (CSMWPLS/MCSMWPLSR), searching combination moving window partial least squares (SCMWPLS), successive projections algorithm (SPA), uninformative variable elimination (UVE, including UVE-SPA), simulated annealing (SA), back-propagation artificial neural networks (BP-ANN), Kohonen artificial neural network (K-ANN), and genetic algorithms (GAs, including GA-iPLS). Two linear techniques for calibration model building, namely multiple linear regression (MLR) and partial least squares regression/projection to latent structures (PLS/PLSR), are used for the evaluation of biofuel properties. A comparison with a non-linear calibration model, artificial neural networks (ANN-MLP), is also provided. Discussion of gasoline, ethanol-gasoline (bioethanol), and diesel fuel data is presented. The results of other spectroscopic techniques application, such as Raman, ultraviolet-visible (UV-vis), or nuclear magnetic resonance (NMR) spectroscopies, can be greatly improved by an appropriate feature selection choice.
Copyright © 2011 Elsevier B.V. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21501713     DOI: 10.1016/j.aca.2011.03.006

Source DB:  PubMed          Journal:  Anal Chim Acta        ISSN: 0003-2670            Impact factor:   6.558


  16 in total

1.  Characterization and noninvasive diagnosis of bladder cancer with serum surface enhanced Raman spectroscopy and genetic algorithms.

Authors:  Shaoxin Li; Linfang Li; Qiuyao Zeng; Yanjiao Zhang; Zhouyi Guo; Zhiming Liu; Mei Jin; Chengkang Su; Lin Lin; Junfa Xu; Songhao Liu
Journal:  Sci Rep       Date:  2015-05-07       Impact factor: 4.379

2.  Soil type recognition as improved by genetic algorithm-based variable selection using near infrared spectroscopy and partial least squares discriminant analysis.

Authors:  Hongtu Xie; Jinsong Zhao; Qiubing Wang; Yueyu Sui; Jingkuan Wang; Xueming Yang; Xudong Zhang; Chao Liang
Journal:  Sci Rep       Date:  2015-06-18       Impact factor: 4.379

3.  Estimation of the age of human bloodstains under the simulated indoor and outdoor crime scene conditions by ATR-FTIR spectroscopy.

Authors:  Hancheng Lin; Yinming Zhang; Qi Wang; Bing Li; Ping Huang; Zhenyuan Wang
Journal:  Sci Rep       Date:  2017-10-16       Impact factor: 4.379

4.  Sensitive Wavelengths Selection in Identification of Ophiopogon japonicus Based on Near-Infrared Hyperspectral Imaging Technology.

Authors:  Zhengyan Xia; Chu Zhang; Haiyong Weng; Pengcheng Nie; Yong He
Journal:  Int J Anal Chem       Date:  2017-08-27       Impact factor: 1.885

5.  High-throughput analysis of chemical components and theoretical ethanol yield of dedicated bioenergy sorghum using dual-optimized partial least squares calibration models.

Authors:  Meng Li; Jun Wang; Fu Du; Boubacar Diallo; Guang Hui Xie
Journal:  Biotechnol Biofuels       Date:  2017-09-04       Impact factor: 6.040

6.  Detection of Sclerotinia Stem Rot on Oilseed Rape (Brassica napus L.) Leaves Using Hyperspectral Imaging.

Authors:  Wenwen Kong; Chu Zhang; Feng Cao; Fei Liu; Shaoming Luo; Yu Tang; Yong He
Journal:  Sensors (Basel)       Date:  2018-06-01       Impact factor: 3.576

7.  An Ensemble Successive Project Algorithm for Liquor Detection Using Near Infrared Sensor.

Authors:  Fangfang Qu; Dong Ren; Jihua Wang; Zhong Zhang; Na Lu; Lei Meng
Journal:  Sensors (Basel)       Date:  2016-01-11       Impact factor: 3.576

8.  Improving the Classification Accuracy for Near-Infrared Spectroscopy of Chinese Salvia miltiorrhiza Using Local Variable Selection.

Authors:  Lianqing Zhu; Haitao Chang; Qun Zhou; Zhongyu Wang
Journal:  J Anal Methods Chem       Date:  2018-01-29       Impact factor: 2.193

9.  Band width selection data from Near Infra-red Spectral (NIRS) quantitative modelling of energy storage components (protein, lipid, glycogen) for single and multi-bivalve species models.

Authors:  Jill K Bartlett; William A Maher; Matthew B J Purss
Journal:  Data Brief       Date:  2018-04-22

10.  Quantitative Analysis of Cadmium in Tobacco Roots Using Laser-Induced Breakdown Spectroscopy With Variable Index and Chemometrics.

Authors:  Fei Liu; Tingting Shen; Wenwen Kong; Jiyu Peng; Chi Zhang; Kunlin Song; Wei Wang; Chu Zhang; Yong He
Journal:  Front Plant Sci       Date:  2018-09-13       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.