Simultaneous data pre-processing and SVM classification model selection based on a parallel genetic algorithm applied to spectroscopic data of olive oils.

Olivier Devos, Gerard Downey, Ludovic Duponchel.

Abstract

Classification is an important task in chemometrics. For several years now, support vector machines (SVMs) have proven to be powerful for infrared spectral data classification. However, such methods require optimisation of parameters in order to control the risk of overfitting and the complexity of the boundary. Furthermore, it is established that the prediction ability of classification models can be improved using pre-processing in order to remove unwanted variance in the spectra. In this paper we propose a new methodology based on a genetic algorithm (GA) for the simultaneous optimisation of SVM parameters and pre-processing (GENOPT-SVM). The method has been tested for the discrimination of the geographical origin of Italian olive oil (Ligurian and non-Ligurian) on the basis of near infrared (NIR) or mid infrared (FTIR) spectra. Different classification models (PLS-DA, SVM with mean-centred data, GENOPT-SVM) have been tested and statistically compared using McNemar's test. For the two datasets, SVM with optimised pre-processing gives models with higher accuracy than those obtained with PLS-DA on pre-processed data. In the case of the NIR dataset, most of this accuracy improvement (86.3% compared with 82.8% for PLS-DA) occurred using only a single pre-processing step. For the FTIR dataset, three optimised pre-processing steps are required to obtain an SVM model with a significant accuracy improvement (82.2%) compared to the one obtained with PLS-DA (78.6%). Furthermore, this study demonstrates that even SVM models have to be developed on the basis of well-corrected spectral data in order to obtain higher classification rates.
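
The idea of searching pre-processing choices and SVM parameters in one chromosome can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the pre-processing options, the log2 grids for C and gamma, the GA operators (uniform crossover, tournament selection, elitism) and the toy fitness function. In a real application the fitness would presumably be a cross-validated classification accuracy of an SVM trained on the pre-processed spectra.

```python
import random

# Hypothetical search space (not from the paper): one option per
# pre-processing step, plus two SVM hyper-parameters on a log2 grid.
GENES = [
    ["none", "snv", "msc"],                             # scatter correction
    ["none", "first_derivative", "second_derivative"],  # derivative
    ["mean_centre", "autoscale"],                       # scaling
    list(range(-5, 16)),                                # log2(C)
    list(range(-15, 4)),                                # log2(gamma)
]


def random_chromosome(rng):
    """Draw one candidate: a value for every gene."""
    return [rng.choice(options) for options in GENES]


def crossover(a, b, rng):
    """Uniform crossover: each gene comes from either parent."""
    return [x if rng.random() < 0.5 else y for x, y in zip(a, b)]


def mutate(chrom, rng, rate=0.1):
    """Resample each gene from its option list with probability `rate`."""
    return [rng.choice(opts) if rng.random() < rate else gene
            for gene, opts in zip(chrom, GENES)]


def tournament(pop, scores, rng, k=3):
    """Return the best of k randomly drawn individuals."""
    contenders = rng.sample(range(len(pop)), k)
    return pop[max(contenders, key=lambda i: scores[i])]


def run_ga(fitness, pop_size=20, generations=30, seed=0):
    """Maximise `fitness` over chromosomes; returns (best, best_score)."""
    rng = random.Random(seed)
    pop = [random_chromosome(rng) for _ in range(pop_size)]
    best, best_score = None, float("-inf")
    for gen in range(generations):
        scores = [fitness(c) for c in pop]
        for c, s in zip(pop, scores):
            if s > best_score:
                best, best_score = list(c), s
        if gen == generations - 1:
            break
        nxt = [list(best)]  # elitism: keep the best candidate seen so far
        while len(nxt) < pop_size:
            child = crossover(tournament(pop, scores, rng),
                              tournament(pop, scores, rng), rng)
            nxt.append(mutate(child, rng))
        pop = nxt
    return best, best_score


def toy_fitness(chrom):
    """Stand-in for cross-validated SVM accuracy (purely synthetic)."""
    scatter, derivative, _scaling, log2_c, log2_gamma = chrom
    score = (scatter == "snv") + (derivative == "first_derivative")
    return score - abs(log2_c - 3) / 10 - abs(log2_gamma + 7) / 10


best, best_score = run_ga(toy_fitness)
print(best, best_score)
```

Keeping the pre-processing genes and the SVM parameter genes in the same chromosome means a candidate is always scored as a complete pipeline, which is the point of optimising both simultaneously rather than one after the other.
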
Copyright © 2013 Elsevier Ltd. All rights reserved.
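
McNemar's test, used in the abstract to compare classifiers, works on the samples where two models disagree. The following is a minimal sketch assuming the continuity-corrected chi-square form of the test with one degree of freedom; the abstract does not say which variant the authors applied, and the function name and example labels are hypothetical.

```python
import math


def mcnemar(y_true, pred_a, pred_b):
    """Continuity-corrected McNemar test for two classifiers.

    Returns (b, c, chi2, p) where b counts samples only model A got
    right and c counts samples only model B got right.
    """
    b = sum(1 for t, pa, pb in zip(y_true, pred_a, pred_b)
            if pa == t and pb != t)
    c = sum(1 for t, pa, pb in zip(y_true, pred_a, pred_b)
            if pa != t and pb == t)
    if b + c == 0:
        # The models never disagree, so there is no evidence of a difference.
        return b, c, 0.0, 1.0
    chi2 = (abs(b - c) - 1) ** 2 / (b + c)
    # Survival function of a chi-square variable with 1 degree of freedom.
    p = math.erfc(math.sqrt(chi2 / 2))
    return b, c, chi2, p


# Made-up labels: model A is always right, model B misses six samples.
y_true = [1] * 10
pred_a = [1] * 10
pred_b = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
print(mcnemar(y_true, pred_a, pred_b))
```

With few discordant samples, an exact binomial version of the test is usually preferred over the chi-square approximation.
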

Keywords:  Classification; Genetic algorithm; Infrared spectroscopy; Parameter optimisation; Spectral pre-processing; Support vector machines

Year:  2013        PMID: 24262536     DOI: 10.1016/j.foodchem.2013.10.020

Source DB:  PubMed          Journal:  Food Chem        ISSN: 0308-8146            Impact factor:   7.514


  10 in total

1.  Optimization of Parameter Selection for Partial Least Squares Model Development.

Authors:  Na Zhao; Zhi-sheng Wu; Qiao Zhang; Xin-yuan Shi; Qun Ma; Yan-jiang Qiao
Journal:  Sci Rep       Date:  2015-07-13       Impact factor: 4.379

2.  Soil type recognition as improved by genetic algorithm-based variable selection using near infrared spectroscopy and partial least squares discriminant analysis.

Authors:  Hongtu Xie; Jinsong Zhao; Qiubing Wang; Yueyu Sui; Jingkuan Wang; Xueming Yang; Xudong Zhang; Chao Liang
Journal:  Sci Rep       Date:  2015-06-18       Impact factor: 4.379

3.  A Novel Extreme Learning Machine Classification Model for e-Nose Application Based on the Multiple Kernel Approach.

Authors:  Yulin Jian; Daoyu Huang; Jia Yan; Kun Lu; Ying Huang; Tailai Wen; Tanyue Zeng; Shijie Zhong; Qilong Xie
Journal:  Sensors (Basel)       Date:  2017-06-19       Impact factor: 3.576

Review 4.  Chemometrics Methods for Specificity, Authenticity and Traceability Analysis of Olive Oils: Principles, Classifications and Applications.

Authors:  Habib Messai; Muhammad Farman; Abir Sarraj-Laabidi; Asma Hammami-Semmar; Nabil Semmar
Journal:  Foods       Date:  2016-11-17

Review 5.  Comparison of Chemometric Problems in Food Analysis Using Non-Linear Methods.

Authors:  Werickson Fortunato de Carvalho Rocha; Charles Bezerra do Prado; Niksa Blonder
Journal:  Molecules       Date:  2020-07-02       Impact factor: 4.411

6.  Attenuated Total Reflection-Fourier Transform Infrared Spectroscopy (ATR-FTIR) Combined with Chemometrics Methods for the Classification of Lingzhi Species.

Authors:  Yuan-Yuan Wang; Jie-Qing Li; Hong-Gao Liu; Yuan-Zhong Wang
Journal:  Molecules       Date:  2019-06-13       Impact factor: 4.411

7.  Rapid Identification of Rainbow Trout Adulteration in Atlantic Salmon by Raman Spectroscopy Combined with Machine Learning.

Authors:  Zeling Chen; Ting Wu; Cheng Xiang; Xiaoyan Xu; Xingguo Tian
Journal:  Molecules       Date:  2019-08-06       Impact factor: 4.411

8.  Rapidly detecting fennel origin of the near-infrared spectroscopy based on extreme learning machine.

Authors:  Enguang Zuo; Lei Sun; Junyi Yan; Cheng Chen; Chen Chen; Xiaoyi Lv
Journal:  Sci Rep       Date:  2022-08-10       Impact factor: 4.996

9.  Determination of Hemicellulose, Cellulose and Lignin in Moso Bamboo by Near Infrared Spectroscopy.

Authors:  Xiaoli Li; Chanjun Sun; Binxiong Zhou; Yong He
Journal:  Sci Rep       Date:  2015-11-25       Impact factor: 4.379

10.  Non-Destructive and Rapid Variety Discrimination and Visualization of Single Grape Seed Using Near-Infrared Hyperspectral Imaging Technique and Multivariate Analysis.

Authors:  Yiying Zhao; Chu Zhang; Susu Zhu; Pan Gao; Lei Feng; Yong He
Journal:  Molecules       Date:  2018-06-04       Impact factor: 4.411
