Literature DB >> 25682424

Using variable combination population analysis for variable selection in multivariate calibration.

Yong-Huan Yun1, Wei-Ting Wang1, Bai-Chuan Deng2, Guang-Bi Lai3, Xin-bo Liu1, Da-Bing Ren1, Yi-Zeng Liang4, Wei Fan5, Qing-Song Xu6.   

Abstract

Variable (wavelength or feature) selection techniques have become a critical step for the analysis of datasets with high number of variables and relatively few samples. In this study, a novel variable selection strategy, variable combination population analysis (VCPA), was proposed. This strategy consists of two crucial procedures. First, the exponentially decreasing function (EDF), which is the simple and effective principle of 'survival of the fittest' from Darwin's natural evolution theory, is employed to determine the number of variables to keep and continuously shrink the variable space. Second, in each EDF run, binary matrix sampling (BMS) strategy that gives each variable the same chance to be selected and generates different variable combinations, is used to produce a population of subsets to construct a population of sub-models. Then, model population analysis (MPA) is employed to find the variable subsets with the lower root mean squares error of cross validation (RMSECV). The frequency of each variable appearing in the best 10% sub-models is computed. The higher the frequency is, the more important the variable is. The performance of the proposed procedure was investigated using three real NIR datasets. The results indicate that VCPA is a good variable selection strategy when compared with four high performing variable selection methods: genetic algorithm-partial least squares (GA-PLS), Monte Carlo uninformative variable elimination by PLS (MC-UVE-PLS), competitive adaptive reweighted sampling (CARS) and iteratively retains informative variables (IRIV). The MATLAB source code of VCPA is available for academic research on the website: http://www.mathworks.com/matlabcentral/fileexchange/authors/498750.
Copyright © 2015 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Exponentially decreasing function; Model population analysis; Multivariate calibration; Partial least squares; Variable combination; Variable selection

Mesh:

Year:  2014        PMID: 25682424     DOI: 10.1016/j.aca.2014.12.048

Source DB:  PubMed          Journal:  Anal Chim Acta        ISSN: 0003-2670            Impact factor:   6.558


  12 in total

1.  Determination of Adulteration Content in Extra Virgin Olive Oil Using FT-NIR Spectroscopy Combined with the BOSS-PLS Algorithm.

Authors:  Hui Jiang; Quansheng Chen
Journal:  Molecules       Date:  2019-06-06       Impact factor: 4.411

2.  An efficient variable selection method based on random frog for the multivariate calibration of NIR spectra.

Authors:  Jingjing Sun; Wude Yang; Meichen Feng; Qifang Liu; Muhammad Saleem Kubar
Journal:  RSC Adv       Date:  2020-04-23       Impact factor: 4.036

3.  How to Resolve the Maximum Valuable Information in Complex NIR Signal: A Practicable Method Based on Wavelet Transform.

Authors:  Jing Chen; Xiaoquan Lu
Journal:  Front Chem       Date:  2022-04-07       Impact factor: 5.545

4.  Mutation status coupled with RNA-sequencing data can efficiently identify important non-significantly mutated genes serving as diagnostic biomarkers of endometrial cancer.

Authors:  Keqin Liu; Li He; Zhichao Liu; Junmei Xu; Yuan Liu; Qifan Kuang; Zhining Wen; Menglong Li
Journal:  BMC Bioinformatics       Date:  2017-12-28       Impact factor: 3.169

5.  Detection of Soil Nitrogen Using Near Infrared Sensors Based on Soil Pretreatment and Algorithms.

Authors:  Pengcheng Nie; Tao Dong; Yong He; Fangfang Qu
Journal:  Sensors (Basel)       Date:  2017-05-11       Impact factor: 3.576

6.  Fine root lignin content is well predictable with near-infrared spectroscopy.

Authors:  Oliver Elle; Ronny Richter; Michael Vohland; Alexandra Weigelt
Journal:  Sci Rep       Date:  2019-04-23       Impact factor: 4.379

7.  An Ensemble Successive Project Algorithm for Liquor Detection Using Near Infrared Sensor.

Authors:  Fangfang Qu; Dong Ren; Jihua Wang; Zhong Zhang; Na Lu; Lei Meng
Journal:  Sensors (Basel)       Date:  2016-01-11       Impact factor: 3.576

8.  Analysis of near infrared spectra for age-grading of wild populations of Anopheles gambiae.

Authors:  Benjamin J Krajacich; Jacob I Meyers; Haoues Alout; Roch K Dabiré; Floyd E Dowell; Brian D Foy
Journal:  Parasit Vectors       Date:  2017-11-07       Impact factor: 3.876

9.  Research on the Effects of Drying Temperature on Nitrogen Detection of Different Soil Types by Near Infrared Sensors.

Authors:  Pengcheng Nie; Tao Dong; Yong He; Shupei Xiao
Journal:  Sensors (Basel)       Date:  2018-01-29       Impact factor: 3.576

10.  Towards improvement in prediction of iodine value in edible oil system based on chemometric analysis of portable vibrational spectroscopic data.

Authors:  Hong Yan; Jixiong Zhang; Jingxian Gao; Yangming Huang; Yanmei Xiong; Shungeng Min
Journal:  Sci Rep       Date:  2018-10-03       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.