| Literature DB >> 24356218 |
Yong-Huan Yun1, Wei-Ting Wang1, Min-Li Tan1, Yi-Zeng Liang2, Hong-Dong Li1, Dong-Sheng Cao3, Hong-Mei Lu1, Qing-Song Xu4.
Abstract
Nowadays, with a high dimensionality of dataset, it faces a great challenge in the creation of effective methods which can select an optimal variables subset. In this study, a strategy that considers the possible interaction effect among variables through random combinations was proposed, called iteratively retaining informative variables (IRIV). Moreover, the variables are classified into four categories as strongly informative, weakly informative, uninformative and interfering variables. On this basis, IRIV retains both the strongly and weakly informative variables in every iterative round until no uninformative and interfering variables exist. Three datasets were employed to investigate the performance of IRIV coupled with partial least squares (PLS). The results show that IRIV is a good alternative for variable selection strategy when compared with three outstanding and frequently used variable selection methods such as genetic algorithm-PLS, Monte Carlo uninformative variable elimination by PLS (MC-UVE-PLS) and competitive adaptive reweighted sampling (CARS). The MATLAB source code of IRIV can be freely downloaded for academy research at the website: http://code.google.com/p/multivariate-calibration/downloads/list.Keywords: Informative variables; Iteratively retaining informative variables; Partial least squares; Random combination; Variable selection
Mesh:
Substances:
Year: 2013 PMID: 24356218 DOI: 10.1016/j.aca.2013.11.032
Source DB: PubMed Journal: Anal Chim Acta ISSN: 0003-2670 Impact factor: 6.558