Literature DB >> 20830430

Detecting influential observations by cluster analysis and Monte Carlo cross-validation.

Xihui Bian1, Wensheng Cai, Xueguang Shao, Da Chen, Edward R Grant.   

Abstract

The detection of influential observations is an essential step for building high performance models and has been recognized as an important and challenging task in many industrial and laboratorial applications. A new approach for detecting influential observations is developed based on their effect on partial least squares (PLS) modeling. In this method, we build a large number of PLS models by using Monte Carlo cross-validation (MCCV), and then perform principal component analysis (PCA) on the regression coefficients of these models. Because a model with influential observations is different from the one without influential observation, the series of PLS models cluster into different groups in principal component (PC) spaces, based on the different number of influential observations they contain. The influential observations can be therefore recognized according to the frequency number of each sample in each group. By three examples quantitatively modeling near-infrared (NIR) and Raman spectra, it was shown that the method can detect the influential observations intuitively and veraciously.

Entities:  

Year:  2010        PMID: 20830430     DOI: 10.1039/c0an00345j

Source DB:  PubMed          Journal:  Analyst        ISSN: 0003-2654            Impact factor:   4.616


  1 in total

1.  New utility for an old tool: can a simple gait speed test predict ambulatory surgical discharge outcomes?

Authors:  Charles A Odonkor; Robert B Schonberger; Feng Dai; Kirk H Shelley; David G Silverman; Paul G Barash
Journal:  Am J Phys Med Rehabil       Date:  2013-10       Impact factor: 2.159

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.