Literature DB >> 28367620

Consensus Outlier Detection Using Sum of Ranking Differences of Common and New Outlier Measures Without Tuning Parameter Selections.

Brett Brownfield1, John H Kalivas1.   

Abstract

Sample outlier detection is imperative before calculating a multivariate calibration model. Outliers, especially in high-dimensional space, can be difficult to detect. The outlier measures Hotelling's t-squared, Q-residuals, and Studentized residuals are standard in analytical chemistry with spectroscopic data. However, these and other merits are tuning parameter dependent and sensitive to the outlier themselves, i.e., the measures are susceptible to swamping and masking. Additionally, different samples are also often flagged as outliers depending on the outlier measure used. Sum of ranking differences (SRD) is a new generic fusion tool that can simultaneously evaluate multiple outlier measures across windows of tuning parameter values thereby simplifying outlier detection and providing improved detection. Presented in this paper is SRD to detect multiple outliers despite the effects of masking and swamping. Both spectral (x-outlier) and analyte (y-outlier) outliers can be detected separately or in tandem with SRD using respective merits. Unique to SRD are fusion verification processes to confirm samples flagged as outliers. The SRD process also allows for sample masking checks. Presented, and used by SRD, are several new outlier detection measures. These measures include atypical uses of Procrustes analysis and extended inverted signal correction (EISC). The methodologies are demonstrated on two near-infrared (NIR) data sets.

Entities:  

Year:  2017        PMID: 28367620     DOI: 10.1021/acs.analchem.7b00637

Source DB:  PubMed          Journal:  Anal Chem        ISSN: 0003-2700            Impact factor:   6.986


  2 in total

Review 1.  Chemometric analysis in Raman spectroscopy from experimental design to machine learning-based modeling.

Authors:  Shuxia Guo; Jürgen Popp; Thomas Bocklitz
Journal:  Nat Protoc       Date:  2021-11-05       Impact factor: 13.491

2.  Apportionment and districting by Sum of Ranking Differences.

Authors:  Balázs R Sziklai; Károly Héberger
Journal:  PLoS One       Date:  2020-03-23       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.