Literature DB >> 19530115

A new strategy of outlier detection for QSAR/QSPR.

Dong-Sheng Cao1, Yi-Zeng Liang, Qing-Song Xu, Hong-Dong Li, Xian Chen.   

Abstract

The crucial step of building a high performance QSAR/QSPR model is the detection of outliers in the model. Detecting outliers in a multivariate point cloud is not trivial, especially when several outliers coexist in the model. The classical identification methods do not always identify them, because they are based on the sample mean and covariance matrix influenced by the outliers. Moreover, existing methods only lay stress on some type of outliers but not all the outliers. To avoid these problems and detect all kinds of outliers simultaneously, we provide a new strategy based on Monte-Carlo cross-validation, which was termed as the MC method. The MC method inherently provides a feasible way to detect different kinds of outliers by establishment of many cross-predictive models. With the help of the distribution of predictive residuals such obtained, it seems to be able to reduce the risk caused by the masking effect. In addition, a new display is proposed, in which the absolute values of mean value of predictive residuals are plotted versus standard deviations of predictive residuals. The plot divides the data into normal samples, y direction outliers and X direction outliers. Several examples are used to demonstrate the detection ability of MC method through the comparison of different diagnostic methods. Copyright 2009 Wiley Periodicals, Inc.

Entities:  

Mesh:

Year:  2010        PMID: 19530115     DOI: 10.1002/jcc.21351

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.376


  8 in total

1.  Toward better QSAR/QSPR modeling: simultaneous outlier detection and variable selection using distribution of model features.

Authors:  Dongsheng Cao; Yizeng Liang; Qingsong Xu; Yifeng Yun; Hongdong Li
Journal:  J Comput Aided Mol Des       Date:  2010-11-13       Impact factor: 3.686

2.  From By-Products to Fertilizer: Chemical Characterization Using UPLC-QToF-MS via Suspect and Non-Target Screening Strategies.

Authors:  Anthi Panara; Evagelos Gikas; Nikolaos S Thomaidis
Journal:  Molecules       Date:  2022-05-29       Impact factor: 4.927

3.  Application of metabolomics in traditional chinese medicine differentiation of deficiency and excess syndromes in patients with diabetes mellitus.

Authors:  Tao Wu; Ming Yang; Hua-Feng Wei; Song-Hua He; Shun-Chun Wang; Guang Ji
Journal:  Evid Based Complement Alternat Med       Date:  2012-06-13       Impact factor: 2.629

4.  QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network.

Authors:  Yukun Wang; Xuebo Chen
Journal:  RSC Adv       Date:  2020-11-26       Impact factor: 4.036

5.  Ensemble machine learning to evaluate the in vivo acute oral toxicity and in vitro human acetylcholinesterase inhibitory activity of organophosphates.

Authors:  Liangliang Wang; Junjie Ding; Peichang Shi; Li Fu; Li Pan; Jiahao Tian; Dongsheng Cao; Hui Jiang; Xiaoqin Ding
Journal:  Arch Toxicol       Date:  2021-05-01       Impact factor: 5.153

6.  QSBR study of bitter taste of peptides: application of GA-PLS in combination with MLR, SVM, and ANN approaches.

Authors:  Somaieh Soltani; Hossein Haghaei; Ali Shayanfar; Javad Vallipour; Karim Asadpour Zeynali; Abolghasem Jouyban
Journal:  Biomed Res Int       Date:  2013-11-25       Impact factor: 3.411

7.  GC-MS Fingerprinting Combined with Chemometric Methods Reveals Key Bioactive Components in Acori Tatarinowii Rhizoma.

Authors:  Wenbin Liu; Bingyang Zhang; Zhongquan Xin; Dabing Ren; Lunzhao Yi
Journal:  Int J Mol Sci       Date:  2017-07-03       Impact factor: 5.923

8.  Metabolomic Profiles Reveal Potential Factors that Correlate with Lactation Performance in Sow Milk.

Authors:  Chengquan Tan; Zhenya Zhai; Xiaojun Ni; Hao Wang; Yongcheng Ji; Tianyue Tang; Wenkai Ren; Hongrong Long; Baichuan Deng; Jinping Deng; Yulong Yin
Journal:  Sci Rep       Date:  2018-07-16       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.