Literature DB >> 30149700

Study of Data Set Modelability: Modelability, Rivality, and Weighted Modelability Indexes.

Irene Luque Ruiz1, Miguel Ángel Gómez-Nieto1.   

Abstract

The knowledge of the capacity of a data set to be modeled in the first stages of the building of quantitative structure-activity relationship (QSAR) prediction models is an important issue because it might reduce the effort and time necessary to select or reject data sets and in refining the data set's composition. The modelability index (MODI) is based on the counting of the first nearest neighbor belonging to the molecules of the data set and is a standardized measurement assumed in the QSAR community. In this paper, we revisit the calculation of the modelability index, proposing a more formal formulation that extends the calculation to the first nearest neighbors that belong to each existing class in the data set. In addition, this new formulation allows the calculation of the rivality index, as a measurement of the presence of correctly classifiable molecules and activity cliffs. By weighting the rivality index considering the cardinality of the neighborhood of each molecule of the data set, the calculated weighted modelability index is highly correlated with the correct classification rate (QSAR_CCR) obtained in the building of QSAR models using different classification algorithms. The results obtained with the weighted modelability index show correlations of r2 higher than 0.9, slopes close to 1, and bias close to zero for different algorithms.

Mesh:

Year:  2018        PMID: 30149700     DOI: 10.1021/acs.jcim.8b00188

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  2 in total

1.  Study of the Applicability Domain of the QSAR Classification Models by Means of the Rivality and Modelability Indexes.

Authors:  Irene Luque Ruiz; Miguel Ángel Gómez-Nieto
Journal:  Molecules       Date:  2018-10-24       Impact factor: 4.411

2.  Predictive Capability of QSAR Models Based on the CompTox Zebrafish Embryo Assays: An Imbalanced Classification Problem.

Authors:  Mario Lovrić; Olga Malev; Göran Klobučar; Roman Kern; Jay J Liu; Bono Lučić
Journal:  Molecules       Date:  2021-03-15       Impact factor: 4.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.