Chemoinformatic Classification Methods and their Applicability Domain.

Miriam Mathea, Waldemar Klingspohn, Knut Baumann.

Abstract

Classification rules are often used in chemoinformatics to predict categorical properties of drug candidates related to bioactivity from explanatory variables that encode the respective molecular structures (i.e., molecular descriptors). To avoid predictions with an unduly large error probability, the domain the classifier is applied to should be restricted to the domain covered by the training set objects. This latter domain is commonly referred to as the applicability domain in chemoinformatics. Conceptually, the applicability domain defines the region in space where the "normal" objects are located. Defining the border of the applicability domain may then be viewed as detecting anomalous or novel objects, or as detecting outliers. Currently, two different types of measures are in use. The first defines the applicability domain solely in terms of the molecular descriptor space, which is referred to as novelty detection. The second defines the applicability domain in terms of the expected reliability of the predictions, which is referred to as confidence estimation. Both types are systematically differentiated here and the most popular measures are reviewed. It will be shown that all common chemoinformatic classifiers have built-in confidence scores. Since confidence estimation uses information of the class labels for computing the confidence scores, it is expected to be more efficient in reducing the error rate than novelty detection, which solely uses the information of the explanatory variables.
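The distinction between the two types of measures can be illustrated with a minimal sketch. It is an assumption-laden toy example, not one of the measures reviewed in the article: the novelty score is taken to be the mean Euclidean distance to the k nearest training objects (descriptors only), and the confidence score is taken to be the winning vote fraction of a k-nearest-neighbour classifier (which needs the class labels). The function names, the two-descriptor data, and the class labels are hypothetical.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def k_nearest(query, train_X, k):
    """Return (distance, index) pairs of the k training objects closest to query."""
    dists = sorted((euclidean(query, x), i) for i, x in enumerate(train_X))
    return dists[:k]

def novelty_score(query, train_X, k=3):
    """Novelty detection (assumed measure): mean distance to the k nearest
    training objects. Uses the descriptor space only, never the class labels."""
    neighbours = k_nearest(query, train_X, k)
    return sum(d for d, _ in neighbours) / len(neighbours)

def knn_predict_with_confidence(query, train_X, train_y, k=3):
    """Confidence estimation (assumed measure): majority-vote kNN prediction
    plus the fraction of neighbours that voted for the predicted class."""
    neighbours = k_nearest(query, train_X, k)
    votes = {}
    for _, i in neighbours:
        votes[train_y[i]] = votes.get(train_y[i], 0) + 1
    label = max(votes, key=votes.get)
    return label, votes[label] / len(neighbours)

# Hypothetical training set: two descriptors per object, two classes.
train_X = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (4.0, 4.0), (4.0, 5.0), (5.0, 4.0)]
train_y = ["inactive", "inactive", "inactive", "active", "active", "active"]

inside = (0.5, 0.5)      # within the "inactive" cluster
between = (2.0, 2.5)     # between the two clusters
far_away = (20.0, 20.0)  # far from every training object

for q in (inside, between, far_away):
    label, conf = knn_predict_with_confidence(q, train_X, train_y)
    print(q, label, round(conf, 2), round(novelty_score(q, train_X), 2))
```

In this toy setup the two scores respond to different things: the novelty score grows with the distance from the training objects, while the vote fraction drops where neighbours of both classes are mixed. A threshold on either score would then define the border of the applicability domain; how such a threshold is chosen is outside the scope of this sketch, and the example is not meant to show which type of measure performs better.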

© 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the Creative Commons Attribution Non-Commercial NoDerivs License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.

Keywords: Applicability Domain; Confidence Estimation; Novelty Detection; Prediction Error; Validation

Year: 2016 PMID: 27492083 DOI: 10.1002/minf.201501019

Source DB: PubMed Journal: Mol Inform ISSN: 1868-1743 Impact factor: 3.353

Cited by

22 in total

1. Validation strategies for target prediction methods.

Authors: Neann Mathai; Ya Chen; Johannes Kirchmair
Journal: Brief Bioinform Date: 2020-05-21 Impact factor: 11.622

2. Lipophilicity prediction of peptides and peptide derivatives by consensus machine learning.

Authors: Jens-Alexander Fuchs; Francesca Grisoni; Michael Kossenjans; Jan A Hiss; Gisbert Schneider
Journal: Medchemcomm Date: 2018-08-22 Impact factor: 3.597

3. Novel Development of Predictive Feature Fingerprints to Identify Chemistry-Based Features for the Effective Drug Design of SARS-CoV-2 Target Antagonists and Inhibitors Using Machine Learning.

Authors: Kelvin Cooper; Christopher Baddeley; Bernie French; Katherine Gibson; James Golden; Thiam Lee; Sadrach Pierre; Brent Weiss; Jason Yang
Journal: ACS Omega Date: 2021-02-05

4. Automating drug discovery. (Review)

Authors: Gisbert Schneider
Journal: Nat Rev Drug Discov Date: 2017-12-15 Impact factor: 84.694

5. Assessment of tautomer distribution using the condensed reaction graph approach.

Authors: T R Gimadiev; T I Madzhidov; R I Nugmanov; I I Baskin; I S Antipin; A Varnek
Journal: J Comput Aided Mol Des Date: 2018-01-29 Impact factor: 3.686

6. Chemical toxicity prediction for major classes of industrial chemicals: Is it possible to develop universal models covering cosmetics, drugs, and pesticides?

Authors: Vinicius M Alves; Eugene N Muratov; Alexey Zakharov; Nail N Muratov; Carolina H Andrade; Alexander Tropsha
Journal: Food Chem Toxicol Date: 2017-04-12 Impact factor: 6.023

7. In silico toxicology: From structure-activity relationships towards deep learning and adverse outcome pathways. (Review)

Authors: Jennifer Hemmerich; Gerhard F Ecker
Journal: Wiley Interdiscip Rev Comput Mol Sci Date: 2020-03-31

8. Efficiency of different measures for defining the applicability domain of classification models.

Authors: Waldemar Klingspohn; Miriam Mathea; Antonius Ter Laak; Nikolaus Heinrich; Knut Baumann
Journal: J Cheminform Date: 2017-08-03 Impact factor: 5.514

9. QSAR models of human data can enrich or replace LLNA testing for human skin sensitization.

Authors: Vinicius M Alves; Stephen J Capuzzi; Eugene Muratov; Rodolpho C Braga; Thomas Thornton; Denis Fourches; Judy Strickland; Nicole Kleinstreuer; Carolina H Andrade; Alexander Tropsha
Journal: Green Chem Date: 2016-10-06 Impact factor: 10.182

10. Schistosomiasis Drug Discovery in the Era of Automation and Artificial Intelligence. (Review)

Authors: José T Moreira-Filho; Arthur C Silva; Rafael F Dantas; Barbara F Gomes; Lauro R Souza Neto; Jose Brandao-Neto; Raymond J Owens; Nicholas Furnham; Bruno J Neves; Floriano P Silva-Junior; Carolina H Andrade
Journal: Front Immunol Date: 2021-05-31 Impact factor: 7.561