Literature DB >> 27492083

Chemoinformatic Classification Methods and their Applicability Domain.

Miriam Mathea1, Waldemar Klingspohn1, Knut Baumann2.   

Abstract

Classification rules are often used in chemoinformatics to predict categorical properties of drug candidates related to bioactivity from explanatory variables, which encode the respective molecular structures (i.e. molecular descriptors). To avoid predictions with an unduly large error probability, the domain the classifier is applied to should be restricted to the domain covered by the training set objects. This latter domain is commonly referred to as applicability domain in chemoinformatics. Conceptually, the applicability domain defines the region in space where the "normal" objects are located. Defining the border of the applicability domain may then be viewed as detecting anomalous or novel objects or as detecting outliers. Currently two different types of measures are in use. The first one defines the applicability domain solely in terms of the molecular descriptor space, which is referred to as novelty detection. The second type defines the applicability domain in terms of the expected reliability of the predictions which is referred to as confidence estimation. Both types are systematically differentiated here and the most popular measures are reviewed. It will be shown that all common chemoinformatic classifiers have built-in confidence scores. Since confidence estimation uses information of the class labels for computing the confidence scores, it is expected to be more efficient in reducing the error rate than novelty detection, which solely uses the information of the explanatory variables.
© 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the Creative Commons Attribution Non-Commercial NoDerivs License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.

Entities:  

Keywords:  Applicability Domain; Confidence Estimation; Novelty Detection; Prediction Error; Validation

Mesh:

Substances:

Year:  2016        PMID: 27492083     DOI: 10.1002/minf.201501019

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  22 in total

1.  Validation strategies for target prediction methods.

Authors:  Neann Mathai; Ya Chen; Johannes Kirchmair
Journal:  Brief Bioinform       Date:  2020-05-21       Impact factor: 11.622

2.  Lipophilicity prediction of peptides and peptide derivatives by consensus machine learning.

Authors:  Jens-Alexander Fuchs; Francesca Grisoni; Michael Kossenjans; Jan A Hiss; Gisbert Schneider
Journal:  Medchemcomm       Date:  2018-08-22       Impact factor: 3.597

3.  Novel Development of Predictive Feature Fingerprints to Identify Chemistry-Based Features for the Effective Drug Design of SARS-CoV-2 Target Antagonists and Inhibitors Using Machine Learning.

Authors:  Kelvin Cooper; Christopher Baddeley; Bernie French; Katherine Gibson; James Golden; Thiam Lee; Sadrach Pierre; Brent Weiss; Jason Yang
Journal:  ACS Omega       Date:  2021-02-05

Review 4.  Automating drug discovery.

Authors:  Gisbert Schneider
Journal:  Nat Rev Drug Discov       Date:  2017-12-15       Impact factor: 84.694

5.  Assessment of tautomer distribution using the condensed reaction graph approach.

Authors:  T R Gimadiev; T I Madzhidov; R I Nugmanov; I I Baskin; I S Antipin; A Varnek
Journal:  J Comput Aided Mol Des       Date:  2018-01-29       Impact factor: 3.686

6.  Chemical toxicity prediction for major classes of industrial chemicals: Is it possible to develop universal models covering cosmetics, drugs, and pesticides?

Authors:  Vinicius M Alves; Eugene N Muratov; Alexey Zakharov; Nail N Muratov; Carolina H Andrade; Alexander Tropsha
Journal:  Food Chem Toxicol       Date:  2017-04-12       Impact factor: 6.023

Review 7.  In silico toxicology: From structure-activity relationships towards deep learning and adverse outcome pathways.

Authors:  Jennifer Hemmerich; Gerhard F Ecker
Journal:  Wiley Interdiscip Rev Comput Mol Sci       Date:  2020-03-31

8.  Efficiency of different measures for defining the applicability domain of classification models.

Authors:  Waldemar Klingspohn; Miriam Mathea; Antonius Ter Laak; Nikolaus Heinrich; Knut Baumann
Journal:  J Cheminform       Date:  2017-08-03       Impact factor: 5.514

9.  QSAR models of human data can enrich or replace LLNA testing for human skin sensitization.

Authors:  Vinicius M Alves; Stephen J Capuzzi; Eugene Muratov; Rodolpho C Braga; Thomas Thornton; Denis Fourches; Judy Strickland; Nicole Kleinstreuer; Carolina H Andrade; Alexander Tropsha
Journal:  Green Chem       Date:  2016-10-06       Impact factor: 10.182

Review 10.  Schistosomiasis Drug Discovery in the Era of Automation and Artificial Intelligence.

Authors:  José T Moreira-Filho; Arthur C Silva; Rafael F Dantas; Barbara F Gomes; Lauro R Souza Neto; Jose Brandao-Neto; Raymond J Owens; Nicholas Furnham; Bruno J Neves; Floriano P Silva-Junior; Carolina H Andrade
Journal:  Front Immunol       Date:  2021-05-31       Impact factor: 7.561

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.