Literature DB >> 22900941

Note on naive Bayes based on binary descriptors in cheminformatics.

Joe A Townsend1, Robert C Glen, Hamse Y Mussa.   

Abstract

A plethora of articles on naive Bayes classifiers, where the chemical compounds to be classified are represented by binary-valued (absent or present type) descriptors, have appeared in the cheminformatics literature over the past decade. The principal goal of this paper is to describe how a naive Bayes classifier based on binary descriptors (NBCBBD) can be employed as a feature selector in an efficient manner suitable for cheminformatics. In the process, we point out a fact well documented in other disciplines that NBCBBD is a linear classifier and is therefore intrinsically suboptimal for classifying compounds that are nonlinearly separable in their binary descriptor space. We investigate the performance of the proposed algorithm on classifying a subset of the MDDR data set, a standard molecular benchmark data set, into active and inactive compounds.

Mesh:

Substances:

Year:  2012        PMID: 22900941     DOI: 10.1021/ci200303m

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  3 in total

1.  ChemStable: a web server for rule-embedded naïve Bayesian learning approach to predict compound stability.

Authors:  Zhihong Liu; Minghao Zheng; Xin Yan; Qiong Gu; Johann Gasteiger; Johan Tijhuis; Peter Maas; Jiabo Li; Jun Xu
Journal:  J Comput Aided Mol Des       Date:  2014-07-17       Impact factor: 3.686

2.  Verifying the fully "Laplacianised" posterior Naïve Bayesian approach and more.

Authors:  Hamse Y Mussa; David Marcus; John B O Mitchell; Robert C Glen
Journal:  J Cheminform       Date:  2015-06-12       Impact factor: 5.514

3.  Full "Laplacianised" posterior naive Bayesian algorithm.

Authors:  Hamse Y Mussa; John Bo Mitchell; Robert C Glen
Journal:  J Cheminform       Date:  2013-08-23       Impact factor: 5.514

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.