| Literature DB >> 26628925 |
Hamse Y Mussa1, John B O Mitchell2, Robert C Glen3.
Abstract
It is common in cheminformatics to represent the properties of a ligand as a string of 1's and 0's, with the intention of elucidating, inter alia, the relationship between the chemical structure of a ligand and its bioactivity. In this commentary we note that, where relevant but non-redundant features are binary, they inevitably lead to a classifier capable of capturing only a linear relationship between structural features and activity. If, instead, we were to use relevant but non-redundant real-valued features, the resulting predictive model would be capable of describing a non-linear structure-activity relationship. Hence, we suggest that real-valued features, where available, are to be preferred in this scenario.Entities:
Keywords: Bernoulli distribution; Binary descriptors; Ligand chemical structure; Linear relationship
Year: 2015 PMID: 26628925 PMCID: PMC4665894 DOI: 10.1186/s13321-015-0105-3
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514