| Literature DB >> 28185036 |
Dawid Warszycki1, Marek Śmieja2, Rafał Kafel3.
Abstract
The Average Information Content Maximization algorithm (AIC-MAX) based on mutual information maximization was recently introduced to select the most discriminatory features. Here, this methodology was applied to select the most significant bits from the Klekota-Roth fingerprint for serotonin receptors ligands as well as to select the most important features for distinguishing ligands with activity for one receptor versus another. The interpretation of selected bits and machine-learning experiments performed using the reduced interpretations outperformed the raw fingerprints and indicated the most important structural features of the analyzed ligands in terms of activity and selectivity. Moreover, the AIC-MAX methodology applied here for serotonin receptor ligands can also be applied to other target classes.Entities:
Keywords: Fingerprint reduction; Fingerprints; Machine learning; Selectivity studies; Serotonin receptors; Virtual screening
Mesh:
Substances:
Year: 2017 PMID: 28185036 PMCID: PMC5438429 DOI: 10.1007/s11030-017-9729-8
Source DB: PubMed Journal: Mol Divers ISSN: 1381-1991 Impact factor: 2.943
Number of active and inactive compounds for serotonin receptors retrieved from the ChEMBL database
| Receptor | Active | Inactive |
|---|---|---|
| ( | ( | |
| 5- | 4427 | 1230 |
| 5- | 731 | 577 |
| 5- | 877 | 236 |
| 5- | 84 | 28 |
| 5- | 2060 | 1081 |
| 5- | 428 | 341 |
| 5- | 1303 | 1050 |
| 5- | 291 | 248 |
| 5- | 382 | 153 |
| 5- | 69 | 146 |
| 5- | 1626 | 426 |
| 5- | 896 | 415 |
Fig. 1One hundred of the most informative KRFP bits (shown as black squares) selected using the AIC-MAX algorithm for each serotonin receptor. The most significant common bits are marked: blue—polarizable nitrogen atoms, green—aromatic systems, red—amide moiety. Two highly specific fragments that are typical of individual receptors are shown in orange circles (phenylsulfonylamide for 5- and o-metoxyphenyl for 5-). (Color figure online)
Fig. 2One hundred (per one ‘off-target’) of the most informative bits (shown as black squares) from KRFP selected using the AIC-MAX algorithm for the 5- receptor to discriminate its ligands from compounds that act on different serotonin receptors. The most significant common bits are marked: blue—polarizable nitrogen atoms, green—aromatic systems. (Color figure online)
Fig. 3Comparison between Mathews Correlation Coefficients values obtained in random forest experiments for raw (white background in panel a) and reduced fingerprints (grey background in panel a). Panel b shows when the reduced representation outperformed in conducted experiments the raw one ‘+’, vice versa ‘–’ or no changes ‘nc’. (Color figure online)