| Literature DB >> 31172382 |
Angela Serra1, Serli Önlü1,2, Pietro Coretto3, Dario Greco4,5,6.
Abstract
BACKGROUND: Traditional quantitative structure-activity relationship models usually neglect the molecular alterations happening in the exposed systems (the mechanism of action, MOA), that mediate between structural properties of compounds and phenotypic effects of an exposure.Entities:
Keywords: Human serum albumin binding; Integrative analysis; Lasso; MOA; Molecular descriptors; QSAR; QSMARt; Regression; Safe-by-design
Year: 2019 PMID: 31172382 PMCID: PMC6551915 DOI: 10.1186/s13321-019-0359-2
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1Predicted versus experimental values of training set (black) and test set (red) chemicals
Fig. 2Standardized residuals versus leverage values of training set (black) and test set (red) chemicals (Williams plot). Dashed lines indicate interval. Vertical line set at the warning leverage (critical hat value, )
Fig. 3Correlation graph of the six MDs/MOA features of the QSMARt model. Vertex color represent the sign of the associated beta value while edge colors show the sign of the correlation of the features across the X dataset
Fig. 4Predicted by Eq. 3 versus leverage values of training set (black), test set (red), and external set (green) chemicals (Insubria graph). Dashed lines indicate the model prediction range. Vertical line set at the warning leverage (critical hat value, )
Fig. 5TSNE projection of the drugs in the albumin and external dataset. The projection was performed by using the set of genes and MDs (a), only the genes (b) and only the MDs (c) in the optimal hybrid model. The outliers are in the border area of the dataset for the molecular descriptors (c), while they are similar to the rest of the external set fort the gene log-fold change (b). Likewise, the outliers still appear on the border for the combined two sets of features (a). In panel (d) the values of the three MDs is plotted (y axis) for the drugs in the albumin and external dataset (x axis). The drugs are ordered based on their predicted value. Drugs from the external set that falls in the model prediction range are marked in gray, while the ones that are outside the range are marked in blue. Drugs in the training set are marked in black while drugs in the test set are marked in red