| Literature DB >> 28527154 |
Sankalp Jain1, Melanie Grandits1, Lars Richter1, Gerhard F Ecker2.
Abstract
The bile salt export pump (BSEP) actively transports conjugated monovalent bile acids from the hepatocytes into the bile. This facilitates the formation of micelles and promotes digestion and absorption of dietary fat. Inhibition of BSEP leads to decreased bile flow and accumulation of cytotoxic bile salts in the liver. A number of compounds have been identified to interact with BSEP, which results in drug-induced cholestasis or liver injury. Therefore, in silico approaches for flagging compounds as potential BSEP inhibitors would be of high value in the early stage of the drug discovery pipeline. Up to now, due to the lack of a high-resolution X-ray structure of BSEP, in silico based identification of BSEP inhibitors focused on ligand-based approaches. In this study, we provide a homology model for BSEP, developed using the corrected mouse P-glycoprotein structure (PDB ID: 4M1M). Subsequently, the model was used for docking-based classification of a set of 1212 compounds (405 BSEP inhibitors, 807 non-inhibitors). Using the scoring function ChemScore, a prediction accuracy of 81% on the training set and 73% on two external test sets could be obtained. In addition, the applicability domain of the models was assessed based on Euclidean distance. Further, analysis of the protein-ligand interaction fingerprints revealed certain functional group-amino acid residue interactions that could play a key role for ligand binding. Though ligand-based models, due to their high speed and accuracy, remain the method of choice for classification of BSEP inhibitors, structure-assisted docking models demonstrate reasonably good prediction accuracies while additionally providing information about putative protein-ligand interactions.Entities:
Keywords: BSEP; Classification model; Drug-induced cholestasis; Inhibiton; Structure-based classification; Transporters
Mesh:
Substances:
Year: 2017 PMID: 28527154 PMCID: PMC5487762 DOI: 10.1007/s10822-017-0021-x
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 4.179
Fig. 1CNS representation of the training set compounds based on MACCS Tc similarity threshold of 0.70. Communities with at least five representative members are color coded
Fig. 2Homology model structure of human BSEP in the inward-facing state. a Front view of the transporter. b Side view after a 90° rotation. c Top view from the extracellular space
Fig. 3Secondary structure of the protein over the simulation time
Fig. 4Distribution of BSEP inhibitors and non-inhibitors (training set) based on ChemScore scoring. Sensitivity, specificity, precision and MCC were calculated from the confusion matrix based on the intersection point of both curves
Models obtained from different scoring functions based on the training set
| Scoring function | Intersection point | AUC | Sensitivity | Specificity | Accuracy | G-mean | MCC |
|---|---|---|---|---|---|---|---|
| ChemScore | 29.50 | 0.87 | 0.60 | 0.88 | 0.81 | 0.73 | 0.50 |
| GoldScore | 53.50 | 0.82 | 0.74 | 0.75 | 0.75 | 0.74 | 0.45 |
| GlideXP | −6.80 | 0.77 | 0.80 | 0.65 | 0.69 | 0.72 | 0.39 |
| Xscore (ChemScore) | 6.15 | 0.92 | 0.71 | 0.95 | 0.88 | 0.82 | 0.69 |
| Xscore (GoldScore) | 6.10 | 0.93 | 0.68 | 0.95 | 0.88 | 0.80 | 0.68 |
The scoring function in brackets were used to generate the docking poses
Fig. 5a Hydrophobic interaction, b hydrogen bond interaction fingerprints of true positives (TP) and true negatives (TN) of the training set. The classification of the compounds is based on the ChemScore scoring function
Fig. 6Distribution of BSEP inhibitors and non-inhibitors based on the a molecular weight, b logP(o/w) of the training set
Models based on physicochemical properties
| Molecular property | Intersection point | Sensitivity | Specificity | Accuracy | G-mean | MCC |
|---|---|---|---|---|---|---|
| Molecular weight | 390 | 0.76 | 0.80 | 0.79 | 0.78 | 0.54 |
| logP | 3.6 | 0.57 | 0.87 | 0.77 | 0.71 | 0.47 |
Fig. 7Distribution of functional groups in the training dataset
Fig. 8a Heat map illustrating the PLIF analysis of the training set inhibitors (x-axis contact residues; y-axis functional groups of the ligand showing an interaction with the residue; color scale number of interacting ligands). b Docking pose of Glimepiride (yellow) in which its carbonyl groups interact with the residues Tyr337, Tyr772 and Asn996
Ligand-based and structure-based classification
| Model type | TP | TN | FP | FN | Sensitivity | Specificity | Accuracy | MCC | Precision |
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| SBC_C | 27 | 91 | 22 | 12 | 0.69 | 0.81 | 0.78 | 0.47 | 0.55 |
| SBC_G | 26 | 79 | 34 | 13 | 0.67 | 0.70 | 0.69 | 0.33 | 0.43 |
| SBC_C_X | 27 | 96 | 17 | 12 | 0.69 | 0.85 | 0.81 | 0.52 | 0.61 |
| LBC + SBC_C | 24 | 107 | 6 | 15 | 0.62 | 0.95 | 0.86 | 0.62 | 0.80 |
|
|
|
|
|
|
|
|
|
|
|
| Consensus | 27 | 106 | 7 | 12 | 0.69 | 0.94 | 0.88 | 0.66 | 0.79 |
The best model of the combined approach is highlighted in bold as well as the ligand-based classification
TP true positives, TN true negatives, FP false positives, FN false negatives, LBC Ligand-based classification (Montanari et al. [25]), SBC_C Structure-based classification using ChemScore scoring function, SBC_G Structure-based classification using GoldScore scoring function, SBC_C_X Structure-based classification using ChemScore scoring function (rescoring using Xscore). Consensus Combination of LBC, SBC_C and SBC_C_X