Literature DB >> 35882865

A curated binary pattern multitarget dataset of focused ATP-binding cassette transporter inhibitors.

Sven Marcel Stefan1,2,3,4, Patric Jan Jansson3,5, Jens Pahnke1,4,6,7, Vigneshwaran Namasivayam8,9.   

Abstract

Multitarget datasets that correlate bioactivity landscapes of small-molecules toward different related or unrelated pharmacological targets are crucial for novel drug design and discovery. ATP-binding cassette (ABC) transporters are critical membrane-bound transport proteins that impact drug and metabolite distribution in human disease as well as disease diagnosis and therapy. Molecular-structural patterns are of the highest importance for the drug discovery process as demonstrated by the novel drug discovery tool 'computer-aided pattern analysis' ('C@PA'). Here, we report a multitarget dataset of 1,167 ABC transporter inhibitors analyzed for 604 molecular substructures in a statistical binary pattern distribution scheme. This binary pattern multitarget dataset (ABC_BPMDS) can be utilized for various areas. These areas include the intended design of (i) polypharmacological agents, (ii) highly potent and selective ABC transporter-targeting agents, but also (iii) agents that avoid clearance by the focused ABC transporters [e.g., at the blood-brain barrier (BBB)]. The information provided will not only facilitate novel drug prediction and discovery of ABC transporter-targeting agents, but also drug design in general in terms of pharmacokinetics and pharmacodynamics.
© 2022. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35882865      PMCID: PMC9325750          DOI: 10.1038/s41597-022-01506-z

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   8.501


Background & Summary

The superfamily of ABC transporters is of highest importance in terms of novel drug discovery, design, and development. ABC transporters are ubiquitously present in the human body[1-4], and their (co-)expression has broad implications in human diseases. These diseases include prevalent [e.g., Alzheimer’s disease (AD)[5,6], atherosclerosis[7], or cancer[1,3,6,8]] and orphan [e.g., Tangier disease (ABCA1)[9], Stargardt’s disease (ABCA4)[10], harlequin ichthyosis (ABCA12)[11], pseudoxanthoma elasticum (ABCC6)[12], or adrenoleukodystrophy (ABCD1)[13]] pathological conditions. Together with tight-junction proteins, these membrane-bound efflux pumps are the backbone of systemic barrier formation[14,15]. Their localization at blood-tissue barriers impacts metabolite distribution and drug delivery, and hence, disease progress, treatment, and therapy[15-19]. Determinants that establish a correlation between the molecular structure of a small-molecule (drug) and its interaction with ABC transporters is key for the development of novel, safe, systemically applicable, and target-oriented (selective) drugs. These determinants include descriptors that conserve certain physicochemical features of the small-molecules of interest, such as the calculated octanol-water partition coefficient (CLogP), molecular weight (MW), molar refractivity (MR), or topological polar surface area (TPSA), but also the number of hydrogen bond (H-bond) donors, H-bond acceptors, or rotatable bonds[5]. Other than that, more complex attributes can be summarized in fingerprints that represent certain molecular features of the small-molecule in a binary code (e.g., feature-, path-, and radial-fingerprints[20-22]). Unfortunately, comprehensive binary datasets do not exist for ABC transporters. However, the knowledge about such binary fingerprints could facilitate the development of (i) drugs that avoid clearance mediated by ABC transporters [e.g., targeting the BBB to treat central nervous system-(CNS)-related diseases[23]]; (ii) agents targeting ABC transporters to study their expression and/or function with state-of-the-art imaging techniques [e.g., by positron emission tomography (PET)[16]]; (iii) drugs that selectively target well-studied ABC transporters in human diseases (e.g., cancer[1,3,4,6,8]); (iv) broad-spectrum drugs that target several ABC transporters to ameliorate/cure an ABC transporter-associated pathological condition[24]; (v) polypharmacological agents to target and study particularly less- and under-studied ABC transporters by a multitargeting approach[7,25-27]; or (vi) combined/extended fingerprints to create high-quality compound collections that would provide a starting point of polypharmacology-focused virtual screenings[7]. In the present work, we combined the concepts of the multitarget dataset[7,27] and the binary distribution of substructures[7]. The latest version of the multitarget dataset contains 1,167 compounds that were evaluated against the well-studied ABC transporters ABCB1, ABCC1, and ABCG2. A large substructure catalog was created, containing in total 604 active (= present) substructures within these 1,167 compounds of the updated multitarget dataset. The new binary pattern multitarget dataset (ABC_BPMDS) is freely available under the http://www.zenodo.org[28] URL as well as the http://www.panabc.info website, and its use is free of charge.

Methods

The generation of the ABC_BPMDS was a four-step process: (i) deep literature search including the selection of qualified reports, resulting in the exquisite compilation of the original multitarget dataset as reported earlier[27] [including updates in our former[7] and the present work (see below)]; (ii) manual curation of the given data, in particular: (a) calculation of bioactivity values for estimated bioactivity data and data determination, (b) unification and harmonization of bioactivity data, as well as (c) comparison, curation, and harmonization of molecular-structural data (SMILES codes); (iii) generation of a substructure catalog, in particular: (a) visual inspection of the 1,167 molecules of the updated multitarget dataset, (b) extraction of partial structures, (c) creation and extension of substitution patterns, as well as (d) screening of the multitarget dataset for these substructures, discovering 604 active substructures; and (iv) individual pattern analysis[7] for uncovering the statistical distribution of these 604 active substructures amongst the 1,167 compounds of the multitarget dataset. The following sections will provide a detailed description on how the final ABC_BPMDS was assembled. Figure 1 provides an overview of the taken steps.
Fig. 1

Depiction of the main workflow of assemble and validation as reported earlier in our preliminary work[27], as well as the main steps of data extension and curation as part of the current work to generate the ABC_BPMDS. This graphic was created with BioRender.com (https://biorender.com).

Depiction of the main workflow of assemble and validation as reported earlier in our preliminary work[27], as well as the main steps of data extension and curation as part of the current work to generate the ABC_BPMDS. This graphic was created with BioRender.com (https://biorender.com).

Literature Collection of the Original Dataset

Qualified Reports

A deep literature search was the first step to compile the original multitarget dataset, which has been reported in detail before[7,27]. The National Center for Biotechnological Information (NCBI; https://www.ncbi.nlm.nih.gov)[29] was used to search for qualified reports applying the keywords (i) ‘ABCB1’, (ii) ‘ABCC1’, (iii) ‘ABCG2’, (iv) ‘P-gp’, (v) ‘MRP1’, and (vi) ‘BCRP’. The keywords were used in all possible combinations to extract the maximal yield in reports. In addition to the genuine database search, the reference sections of the found reports were searched for potential additional literature to extract further qualified information.

Compounds

Compounds were considered only if they had been evaluated against all three focused targets, ABCB1, ABCC1, and ABCG2, including inactive compounds as well as selective, dual, and triple inhibitors. This information could be provided either in one single report (e.g., in case of the standard ABCG2 inhibitor Ko143[30]) or in several individual reports [e.g., in case of the standard ABCC1 inhibitor verlukast (MK571)[31-36]]. The molecular structures of qualified compounds were collected as SMILES codes. These were obtained either from (i) supplementary information of the respective report; (ii) the PubChem database (https://pubchem.ncbi.nlm.nih.gov)[37] [e.g., in case of known drugs and drug-like compounds, such as the standard inhibitors verapamil (ABCB1), cyclosporine A (ABCB1 and ABCC1), verlukast (ABCC1), or Ko143 (ABCG2)]; or (iii) manual drawing according to the 2D representations as outlined in the respective report using ChemDraw Pro version 20.1.1.125. Isomeric SMILES were considered where applicable. SMILES codes that encoded aromatic substructures with lower-case letters in certain reports[38,39] were unified according to the upper-case description scheme (structural curation)[7].

Assays

Only functional assays were considered using either fluorescence labeling or radionuclide detection applying either living (selected or transfected) cells or membrane vesicles with reconstituted transporters. ATPase assays were not considered because ATPase activity and transporter inhibition may not be directly connected to each other. MDR reversal assay data was not considered because of the complexity of the involved processes and the fact that the triggered response(s) may not only be caused by ABC transporter inhibition. Table 1 provides an exhaustive list of functional tracers (and substrates) that were used to assess the 1,167 compounds of the ABC_BPMDS against ABCB1, ABCC1, and ABCG2. Table 2 summarizes all used host systems (cell lines and membrane vesicles) used for the evaluation of the 1,167 compounds against ABCB1, ABCC1, and ABCG2.
Table 1

An exhaustive list of functional tracers that were used to functionally assess the 1,167 compounds of the ABC_BPMDS against ABCB1, ABCC1, and ABCG2[7,27].

The assessment of the corresponding transporter by the respective tracer is indicated by a black box. The provided references are examples in which details in terms of the stated assays can be found[7,27,47,55,84–101].

Table 2

An exhaustive list of transporter host systems that were used to functionally assess the 1,167 compounds of the ABC_BPMDS against the well-studied ABC transporters ABCB1, ABCC1, and ABCG2[7,27].

The assessment of the corresponding transporter by the respective host system is indicated by a black box. Regarding selected cells, the used cytotoxic agent is indicated under the respective transporter, and the cell subline abbreviation is given in brackets. The provided references are examples in which details in terms of the stated cell lines can be found[7,38,42,48,52,84,89,93,95,97–114].

An exhaustive list of functional tracers that were used to functionally assess the 1,167 compounds of the ABC_BPMDS against ABCB1, ABCC1, and ABCG2[7,27]. The assessment of the corresponding transporter by the respective tracer is indicated by a black box. The provided references are examples in which details in terms of the stated assays can be found[7,27,47,55,84-101]. An exhaustive list of transporter host systems that were used to functionally assess the 1,167 compounds of the ABC_BPMDS against the well-studied ABC transporters ABCB1, ABCC1, and ABCG2[7,27]. The assessment of the corresponding transporter by the respective host system is indicated by a black box. Regarding selected cells, the used cytotoxic agent is indicated under the respective transporter, and the cell subline abbreviation is given in brackets. The provided references are examples in which details in terms of the stated cell lines can be found[7,38,42,48,52,84,89,93,95,97-114].

Bioactivity

The bioactivities (IC50 values) of the compounds were extracted from either (i) tables of the respective reports (including supplementary information); or (ii) screening figures with relative inhibition (Irel) values (%) compared to a standard (Imax; 100%). In the latter case, the IC50 values were estimated (either span or >, ≥, <, ~) in the previous multitarget dataset[7,27].

Data Curation – Bioactivity Data

Dataset Update and Complementation

New reports particularly from 2021 and 2022 were taken into consideration to update the dataset with compounds that were evaluated against the three transporters ABCB1, ABCC1, and ABCG2. In total, 22 new compounds were included into the list of qualified compounds[7,40-42]. In addition, we focused an extended literature search, particularly of known standard inhibitors of ABCB1, ABCC1, and ABCG2 to obtain bioactivities with less mathematical uncertainty which also align well with our empirical experience in the laboratory. These compounds included verapamil (ABCB1[43]), cyclosporine A (ABCB1[41,43-46] and ABCC1[31,44-46]), verlukast (ABCC1[31-36]), and Ko143 (ABCG2[41,45]). As a side note, the additional literature search also resulted in an update of bioactivity data of the natural compound piperine[47]. In the curation process to complement bioactivity values, we found that two compounds were erroneously included into the dataset (apatinib[48] and ceritinib[49]). Both were not evaluated against ABCC1, and therefore, did not qualify for this dataset and were therefore removed.

Complementary Data Analysis

The bioactivity of several inhibitors could only be described as an estimation (either described as span, marked as ‘active’, or annotated with ‘>’, ‘≥’, ‘<’, ‘~’ in the previous dataset[7,27]). However, to allow for the use of the entire dataset in mathematical and computational operations, we sought to allocate defined bioactivity values to these compounds. Hence, the individual reports were analyzed and the given indications of bioactivity [e.g., screening figures, flow-cytometry histograms, or tables with bioactivity values other than IC50 values (e.g., percentages)] were taken into consideration for further data analysis. The specific bioactivity value (e.g., percentage inhibition) was extracted and correlated to the used compound concentration. By using GraphPad Prism version 8.4.0 applying the three-parameter logistic equation with a fixed Hill slope (=1.0), IC50 values were calculated and listed in the new multitarget dataset. A detailed curation protocol is provided on https://www.zenodo.org[50] as well as he http://www.panabc.info website, and the related GraphPad Prism file containing the concentration-effect curves can be accessed without restrictions. In total, the bioactivity data of 104, 77, and 73 ABCB1, ABCC1, and ABCG2 inhibitors, respectively, have been calculated and complemented.

Data Determination

The bioactivities of five compounds [ayanin[51], retusin[51] (flavone derivative 12[52]), dihydrodibenzoazepine derivative 4i[53], dregamine derivative 2[54], and tabernaemontanine derivative 22[54]] had to be determined without mathematical operations. The IC50 values of ayanin and retusin were stated as ‘>50 µM’ in the original report[51]. Usually, these kinds of statements (e.g., ‘>50 µM’, ‘>100’, ‘inactive’, etc) led to the allocation of such compounds into the ‘inactive’ category (arbitrary IC50 value of 2000 µM in the ABC_BPMDS). However, the authors of the respective publication stated that ayanin and retusin had some (weak) inhibitory activity[51]. Therefore, we decided to allocate an arbitrary value of 100 µM to these compounds to acknowledge their minor inhibitory potential against ABCC1. Dihydrodibenzoazepine derivative 4i[53], dregamine derivative 2[54], and tabernaemontanine derivative 22[54], on the other hand, reached over 100% inhibition at concentrations of 2.50 µM, 20.0 µM, and 20.0 µM, respectively. Unfortunately, these were the only indications of bioactivity by the authors of the original reports[53,54]. Hence, we decided to allocate arbitrary values of 0.999 µM[53], 4.99 µM[54], and 4.99 µM[54], respectively, to acknowledge their potentially (very) high inhibitory power against ABCB1 as well as ABCG2 considering the effect-concentrations used in the original reports. These arbitrary IC50 values have been chosen since sub-classifications of bioactivity classes according to bioactivity thresholds (e.g., 1 and 5 µM) provided a better prediction in our previous works[7].

Data Unification

Several compounds were evaluated in multiple assays, e.g., the mentioned standard inhibitors of ABCB1, ABCC1, and ABCG2. However, to allocate one bioactivity value to one compound, a unification process was necessary. As IC50 values do not follow a normal distribution, the multiple IC50 values associated with one compound were subject to a three-step mathematical operation: (i) logarithmization of the IC50 values; (ii) calculation of the mean; and (iii) delogarithmization of the log(IC50)-mean value. The resultant mean value was allocated to the respective compound. It shall be noted that the bioactivities of the compounds curcumin I-III (ABCC1)[55] and gefitinib (ABCB1 and ABCC1)[56] were only given as a span in the original reports[55,56], and hence, the mean of the respective span was taken for further operations. In total, 60, 48, and 209 ABCB1, ABCC1, and ABCG2 inhibitors have been given a new bioactivity value by these operations compared to the previous multitarget dataset[7,27].

Data Correction and Harmonization

Through the complementary analysis process, several bioactivity values were corrected. This applied for compounds that were falsely marked as ‘inactive’ in the previous multitarget dataset (ABCB1: 22 compounds; ABCC1: 26 compounds; ABCG2: 19 compounds)[7,27]. Lastly, all bioactivity values of the ABC_BPMDS were harmonized according to a number of three significant digits. This harmonization resulted in a standardized format of presentation: (i) ‘XXX0 µM’; (ii) ‘XXX µM; (iii) XX.X µM; (iv) X.XX µM; (v) 0.XXX µM; and (vi) 0.0XX (X = any numeric value between 1–9). Here, 11, 8, and 9 ABCB1, ABCC1, and ABCG2 values have been changed compared to the previous multitarget dataset[7,27].

Data Curation – Molecular-structural Data

The 1,167 compounds of the ABC_BPMDS were portrayed as canonical or isomeric SMILES codes as derived from the (i) respective report, (ii) PubChem database (https://pubchem.ncbi.nlm.nih.gov), or (iii) SMILES generation tool of ChemDraw Pro version 20.1.1.125. All smiles were compared to each other to identify duplicates by using InstantJChem version 21.13.0. Through this individual cross-check of the molecular-structural data, 13 compounds were discovered as duplicates[46,51,56-59] and their bioactivity values were merged with the original bioactivity data of the particular compound[52,59-62]. In addition, three compounds were identified to be incorrect in terms of their molecular structure and have been corrected in the dataset[46,57,63].

Binary Pattern Generation

Background

In contrast to common molecular fingerprints for similarity-based virtual screenings[20,64], the very recently reported novel drug discovery tool ‘computer-aided pattern analysis’ (‘C@PA’) identified that defined (=non-substituted) hydrogens and their positioning is particularly important in terms of the differentiation between selective and multitarget inhibition of ABC transporters[7,26,27]. Although certain fingerprints indeed consider polar hydrogens[21,22], C@PA particularly discovered non-polar hydrogens with critical discriminatory potential in the virtual screening process[7,26,27]. However, the original C@PA worked with a very preliminary and limited dataset of 308 substructures which were compiled after multitarget dataset visualization and literature consideration[65], of which only 162 substructures were active in the multitarget dataset of, at the time of the study, 1,049 compounds[27].

Substructure Visualization, Identification, and Extension

For the development of a complete, detailed, and novel (multitarget) fingerprint, which may also universally be used in (multitarget) virtual screening approaches, the 1,167 compounds of the updated multitarget dataset were visualized using ChemDraw Pro version 20.1.1.125, and substructures were identified and extracted. The extracted substructures [e.g., single-standing/centered (hetero-)aromatic rings, condensed (hetero-)aromatic rings, (un)saturated side chains, extremities, and non-aromatic (hetero-)cycles, etc.] were derivatized by applying a heavy atom substitution scheme as already reported earlier[26] (scaffold fragmentation and substructure hopping). Especially the presence and positioning of (non-polar) hydrogens in the sense of a proton/non-proton pattern scheme was stressed. These measures increased the quantity of substructural properties covered by the intended fingerprint. In addition, alternative datasets of ABC transporter modulators[5] and modes of action (particularly ABC transporter activators)[6,8] have been considered to gain complementary knowledge about potentially active substructures. The resultant substructures were subsequently searched in the 1,167 compounds (loaded as.csv file) using the query search function of InstantJChem version 21.13.0 and, if present, listed in the substructure catalog. As a result, a catalog of 604 active substructures has been assembled.

Individual Pattern Analysis[7]

In a final step, the multitarget dataset of 1,167 compounds was statistically analyzed for the listed 604 substructures of the substructure catalog. Here, the resultant list of hit molecules per substructure derived from the query search function of InstantJChem version 21.13.0 was saved and compared to the original list, translating the entry differences into a binary code [1 = substructure present (active substructure); 0 = substructure not present (inactive substructure)]. A binary pattern distribution scheme resulted which constituted the final ABC_BPMDS. It shall be taken note that the number of the very same substructure within the same compound was irrelevant; the presence (numeric value = 1) of the substructure was not an expression of how often the respective substructure appeared within the compound.

Data Records

The ABC_BPMDS is freely available in an .xlsx format under the http://www.zenodo.org[28] URL as well as the http://www.panabc.info website and its use is free of charge. The dataset consists of (i) an individual database identifier for each compound; (ii) the original name of the compounds according to the original report(s); (iii) the IUPAC nomenclature of each compound generated by using ChemDraw Pro version 20.1.1.125; (iii) The SMILES code obtained either from the (a) supporting information of the respective report, (b) PubChem database (https://pubchem.ncbi.nlm.nih.gov), or (c) manual drawing using ChemDraw Pro version 20.1.1.125; (iv) the physicochemical properties (a) CLogP, (b) calculated molecular water solubility (CLogS), (c) MW, (d) MR, (e) TPSA, (f) H-bond donors, (g) H-bond acceptors, (h) rotatable bonds, and (j) number of heavy atoms; (v) the associated bioactivity values expressed as (a) IC50 values [µM] against ABCB1, ABCC1, and ABCG2 presented in the standardized format of three significant digits as outlined above [10log(mean)], and (b) pIC50 values against ABCB1, ABCC1, and ABCG2; (vi) the binary code (active = 1; inactive = 0) for each of the 604 evaluated substructures of the substructure catalog including their (a) trivial name, (b) SMILES code, (c) number of defined hydrogens, (d) number of heavy atoms, (e) total hit count, and (f) individual substructure identifier. The substructures are sorted from most abundant (left) to most rare (right); and (vii) the PubMed (https://pubmed.ncbi.nlm.nih.gov) identifier (PMID) retrieved from NCBI (https://www.ncbi.nlm.nih.gov). In addition, a detailed curation protocol as well as an associated GraphPad Prism file can be found on https://www.zenodo.org[50] as well as the http://www.panabc.info website.

Technical Validation

Compounds

The 1,167 compounds were portrayed as canonical or isomeric SMILES codes as derived from the respective report or the PubChem database (https://pubchem.ncbi.nlm.nih.gov) and imported into the MarvinSketch editor implemented in InstantJChem version 21.13.0. If the loaded SMILES code appeared as the intended original molecular representation according to the respective report or the PubChem database (https://pubchem.ncbi.nlm.nih.gov) without any errors, it was considered as valid.

Bioactivity Space Validation

In total, 113 reports between 1994 and 2022 have been collected, resulting in a final number of 1,167 compounds that were evaluated against ABCB1, ABCC1, and ABCG2, including inactive compounds as well as selective, dual, and triple inhibitors. Amongst the 1,167 compounds are (i) 525 ABCB1 inhibitors, of which (a) 88 are selective ABCB1 inhibitors (no activity against ABCC1 and ABCG2; any given IC50 value), (b) 67 are potent ABCB1 inhibitors (IC50 values < 1 µM), and (c) 25 are selective and potent ABCB1 inhibitors; (ii) 344 ABCC1 inhibitors, of which (a) 61 are selective ABCC1 inhibitors (no activity against ABCB1 and ABCG2; any given IC50 value), (b) 45 are potent ABCC1 inhibitors (IC50 values < 1 µM), and (c) 11 are selective and potent ABCC1 inhibitors; (iii) 866 ABCG2 inhibitors, of which (a) 409 are selective ABCG2 inhibitors (no activity against ABCB1 and ABCC1; any given IC50 value), (b) 330 are potent ABCG2 inhibitors (IC50 values < 1 µM), and (c) 199 are selective and potent ABCG2 inhibitors. On the other hand, 38, 212, and 58 dual ABCB1/ABCC1, ABCB1/ABCG2, and ABCC1/ABCG2 inhibitors are present, respectively, of which 7, 99, and 13 can be considered as potent dual ABCB1/ABCC1, ABCB1/ABCG2, and ABCC1/ABCG2 inhibitors, respectively (IC50 < 10 µM). Finally, 187 triple ABCB1, ABCC1, and ABCG2 inhibitors can be defined, of which 54 can be considered as potent (IC50 < 10 µM; so-called ‘Class 7’ compounds[7,26,27]). Table 3 summarizes a survey of statistical parameters of the entire ABC_BPMDS as well as important sub-classes. Figure 2 depicts the distribution of the pIC50 values of ABCB1 (A), ABCC1 (B), and ABCG2 (C) inhibitors amongst the entire ABC_BPMDS, which followed in all three cases a Gaussian normal distribution.
Table 3

Statistical survey of the span as well as median and mean values of the bioactivity of the entire ABC_BPMDS as well as important sub-classes.

inhibitor classcountIC50 span [µM]pIC50 spanIC50 median [µM]pIC50 medianIC50 mean [µM]pIC50 mean
ABC_BPMDS1,1670.0153–16307.815–2.7884.395.3583.845.416
All ABCB15250.0153–14607.815–2.8366.375.1966.325.199
All ABCC13440.146–16306.836–2.78811.24.9519.265.033
All ABCG28660.0234–4057.631–3.3931.955.7102.005.698
Selective ABCB1880.0153–7087.815–3.1502.515.5993.415.467
Selective ABCC1610.222–1126.654–3.9515.975.2245.635.249
Selective ABCG24090.0234–4057.631–3.3931.065.9751.135.948
Dual ABCB1/ABCC1380.289–1806.539–3.74520.44.69215.24.819
Dual ABCC1/ABCG22120.0255–3337.593–3.4784.435.3543.855.415
Dual ABCC1/ABCG2580.0988–1637.005–3.78810.14.9966.925.160
Triple ABCB1/ABCC1/ABCG21870.0475–16307.323–2.7886.985.1566.745.172

The pIC50 values have been calculated by using the negative decadic logarithm of the respective bioactivity value.

Fig. 2

Distribution of bioactivity values (pIC50) of the 1,167 compounds of the ABC_BPMDS against ABCB1 (a), ABCC1 (b), and ABCG2 (c).

Statistical survey of the span as well as median and mean values of the bioactivity of the entire ABC_BPMDS as well as important sub-classes. The pIC50 values have been calculated by using the negative decadic logarithm of the respective bioactivity value. Distribution of bioactivity values (pIC50) of the 1,167 compounds of the ABC_BPMDS against ABCB1 (a), ABCC1 (b), and ABCG2 (c).

Physicochemistry Space Validation

Physicochemical properties shape not only the pharmacological profile of ABC transporter inhibitors[66-69], but are also very often used as additional discriminators in virtual screening processes[7,26,27,38]. To prove that the 1,167 compounds of the ABC_BPMDS have a balanced distribution of physicochemical attributes, the ABC_BPMDS was analyzed for the CLogP, MW, MR, and TPSA using MOE version 2019.01. Figure 3 demonstrates that these physicochemical properties are normally distributed within the ABC_BPMDS comparable to other reported datasets[23,70]. Table 4 summarizes the median and mean values of CLogP, MW, MR, and TPSA of the entire ABC_BPMDS as well as important sub-classes. The median and mean values are well-aligned, which accounts for the equal distribution of values.
Fig. 3

Distribution of the important physicochemical[115] properties CLogP (a), MW (b), MR (c), and TPSA (d) amongst the 1,167 compounds of the ABC_BPMDS as determined by MOE version 2019.01.

Table 4

Statistical survey of median and mean values of the important physicochemical properties CLogP, MW, MR, and TPSA amongst the entire ABC_BPMDS as well as important sub-classes as determined by MOE version 2019.01.

inhibitor classcountCLogP medianCLogP meanMW medianMW meanMR medianMR meanTPSA medianTPSA mean
ABC_BPMDS1,1674.334.26403.39418.3811.2711.7373.8677.87
All ABCB15254.404.78432.43458.6712.0712.9073.6379.56
All ABCC13443.913.88420.44442.9211.7212.3776.1084.82
All ABCG28664.344.25396.37415.3211.1311.6474.7379.01
Selective ABCB1885.225.47452.11481.9413.0813.7061.4265.98
Selective ABCC1613.373.41374.49377.2211.0710.7069.7775.29
Selective ABCG24094.384.35372.38381.5810.3910.6773.8675.14
Dual ABCB1/ABCC1385.034.80475.56471.0313.1913.2070.0578.29
Dual ABCC1/ABCG22124.424.52420.23434.6211.8612.3275.6975.83
Dual ABCC1/ABCG2583.623.68376.91398.3310.5211.2376.2680.98
Triple ABCB1/ABCC1/ABCG21873.953.91432.44472.4711.9113.1079.4490.45
Distribution of the important physicochemical[115] properties CLogP (a), MW (b), MR (c), and TPSA (d) amongst the 1,167 compounds of the ABC_BPMDS as determined by MOE version 2019.01. Statistical survey of median and mean values of the important physicochemical properties CLogP, MW, MR, and TPSA amongst the entire ABC_BPMDS as well as important sub-classes as determined by MOE version 2019.01.

Molecular-Structure Space Validation

H-bonds and molecular flexibility are crucial aspects in terms of ligand-target interactions, especially for ABC transporters[71]. Hence, we analyzed the 1,167 compounds of the ABC_BPMDS for their number of H-bond donors, H-bond acceptors, and rotatable bonds. Figure 4 visualizes the found distributions amongst the entire ABC_BPMDS. Together with CLogP and MW, H-bond donors and acceptors play a major role in the drug-likeliness as defined by Lipinsky[72], particularly influencing drug absorption, distribution, and permeation. Considering the ‘Lipinski rule of five’ (CLogP ≤ 5; MW ≤ 500; H-bond donors ≤ 5; H-bond acceptors ≤10), a large majority of compounds of the ABC_BPMDS fulfils these requirements. In particular, (i) 73.8% of compounds have CLogP values of ≤5, (ii) 84.0% of compounds have a MW of ≤500, (iii) 99.7% of compounds have ≤5 H-bond donors, and (iv) 98.6% of compounds have ≤10 H-bond acceptors. Table 5 summarizes the median and mean values of H-bond donors, H-bond acceptors, and rotatable bonds of the entire ABC_BPMDS as well as important sub-classes. Hence, the ABC_BPMDS contains suitable templates for future drug design and therapeutic development purposes, however, leaves also enough molecular-structural and physicochemical space for explorational analyses beyond the ‘Lipinski rule of five’ for the creation of inhomogeneous high-quality compound collections and compound libraries.
Fig. 4

Distribution of H-bond donors (a), H-bond acceptors (b), and rotatable bonds (c) amongst the 1,167 compounds of the ABC_BPMDS as determined by MOE version 2019.01.

Table 5

Statistical survey of median and mean values of H-bond donors, H-bond acceptors, and rotatable bonds amongst the entire ABC_BPMDS as well as important sub-classes as determined by MOE version 2019.01.

inhibitor classcountH-bond donors medianH-bond donors meanH-bond acceptors medianH-bond acceptors meanrotatable bonds medianrotatable bonds mean
ABC_BPMDS1,16711.1244.4366.50
All ABCB152511.0744.9577.72
All ABCC134411.2844.8766.91
All ABCG286611.1544.4256.51
Selective ABCB18810.7544.7087.70
Selective ABCC16111.4644.0365.57
Selective ABCG240911.1443.7955.35
Dual ABCB1/ABCC13810.89544.3266.74
Dual ABCC1/ABCG221210.98644.7877.87
Dual ABCC1/ABCG25811.1444.404.55.66
Triple ABCB1/ABCC1/ABCG218711.3545.4067.76
Distribution of H-bond donors (a), H-bond acceptors (b), and rotatable bonds (c) amongst the 1,167 compounds of the ABC_BPMDS as determined by MOE version 2019.01. Statistical survey of median and mean values of H-bond donors, H-bond acceptors, and rotatable bonds amongst the entire ABC_BPMDS as well as important sub-classes as determined by MOE version 2019.01.

Usage Notes

Status Quo

Practical Use

An easy-to-use sort function allows the user to discriminate the compounds regarding their bioactivities toward the targets, physicochemical properties, or molecular-structural features, but also in terms of the 604 different substructures. Hence, the user can retrieve the necessary binary pattern information for subsequent virtual screening and rational drug design approaches.

Special Considerations

The majority of the compounds was evaluated in proper full-blown concentration effect curves within the original report, providing either only one single IC50 or two IC50 values from different assays for biological validation, resulting mostly in minor standard deviations or standard errors. However, considering established reference compounds, many IC50 values have been reported that are not fully covered by the deep literature search. Moreover, these drugs and drug-like compounds were tested in various assays, and thus, their IC50 values vary in a greater span than of other compounds. In addition, data processing prior to the original publication varied from laboratory to laboratory [e.g., number of concentrations tested, manner of assay performance (non-standardized procedures), manner of data analysis (e.g., three- vs four-parameter logistic equation, relative vs absolute inhibition), data presentation (single-point screening graphic vs full-blown concentration effect curve, number of significant digits, in- or exclusion of standard deviation and/or standard error)] – contributing to a greater uncertainty of these particular data. Furthermore, the assays themselves that were considered for the ABC_BPMDS were various [e.g., influx vs efflux assay, fluorescence labeling vs radionuclide detection, manner of substrate (e.g., calcein AM vs mitoxantrone), selected cells vs transfected cells vs membrane vesicles) – contributing to a general variation in data that is hidden due to the fact that most compounds were only evaluated in one particular assessment system. These aspects should be considered when using the ABC_BPMDS, however, at the same time, it should be taken note that our previous work demonstrated the strength of substructural patterns based on the previous version of the ABC_BPMDS[7,26,27]. A list of compounds affected by these variations in assessment systems can be found in the curation protocol under the https://www.zenodo.org[50] URL (10.5281/zenodo.6405752) or on the http://www.panabc.info web site.

Future Perspective

Extension – New Compounds

The ABC_BPMDS provides the core application for extension to other, less- and under-studied ABC transporters. Particularly, the addressing of under-studied ABC transporters by multitarget agents poses a promising prospect for future drug discovery and development. Several compounds of the ABC_BPMDS have been demonstrated to address other ABC transporters as well[5,25,26], such as benzbromarone[5,7,25-27], cyclosporine A[5,7,25-27], dipyridamole[7,27,73-77], erlotinib[7,27,78,79], imatinib[5,7,25-27], nilotinib[7,27,78,80,81], ritonavir[7,27,82], verapamil[5,7,25-27], and verlukast[5,7,25-27,33]. These ‘truly multitarget pan-ABC transporter inhibitors’[25] are the primary focus for extension of the ABC_BPMDS, particularly with respect to their substructural elements that promote multitargeting. On the other hand, the addition of multitarget agents that are not part of the ABC_BPMDS will contribute valuable input to the polypharmacological space as charted by the future ABC_BPMDS_1.2.

Extension – New Substructures (‘ABC_BPMDS_1.2’)

The substructural elements of the mentioned truly multitarget pan-ABC transporter inhibitors include 4-anilinopyrimidine[7,27], benzyl[7], cyano[7,27], 3,4-dimethoxyphenyl[7], fluorine[7,27], furan[7,26], ethylene diamine[7], ethylene hydroxy[7], hydroxy[7], isopropyl[7,27], methylene hydroxy[7], phenethyl[7], piperazine[7,27], pyrimidine[7,26], quinazoline[7,27], thiazlole[7,26], and thioether[7]. These and other substructures will be re-evaluated with respect to true multitargeting, and thus, receive a differential value dependent on the purpose of the subsequent studies. Furthermore, the addition of multitarget agents that are not part of the ABC_BPMDS will contribute valuable input to the substructure catalog, extending the substructural output of the future ABC_BPMDS_1.2. Specifically, this information beyond known multitarget fingerprints will enable the exploration and exploitation of under-studied ABC transporters as potential drug targets of the future.

Extension – New Modes and Targets

Particularly, the inclusion of, for example, different modes of modulation (e.g., activation), bioactivity measurements [e.g., in vitro (ATPase assays or MDR reversal assays), in silico binding mode analyses (e.g., molecular docking or molecular dynamics simulations), or structural information (e.g., x-ray, cryo-EM, homology-modelling, or AlphaFold[83])] will promote the discovery of drug candidates with distinctive mode of action. Furthermore, the logistics outlined in this work also provide a useful framework for similar data mining and descriptor approaches with respect to different pharmacological targets [e.g., under-studied human/bacterial ABC transporters, G-protein coupled receptors (GPCRs), ion channels (ICs), solute carriers (SLCs; PANSLC, http://www.panslc.info) or tyrosine kinases (TKs)].
Measurement(s)Influx • Efflux • Tracer • Transport velocity
Technology Type(s)Fluorometry • Radioactivity • Plate reader • Flow cytometer • Tracer distribution
Factor Type(s)half-maximal inhibition concentration
Sample Characteristic - OrganismHomo sapiens
Sample Characteristic - Environmentcell culture
Sample Characteristic - LocationKingdom of Norway • Germany • Australia • Latvia
  112 in total

1.  The National Center for Biotechnology Information.

Authors:  D Benson; M Boguski; D Lipman; J Ostell
Journal:  Genomics       Date:  1990-02       Impact factor: 5.736

2.  The novel BCR-ABL and FLT3 inhibitor ponatinib is a potent inhibitor of the MDR-associated ATP-binding cassette transporter ABCG2.

Authors:  Rupashree Sen; Karthika Natarajan; Jasjeet Bhullar; Suneet Shukla; Hong-Bin Fang; Ling Cai; Zhe-Sheng Chen; Suresh V Ambudkar; Maria R Baer
Journal:  Mol Cancer Ther       Date:  2012-07-09       Impact factor: 6.261

3.  Characterization of P-glycoprotein inhibition by major cannabinoids from marijuana.

Authors:  Hao-Jie Zhu; Jun-Sheng Wang; John S Markowitz; Jennifer L Donovan; Bryan B Gibson; Holly A Gefroh; C Lindsay Devane
Journal:  J Pharmacol Exp Ther       Date:  2006-01-26       Impact factor: 4.030

Review 4.  The ABC subfamily A transporters: Multifaceted players with incipient potentialities in cancer.

Authors:  Michela Pasello; Anna Maria Giudice; Katia Scotlandi
Journal:  Semin Cancer Biol       Date:  2019-10-09       Impact factor: 15.707

5.  The FLT3 and PDGFR inhibitor crenolanib is a substrate of the multidrug resistance protein ABCB1 but does not inhibit transport function at pharmacologically relevant concentrations.

Authors:  Trevor J Mathias; Karthika Natarajan; Suneet Shukla; Kshama A Doshi; Zeba N Singh; Suresh V Ambudkar; Maria R Baer
Journal:  Invest New Drugs       Date:  2015-01-20       Impact factor: 3.850

6.  PET imaging to assess the impact of P-glycoprotein on pulmonary drug delivery in rats.

Authors:  Irene Hernández-Lozano; Severin Mairinger; Thomas Filip; Michael Sauberer; Thomas Wanek; Johann Stanek; Johannes A Sake; Thomas Pekar; Carsten Ehrhardt; Oliver Langer
Journal:  J Control Release       Date:  2021-12-28       Impact factor: 9.776

Review 7.  Overcoming ABC transporter-mediated multidrug resistance: The dual role of tyrosine kinase inhibitors as multitargeting agents.

Authors:  Giovanni Luca Beretta; Giuliana Cassinelli; Marzia Pennati; Valentina Zuco; Laura Gatti
Journal:  Eur J Med Chem       Date:  2017-08-03       Impact factor: 6.514

8.  Hedgehog pathway inhibitor HhAntag691 is a potent inhibitor of ABCG2/BCRP and ABCB1/Pgp.

Authors:  Yimao Zhang; John Laterra; Martin G Pomper
Journal:  Neoplasia       Date:  2009-01       Impact factor: 5.715

9.  The multidrug transporter ABCG2 (BCRP) is inhibited by plant-derived cannabinoids.

Authors:  M L Holland; D T T Lau; J D Allen; J C Arnold
Journal:  Br J Pharmacol       Date:  2007-10-01       Impact factor: 8.739

10.  PubChem Substance and Compound databases.

Authors:  Sunghwan Kim; Paul A Thiessen; Evan E Bolton; Jie Chen; Gang Fu; Asta Gindulyte; Lianyi Han; Jane He; Siqian He; Benjamin A Shoemaker; Jiyao Wang; Bo Yu; Jian Zhang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2015-09-22       Impact factor: 16.971

View more
  1 in total

1.  A curated binary pattern multitarget dataset of focused ATP-binding cassette transporter inhibitors.

Authors:  Sven Marcel Stefan; Patric Jan Jansson; Jens Pahnke; Vigneshwaran Namasivayam
Journal:  Sci Data       Date:  2022-07-26       Impact factor: 8.501

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.