Literature DB >> 23990708

In silico identification of EGFR-T790M inhibitors with novel scaffolds: start with extraction of common features.

Mingli Xiang1, Kai Lei, Wenjie Fan, Yuchun Lin, Gu He, Mingli Yang, Lijuan Chen, Yirong Mo.   

Abstract

BACKGROUND: Epidermal growth factor receptor (EGFR) is an attractive therapeutic target for a number of human tumors including non-small cell lung cancer (NSCLC). Most patients with NSCLC and somatic mutations have shown a dramatic initial clinical response to reversible EGFR inhibitors. The clinical efficacy of reversible inhibitors is, however, ultimately limited due to the emergence of drug resistance, which is usually conferred by the EGFR T790M mutation. Importantly, irreversible, synthetic small molecule inhibitors are currently evaluated and some of them have been shown to overcome the acquired resistance that is oftentimes observed in these patients. Thus far, irreversible EGFR inhibitors as a drug class have not received regulatory approval due in part to their poor effectiveness at clinically achievable concentrations. Therefore, there is an urgent need to discover and develop novel, potent irreversible inhibitors against the EGFR T790M mutation.
MATERIAL AND METHODS: In the following study, we report a novel "hybrid strategy" to identify irreversible EGFR inhibitors with active scaffolds starting with the identification and extraction of a common chemical reactive feature and a pharmacophore feature. The chemical reactive feature was elucidated by investigating 138 currently known irreversible inhibitors at B3LYP/6-31G(d) level using the density function theory method. The pharmacophore feature was extracted from the same inhibitors using pharmacophore modeling. Based on these unique features, two constraints were set while calibrating the protocols of in silico screening. Compounds bearing these specific features were obtained from the National Cancer Institute diversity database to form our subsequent library. Finally, a structure based virtual screening against the library was conducted using standard protocols validated in our lab.
RESULTS: Twenty-eight candidate compounds that demonstrated antitumor activity and that had novel scaffolds different from commonly known quinazoline/quinoline analogs were obtained. The interaction modes between three representative candidates and our model system are similar to that between the model system and the reference compound T-001, which has previously been reported to be one of the most potent of the 138 irreversible inhibitors.
CONCLUSION: The hybrid strategy starting with the extraction of common features is an effective approach to design potential irreversible inhibitors with novel scaffolds and therefore to obtain lead molecules in the selection process. These candidates possessing unique scaffolds have a strong likelihood to act as further starting points in the preclinical development of potent irreversible T790M EGFR inhibitors.

Entities:  

Keywords:  NCI database; drug resistant; mutant EGFR; pharmacophore modeling; quantum chemical calculation; virtual screening

Mesh:

Substances:

Year:  2013        PMID: 23990708      PMCID: PMC3748928          DOI: 10.2147/DDDT.S41305

Source DB:  PubMed          Journal:  Drug Des Devel Ther        ISSN: 1177-8881            Impact factor:   4.162


Introduction

As key regulators of critical cellular processes, the ErbB protein family or epidermal growth factor receptor (EGFR) family has received much attention for several decades.1–8 The human EGFR family consists of four members: EGFR [(Human Epidermal Growth Factor Receptor) HER1/ErbB1], HER2 (ErbB2), HER3 (ErbB3), and HER4 (ErbB4).2,3,9–11 They are structurally related receptor tyrosine kinases (RTKs) sharing a similar molecular architecture.3,10,12–14 Each of them comprises identical extracellular ligand-binding regions, a single hydrophobic transmembrane segment, and a cytoplasmic region. The extracellular region contains four sub-domains (I–IV)12–14 and the cytoplasmic region comprises a conserved protein tyrosine kinase (TK) catalytic domain as well as a carboxy terminal tail with tyrosine autophosphorylation sites.2,3 It is well recognized that ErbB members share remarkable homology in their endocellular TK domains, but are distinct in their extracellular component and carboxy terminal tails.13 The ectodomain structure of ErbB2, for example, is radically different from the others.14 ErbB2 has a fixed conformation that resembles the ligand-activated state of EGFR and ErbB3. Within the extracellular region of ErbB2, a unique sub-domain I–III interaction buries the ligand binding site and makes the site not accessible for interaction.14 As such, ErbB2 lacks a ligand-binding domain to interact with a growth factor ligand. Although the intracellular TK domain of ErbB receptors is highly conserved, the kinase domain of ErbB3 has a substitution in critical amino acids, which results in no ErbB3 intrinsic kinase activity.3,13,15,16 ErbB2 and ErbB3 are non-autonomous TKs. They form heterodimeric complexes with other ErbBs that are capable of generating potent downstream signaling. In contrast, the other two members are autonomous. When bound to ligand growth factors, the receptor dimerization is induced and intracellular protein TK is activated with subsequent initiation of numerous downstream signaling events, which ultimately leads to cell proliferation, migration, and differentiation.3,13 Aberrant ErbB receptor activation and their intracellular signaling pathways and mutations in RTKs have been causally linked to cancers, diabetes, inflammation, severe bone disorders, arteriosclerosis, and angiogenesis.12,13,17 Among all of the four members, EGFR was the first receptor protein-TK to be sequenced10 and linked directly to human tumors.3,14 Not only is it involved in development of numerous types of human cancers3,14,18 but it is even the host cofactor for hepatitis C virus entry as well.19 It has been intensely pursued as a therapeutic target.4,10,14,17,20 One of the most clinically advanced strategies toward aberrant ErbB receptor activation is small-molecule inhibition of the protein TK catalytic domain13,14,21–24 in which small-molecule TK inhibitors (TKIs) compete with adenosine triphosphate (ATP) in the TK domain. The first generation TKIs (erlotinib, gefitinib,4 and lapatinib, for instance) are reversible small-molecule inhibitors. They prevent autophosphorylation of the EGFR TK by competing with ATP.20 But cellular oncogenic transformation caused by different EGFR mutations results in differential sensitivity to the first generation TKIs.24,25 Although most patients with NSCLC and somatic mutations showed a dramatic initially clinical response to EGFR reversible inhibitors, they gradually developed acquired resistance to these drugs26 due to the EGFR T790M mutation.4,6,7,16,21,24,27 This mutation leads to a change in the 3-dimensional structure of the TK domain and prevents reversible inhibitors from binding to EGFR.11,23,24 In this situation, various second generation EGFR TKIs are currently being evaluated. In contrast to the first generation TKIs that join to the catalytic site in the EGFR TK domain through classic competitive binding with ATP, second generation TKIs such as HKI-272 (neratinib),15 EKB-569 (pelitinib),28 BIBW2992 (tovok),15,20,29 and PF00299804 (dacomitinib),15,20,30 which are shown in Figure 1, form covalent bonds to a conserved cysteine residue5 within the active site. The obvious advantage of these compounds over the first generation entities is the ability to prevent the development of acquired resistance or overcome acquired resistance in patients previously treated with the first generation of drugs.4,5,11,17,24,26,28,31 Although clinical trials with a number of such compounds have shown promising results, irreversible EGFR inhibitors as a drug class have not received regulatory approval.17,28 In fact, some of them did not demonstrate effectiveness to overcome this mutant at clinically achievable concentrations.16,25,31,32 It is clear that much work remains to be done on the road to develop second generation EGFR TKIs.
Figure 1

Structures of irreversible TK inhibitor.15,20,28–30 The colored ellipsoid marks the common features.

Abbreviation: TK, tyrosine kinase.

It was shown by our detailed review that the main group of currently known EGFR TKIs are almost all based on a quinazoline/quinoline scaffold.28,31,33–37 It is believed that the quinazoline/quinoline scaffold may not be the most potent or specific for inhibiting T790M EGFR.4 Therefore, to discover and develop novel, potent irreversible inhibitors against the EGFR T790M is quite important. Compounds with cores different from the quinazoline/quinoline scaffold and possessing inhibitory activity against T790M EGFR could represent promising starting points as lead compounds. Undoubtedly, 2nd generation inhibitors against T790M EGFR must have some common features and it is therefore important to find those compounds by revealing and utilizing these common features. In this study, we report a novel hybrid strategy, which combines quantum chemical calculation, pharmacophore modeling, and structure-based drug design, to identify irreversible inhibitors with novel scaffolds. It started with the extraction of a common chemical reactive feature and a pharmacophore feature. Compounds bearing those specific common features were obtained from the National Cancer Institute (NCI) database (release 3 files) and were docked to the binding site of the model system with a calibrated protocol. Finally, 28 compounds that demonstrated antitumor activity and bear novel scaffolds different from commonly known quinazoline/quinoline analogs were identified. For three representative candidates, their binding modes to the model system were predicted.

Material and methods

As an attractive target, EGFR TK plays a crucial role in cancer therapy. Although several inhibitors of EGFR TK have been clinically validated for the treatment of patients with cancers during the past several years,26 the search for novel and more potent irreversible inhibitors against this target is still meaningful and considerably challenging. Our aim is to identify irreversible inhibitors with novel scaffolds against T790M EGFR starting with the identification and extraction of the common features from currently known irreversible inhibitors. The schematic representation chart of our workflow is shown in Figure 2.
Figure 2

Schematic diagram of workflow for identifying potential EGFR-T790M inhibitors with novel scaffolds.

Abbreviations: EGFR, epidermal growth factor receptor; TKI, tyrosine kinase inhibitor; NCI, National Cancer Institute.

Extraction of common features

In view of the properties of currently known irreversible inhibitors that can form covalent bonds with a conserved cysteine residue within the active site of EGFR, we presumed that there must exist some common features among them. Obviously, the features must at least include a chemical reactive feature and a pharmacophore feature. The former is related to the forming of covalent bonds and the latter is essential for a drug’s biological activity.

Reactive site of irreversible inhibitors

At first, 138 TKIs (Table S1 in supplementary material) with biological activity values (IC50) ranging from 0.002 to 13.570 μM were selected from published documents.34–37 It was noted that almost all of them contain either an acrylamide group or a propargyl amide group. The exact location of Michael receptor in irreversible TKIs was determined by optimizing structures of 138 TKIs and calculating their reactive sites. The structures were drawn using MarvinSketch (ChemAxon Ltd., Budapest, Hungary).38 Next, their geometries were obtained at the same operating environment by calculating the lowest energy conformer using a MMFF94 force field.39 Later, these geometries were further optimized with density functional theory (DFT) method at B3LYP/6-31G(d) level by employing Gaussian 03.40 Finally, the Natural Bond Orbital (NBO) charge distribution of molecules was calculated using the B3LYP/6-31G(d)//B3LYP/6-31G(d) method. For a molecule containing N electrons, independent calculations were made for the corresponding N-electron molecule and (N + 1)-electron molecule with the same molecular geometry.41 The variation of NBO charges on an atom of an (N + 1)-electron molecule relative to the N-electron molecule reflects the site reactivity of nucleophilic attack.41 It is obvious that the atom with the largest variation of NBO charge is most probably the reactive site of a molecule that involves a Michael addition reaction with the sulfur atom of Cysteine (Cys)797 of EGFR.

Pharmacophore feature

For all of the 138 currently known 2nd generation TKIs, their common pharmacophore feature was extracted as follows. At first, conformations of these 138 compounds were generated with Omega 2.2.5 (OpenEye Scientific Software Inc., Santa Fe, NM, USA)42 using the geometry optimized at B3LYP/6-31G(d) level with DFT method. Then, pharmacophore models were built by employing Discovery Studio (Accelrys Inc., San Diego, CA, USA)43 with compound T-001 (Figure 3A),37 a reference compound that is one of the most potent irreversible TKIs among these 138 compounds. By manually adjusting the number and types of the function groups and through mapping all conformations of these 138 TKIs onto the pharmacophore models, the common pharmacophore feature of these inhibitors was finally identified and extracted. As shown in Figure 3, the common pharmacophore feature of these 2nd generation inhibitors consists of one hydrogen bond donor, one aromatic hydrophobic center, and two hydrogen bond acceptors.
Figure 3

The common pharmacophore feature generated from compound T-00137 (A) and 138 tyrosine kinase inhibitors (TKIs)34–37 superimposed on the feature (B). Aromatic hydrophobic center is represented by the orange wire mesh ball. Hydrogen bond accepter is depicted with the green arrow and wire mesh balls while the hydrogen bond donor is described by the pink arrow and wire mesh balls.

After that, another 21 irreversible inhibitors4,34–37 coupled with two reversible inhibitors4 (Table S2 in supplementary materials) were used to test whether these extracted features are effective enough to identify potential irreversible inhibitors.

Docking with constraints

As a reliable, cost-effective, and time-saving technique, structure based virtual screening (SBVS) has shown a powerful ability to discover lead compounds.32 In the SBVS pipeline, preparation of receptor, determination of docking protocols and preparation of a compound library are the committed steps.

Preparation of receptor

TK cSrc was chosen as the model system of the mutant EGFR and its 3-dimensional structure was obtained from the Protein Data Bank (PDB ID: 2QQ7).44 Water molecules in the crystallographic complex were stripped. Preparation of receptor for docking was done using FRED RECEPTOR 2.2.5 (OpenEye Scientific Software Inc., Santa Fe, NM, USA).45 Four hundred different molecule sensors which were stored internally by the program were used to describe the binding site for docking. The size of the box is 6,140 cubic Angstroms. The inner contour and outer contour filters, which were detected with 400 sensor molecules, are 92 cubic and 1,442 cubic angstroms, respectively.

Identification of docking protocols

As many as 252 docking strategies were tested to find a satisfactory way upon which TKIs with potent activities could be enriched at the top of high-score ranking. At first, conformers of the 138 TKIs were created via Omega 2.2.542 and the number of conformers for each TKI was limited up to 2,000. Then, in total, 252 schedules were tested for obtaining optimal docking strategy. At the conformer searching stage, four scoring functions including Shapegauss,46 Plp,47 Chemgauss2,48 Chemgauss348 were chosen for exhaustive searching of conformers. At the optimization stage, seven scoring functions (Chemgauss2,48 Chemgauss3,48 Chemscore,48 Shapegauss,46 Plp,47 OEChemscore, and Screenscore49) were hired for rigid optimization of poses. For each one of the above 28 combinations, nine scoring functions (those seven used in rigid optimization plus Zapbind50 and consensus score) were used for evaluating the interaction between the TKIs and the active site of EGFR. Scoring functions employed for identifying optimal docking protocols are listed in Table 1.
Table 1

Scoring functions employed for identifying optimal docking protocols at different stages*

Exhaustive searchRigid optimizationInteraction evaluation
Shapegauss46Shapegauss46Shapegauss46
Plp47Plp47Plp47
Chemgauss248Chemgauss248Chemgauss248
Chemgauss348Chemgauss348Chemgauss348
Chemscore48Chemscore48
OEChemscoreOEChemscore
Screenscore49Screenscore49
Zapbind50
Consensus score

Notes:

In total, 252 docking strategies were gained using the combinations of scoring functions at different stages. Scoring functions employed in the optimal docking protocols are marked with bold in the table.

Abbreviation: Plp, linear pairwise potential.

In the process of docking currently known irreversible TKIs, two constraints were added. The first one was that each potential irreversible TKI should provide a hydrogen acceptor to form a hydrogen bond with the hinge region M793. There is strong evidence that inhibitors make a hydrogen bond to the hinge region atom N-H in M793 in the protein kinase family.4,5,8,17,28,51,52 The other constraint was to require an alkyne carbon or an alkene carbon being within 4 ± 1 angstrom range of the sulfur atom of the conserved Cys797 residue, which would allow for covalent bond formation.53 This is supported by our quantum chemical calculations that demonstrated that the reactive site of the known irreversible TKIs locates at the β-carbon of the acrylamide or propargyl amide group.

Preparation of the molecule library

As described above, common features of currently known irreversible TKIs were extracted (Figures 1–3). Commonly, each of those TKIs has an acrylamide group or a propargyl amide group. At the same time, they all bear one hydrogen bond donor, one aromatic hydrophobic center, and two hydrogen bond acceptors. We expected that potential irreversible TKIs with novel scaffolds would have such features. These features were used as a filter (Figure 2). Compounds in the NCI database (260,071) were filtered using these features. Those that do not meet these features were eliminated. After filtering, only 868 compounds in the NCI database were remained. Then, reference compound T-001 and another three irreversible inhibitors4 with novel scaffolds mixed into these molecules. Finally, conformations of these compounds were generated using Omega 2.2.542 and the maximum number of conformations was set to 2,000.

Constraint virtual screening and interaction mode analysis

With the optimal docking protocols mentioned above and two added constraints, 868 compounds, from the NCI database and bearing the common features, were docked to the binding site of a model system using FRED program version 2.2.5.45 Potential irreversible TKIs were obtained from the top 200 of the SBVS result. For the reference compound T-00137 and the screened potential irreversible TKIs with representative scaffolds, the interaction between them and the model system was analyzed and visualized using DS visualizer 3.1 (Accelrys Inc., San Diego, CA,USA).43

Results and discussion

Common features

The irreversible TKIs can form covalent bonds with a conserved cysteine residue (Cys797) within the active site of T790M EGFR. Apparently, as shown in Figure 1 and Table S1 in supplementary material, currently known 2nd generation TKIs34–37 contain either an acrylamide group or a propargyl amide group. Quantum chemical calculations revealed that the reactive site of the known irreversible TKIs, which form a covalent bond to the conserved Cys797 within T790M EGFR, locates at the β-carbon of the acrylamide or propargyl amide group (Figure 4 and Table 2). This is the theoretical basis for us to set one of the constraints while conducting SBVS.
Figure 4

The electrophilic reactive site of some known irreversible TKIs34–37 calculated at B3LYP/6-31G(d) level with DFT method by employing Gaussian 0340 and the effect of replacement on the bioactivity of irreversible TKI. Electrophilic center was highlighted by the red color and the nucleophilic center was painted with a blue color. Red arrow points to the β-carbon of the acrylamide group or propargyl amide group.

Notes: T-005, 0.008 μM (A); T-084, 0.250 μM (B); T-092, 0.310 μM (C); T-125, 1.124 μM (D); T-084, 0.250 μM (E); T-008, 0.011 μM (F); T-092, 0.310 μM (G).

Abbreviations: DFT, density functional theory; TKI, tyrosine kinase inhibitor.

Table 2

NBO charge distribution and the relevant change of charge distribution in the N-electron and (N + 1)-electron molecule for N electrons systems T-005 and T-008

NoT-005
T-008
Elementq(N)q(N + 1)ΔqElementq(N)q(N + 1)Δq
1C0.1530.133−0.021C0.1660.141−0.025
2C−0.239−0.257−0.017C−0.232−0.246−0.013
3C−0.203−0.278−0.074C−0.200−0.268−0.068
4C0.1730.1790.006C0.1720.1760.004
5C−0.130−0.1150.015C−0.133−0.1190.014
6C−0.240−0.333−0.093C−0.248−0.329−0.081
7N−0.497−0.554−0.057N−0.491−0.547−0.056
8C0.2620.218−0.044C0.2600.215−0.044
9N−0.560−0.591−0.030N−0.559−0.587−0.027
10C0.4660.376−0.090C0.4620.373−0.089
11N−0.568−0.577−0.009N−0.571−0.578−0.007
12C0.1710.1740.003C0.1730.170−0.003
13C−0.274−0.294−0.020C−0.273−0.290−0.016
14C−0.090−0.0880.002C−0.091−0.0900.001
15C−0.267−0.315−0.049C−0.267−0.313−0.046
16C−0.212−0.226−0.014C−0.212−0.224−0.012
17C−0.265−0.279−0.014C−0.268−0.285−0.017
18Br0.0510.008−0.043Br0.0550.016−0.039
19N−0.605−0.5970.008N−0.614−0.6060.008
20C0.6350.552−0.083C0.6670.617−0.050
21C−0.139−0.1170.022C−0.340−0.353−0.013
22O−0.586−0.649−0.063O−0.615−0.675−0.060
23C0.073−0.045−0.118C−0.119−0.234−0.116
24C−0.750−0.7300.020C−0.290−0.2790.011
25H0.2350.208−0.028N−0.490−0.4880.002
26H0.2560.227−0.029C−0.471−0.4670.004
27H0.2560.241−0.015C−0.477−0.4720.004
28H0.2170.186−0.031H0.2800.265−0.016
29H0.4230.419−0.004H0.2570.231−0.026
30H0.2900.2890.000H0.2080.186−0.022
31H0.2530.235−0.019H0.2170.189−0.028
32H0.2440.227−0.017H0.4130.408−0.004
33H0.2350.229−0.006H0.2910.2910.000
34H0.4290.410−0.018H0.2540.237−0.017
35H0.2700.249−0.021H0.2440.228−0.016
36H0.2660.243−0.023H0.2270.220−0.007
37H0.2690.242−0.027H0.4140.400−0.014
38H0.2180.198−0.020
39H0.2460.230−0.016
40H0.2120.190−0.023
41H0.2330.222−0.011
42H0.2300.218−0.012
43H0.1900.179−0.011
44H0.2250.221−0.004
45H0.2310.217−0.014
46H0.2280.2340.006
47H0.1900.179−0.011

Note: β-carbon atoms are in bold. Data obtained from Tsou HR, Mamuya N, Johnson BD, et al. 6-Substituted-4-(3-bromophenylamino)quinazolines as putative irreversible inhibitors of the epidermal growth factor receptor (EGFR) and human epidermal growth factor receptor (HER-2) tyrosine kinases with enhanced antitumor activity. J Med Chem. 2001;44(17):2719–2734.37

Abbreviations: NBO, natural bond orbital; q, electric charge.

It is interesting to note that often the β-carbon of acrylamide group or propargyl amide group in the currently known irreversible TKIs is the most electrophilic, as shown in Figure 4A and D. While results of NBO analysis show that the β-carbon is no longer the most electrophilic atom when an electron-withdrawing atom directly links to the β-carbon (Figure 4B) or the nitrogen atom at position three of the quinazoline group is replaced by a carbon linked with a cyano group (Figure 4C). The β-carbon becomes one of the most electrophilic atoms in the molecule. Even though it still keeps the tendency to attack the nucleophilic thiol group of the conserved Cys797 in T790M EGFR, the bioactivity of the molecule is much less than its analog (Figure 4E, 4F and 4G). Except for the chemical reactive feature mentioned above, the currently known irreversible TKIs have a common pharmacophore feature that was extracted previously. As shown in Figure 3, the common pharmacophore feature can be described to include one hydrogen bond donor, one aromatic hydrophobic center, and two hydrogen bond acceptors. It is known that pharmacophore refers to the 3-dimensional arrangement of functional groups essential to a drug’s biological activity.54,55 A diverse set of lead compounds, which potentially possess desired biological activity but have totally different chemical scaffolds, often share a common pharmacophore.56 In a sense, compounds sharing these common chemical reactive and pharmacophore features could be guaranteed to form covalent bonds to the conserved Cys797. At the same time, they possess the desired biological activity and have different chemical scaffolds as well.

Selection of model system

There are many 3-dimensional structures of T790M EGFR determined using X-ray crystallography. We choose TK cSrc (PDB ID: 2QQ7)44 as the model system of the mutant EGFR based on the following considerations. At first, the resolution of 2QQ7 is 2.38 angstrom and the crystal structure is complexed with an irreversible inhibitor. It was believed that when the resolution value of crystallographic data is lower than 2.5 angstrom the modeling of details in the protein structures is much more subjective than desired.57,58 It was also confirmed that selecting structures with poor resolution may produce false predicted conformations.59 Thus, as a rule of thumb, data worse than 2.5 angstrom resolution should not be considered reliable for virtual library screening applications.60 In addition, cSrc is a tractable target for the structural investigation of irreversible EGFR. It has many advantages44 for the study of homologous protein kinases over EGFR. The kinase domains between EGFR and cSrc share high identity44 in structure and sequence. The sequence identity between their ATP binding sites is as high as 70%. Most importantly, cSrc has been validated as a model system for the structural investigation of irreversible EGFR inhibitors.33

Constrained docking against the NCI database

Having a proper pose within the binding site is indispensable for ligand being effective. In addition, adding constraints is an effective way to gain proper poses of ligands. Therefore, two constraints, which are discussed above, were added while docking molecules to the binding site of the model system. With those two constraints, 138 known irreversible TKIs were docked to the binding site of the model system. Finally, an optimal strategy that provides the highest enrichment rate was identified from among 252 combinations. Scoring functions Chemgauss2,48 Screenscore,49 and Shapegauss46 were used for exhaustive searching of conformers, rigid optimization of poses, and interaction evaluation, respectively. According to this strategy, five potent TKIs were enriched within the top six of the 138 compounds. These results are presented in Table 3.
Table 3

The IC50 value and Shapegauss46 score of five potent TKIs34–37 enriched within the top six of 138 second-generation TKIs

TKIsIC50 (μm·L−1)Shapegauss score
T-0130.016−629.309
T-0160.020−606.586
T-0190.024−602.346
T-1060.360−601.050
T-0180.024−598.813
T-0090.011−596.112

Abbreviations: IC50, half maximal inhibitory concentration; TKI, tyrosine kinase inhibitor.

Together with reference compound T-001 and another three irreversible inhibitors,4 868 molecules retrieved from the NCI database containing 260,071 compounds were docked to the binding site of the model system according to the optimal docking strategy. Of these, 419 compounds successfully docked to the site. By manually analyzing compounds among the top 200 compounds, 28 of them are found to have novel scaffolds different from the currently known quinoline/quinazoline core and demonstrated bioactivity against various types of tumors. The average antitumor activities (GI50) of these 28 compounds are in the order of magnitude of 10−5 μg/mL (Table 4).
Table 4

The Shapegauss46 score and antitumor activities of the 28 potential irreversible TKIs plus four reference irreversible TKIs4,37

Serial numberNSC numberShapegauss scorePosition among the top 200Average GI50* (μg·mL−1)Classification
1621158−500.858429.85 × 10−5Class A
2623897−494.190469.80 × 10−5Class A
3630912−472.511652.14 × 10−5Class A
4637681−469.007729.43 × 10−5Class A
5633920−462.811817.63 × 10−5Class A
6637635−462.406848.46 × 10−5Class A
7637606−459.648897.61 × 10−5Class A
8637638−458.330915.54 × 10−6Class A
9630913−456.322936.49 × 10−5Class A
10623881−452.136987.35 × 10−5Class A
11630908−452.072996.60 × 10−5Class A
12637630−451.9361005.67 × 10−5Class A
13637698−450.3571044.25 × 10−5Class A
14623867−450.0461068.94 × 10−5Class A
15642576−443.2511112.17 × 10−5Class B
1625603−440.2101141.13 × 10−5Class B
17637614−436.2581181.13 × 10−5Class A
18623882−430.1121222.55 × 10−5Class A
19637605−429.1361241.18 × 10−5Class A
20637626−428.0691258.06 × 10−5Class A
21111135−426.8221262.01 × 10−5Class B
22623877−421.3011341.97 × 10−5Class A
23623879−415.8051421.09 × 10−5Class A
24637706−409.3421525.50 × 10−5Class A
25637700−408.8191549.56 × 10−5Class A
26636097−406.9731607.16 × 10−5Class A
27623875−400.4591701.54 × 10−5Class A
2857443−381.5321985.46 × 10−5Class C
29T-00137−419.7841370.002**
30WZ31464−471.704660.029**
31WZ40024−526.444210.047**
32WZ80404−469.951690.009**

Notes:

The average GI50 overall cell lines from national Cancer Institute (NCI) Cancer Screening Results, December 2010;

in μM/L.

Abbreviations: TKI, tyrosine kinase inhibitor; GI50, the concentration required to achieve 50% growth inhibition; NSC, Nomenclature Standards Committee.

Validation of the hybrid strategy

The committed step in the pipeline of SBVS is to validate docking strategy.61 In order to test whether the extracted common features could be used to identify potential irreversible inhibitors, 23 molecules were collected to work as a testing set. This group compromised compound T-001, another 17 irreversible inhibitors34–37 with the quinoline scaffold, three irreversible inhibitors4 with a core different from quinoline, and two reversible inhibitors (gefitinib and erlotinib).4 As expected, the common features worked as a filter: reversible inhibitors were filtered out due to the fact that they lack the chemical reactive feature and only irreversible inhibitors with the quinoline scaffold or other scaffolds remained. As shown in Figure 5, they could be mapped onto the pharmacophore feature. It was demonstrated that the extracted common features could work as an effective filter to identify irreversible inhibitors whether they contain the quinoline scaffold or not.
Figure 5

In total, 21 irreversible TKIs4,34–37 in the testing set (refer to Table S2 in supplementary materials) were mapped on the common pharmacophore feature generated from compound T-001.37 T-001 is depicted by the stick and its carbon atoms are colored in green. WZ-31464 is depicted by the stick and its carbon atoms are colored pink; WZ-40024 is depicted by the stick and its carbon atoms are colored sky blue; WZ-80404 is depicted by the stick and its carbon atoms are colored blue.

Abbreviation: TKI, tyrosine kinase inhibitor.

Before conducting a SBVS, four reference compounds (compound T-001,37 WZ-3146,4 WZ-4002,4 and WZ-80404) with a scaffold different from quinoline/quinazoline were mixed into a molecule library that includes 868 compounds retrieved from the NCI database with the aid of a common feature filter. According to the optimal docking protocols, these molecules were docked to the binding site of the model system. As demonstrated in Table 4, the scorings of reference compound T-001 and the other three irreversible inhibitors with a scaffold different from quinoline/quinazoline are among the top 150 high score ranking results. These four irreversible inhibitors and the 28 potential inhibitors could be mapped onto the extracted pharmacophore feature (Figures 5 and 6). The results show that the hybrid strategy is an effective way to identify potent irreversible inhibitors bearing novel scaffolds different from quinoline/quinazoline analogs.
Figure 6

Potential irreversible TKIs that had been identified from the NCI database (refer to Table 4) were mapped onto the common pharmacophore feature generated from reference compound T-001.37 T-001 is depicted by the stick and its carbon atoms are colored green. Representative compounds NSC-621158, NSC-642576, and NSC-57443 are depicted by the stick and their carbon atoms are colored pink, sky blue, and blue, respectively.

Abbreviations: NCI, national Cancer Institute; NSC, Nomenclature Standards Committee; TKI, tyrosine kinase inhibitor.

Interaction mode analysis

It is easy to identify that these 28 compounds belong to three types of scaffolds (Figure 7 and Table S3 in supplementary materials). Class A is a group of symmetric compounds. There are two symmetric acrylamide functional groups in this class of compound and the alkene moieties are parts of the dimethyl-1,4-dihydropyridine group. Class B is asymmetric. An aromatic ring is directly linked to the N atom of the acrylamide functional group in both class A and class B, while in class C, it is a fluorenone group that directly binds to the N atom of the acrylamide.
Figure 7

Three classes of potential irreversible TKIs with novel scaffolds and their corresponding representative molecules.

Notes: Class A: NSC number 621158 (A); Class B: NSC number 642576 (B); Class C: NSC number 57443 (C).

Abbreviations: NSC, Nomenclature Standards Committee; TKI, tyrosine kinase inhibitor.

From each class, we selected one representative molecule that had the highest score among its own class. Together with reference compound T-001,37 the interaction modes between them and the mutant EGFR were analyzed with the aid of Discovery Studio.43 According to a pharmacophore model27,51 of the ATP binding site in protein kinases, the conserved site is divided into five regions: (1) adenine region, which contains two amino acid residues, Glutamine (Gln)791 and Methionine (Met)793 on the hinge region corresponding to Glutamic acid (Glu)339 and Met341 in TK cSrc; (2) sugar pocket, which is the region extending from Cys797 (corresponding to Cys345 in TK cSrc) toward the solvent; (3) hydrophobic region I, which is composed of two residues Threonine (Thr)790 and Thr854, corresponding to Met338 and Alanine (Ala)403 in TK cSrc; (4) hydrophobic region II, which is formed by residues Leucine (Leu)718 and Glycine (Gly)796, corresponding to Leu273 and Gly344 in tyrosine kinase cSrc; and (5) phosphate binding region, which is largely solvent exposed. The ATP binding site in TK cSrc is shown in Figure 8.
Figure 8

Pharmacophore model27,51 of the ATP binding site in tyrosine kinase cSrc (PDB ID: 2QQ7).44 This site is represented by the solid surface. Adenine region, sugar region, hydrophobic region I, and hydrophobic region II are colored pink, yellow, blue, and green, respectively.

Abbreviations: ATP, adenosine triphosphate; PDB, protein data bank; Ala, alanine; Gln, glutamine; Glu, glutamic acid; Met, methionine; Thr, threonine; Leu, leucine; Gly, glycine; Cys, cysteine

As shown in Figure 9, the four selected molecules similarly form hydrogen bonds with the N-H of residue Met341 and the β-carbon of acrylamide is near the S atom of Cys345. Surely, all of them have an interaction with the sugar region of the binding site. In hydrophobic region II, they interact with Leu273 and Gly344, while in the adenine region they only interact with Met341; Glu339 is far from them. Conversely, reference compound T-001,37 compound A (NSC-621158), and compound B (NSC-642576) do not interact with residues in the hydrophobic region I. In contrast, compound C (NSC-57443) interacts with Met338 in this region (Figure 9D).
Figure 9

The illustrations showing the interaction between the model system and three representative compounds as well as between the reference compound T-00137 and the model system. Residues are displayed in thin stick style. Carbon atoms of residues corresponding to adenine region, sugar region, hydrophobic region I, and hydrophobic region II are colored pink, yellow, blue, and light green, respectively. Compounds are represented by a thick stick and their carbon atoms are colored green. Interactions such as hydrogen bond, cation-π interaction, and π-σ interaction are highlighted.

Notes: Compounds in (A–D) correspond to T-001,37 NSC-621158, NSC-642576, and NSC-57443, respectively.

Abbreviations: Ala, alanine; Gln, glutamine; Glu, glutamic acid; Met, methionine; Thr, threonine; Leu, leucine; Gly, glycine; Cys, cysteine.

Specifically, it was noted that there is a π-σ interaction between T-00137 and the side chain of Leu273 (Figure 9A). For compound A, a hydrogen bond interaction with the side chain of Asp348 is observed. Meanwhile, a cation-π interaction is noted between this compound and the side chain of Lys295 (Figure 9B). The hydrogen bond formed between compound B and N-H of Met341 is quite strong. The distance between the heavy atoms with high electro-negativity values is only 2.254 angstrom. The 3,3′-(phenylazanediyl) dipropanenitrile moiety of compound B extends to solvent (Figure 9C). Additional interaction between compound C and the hydrophobic region II was also observed. That is, a hydrogen bond forms between compound C and the carbonyl oxygen of Leu273 (Figure 9D).

Conclusion

In this work, we were trying to find irreversible TKIs with novel scaffolds. At first, quantum chemical calculations and pharmacophore modeling were employed to extract common features of 138 currently known irreversible TKIs. The β-carbon of acrylamide group or propargyl amide group was identified by quantum chemical calculations as the electrophilic reactive site or one of the sites in irreversible TKIs. Commonly, each such TKI bears one aromatic hydrophobic center, one hydrogen donor, and two hydrogen acceptors. Next, optimal docking protocols were identified and validated. By adding two constraints and docking 138 currently known irreversible TKIs to the binding site of a model system, optimal docking protocols were identified from selecting the best one among 252 types of docking strategies. According to this optimal strategy, five potent irreversible TKIs were enriched within the top six of the 138 TKIs. From the NCI diversity database (260,071 compounds), 868 molecules were retrieved with the aforementioned common features. These molecules plus another four irreversible inhibitors were then docked to the binding site of a model system by employing optimal docking protocols. Finally, 28 potential irreversible TKIs, which belong to three classes of novel scaffolds and exhibited antitumor activity, were gained from the top 200 with high score rankings. The interaction modes between three representative candidates and the model system are similar to that between the model system and the reference compound T-001, one of the most potent inhibitors among the identified 138 irreversible inhibitors. Still, there are subtle differences in their interactions with the model system, which contribute to their different inhibition activities. The interaction scores of the four irreversible inhibitors are among the top 150 with high score rankings. Average antitumor activities (GI50) of the 28 compounds are in the order of magnitude of 10−5 μg·mL−1. The hybrid strategy starting with the extraction of common features is shown to be an effective approach to identify irreversible inhibitors with novel scaffolds. Candidates possessing unique scaffolds have a strong likelihood to act as further starting points in the preclinical development of potent irreversible inhibitors against T790M EGFR. 138 currently known irreversible tyrosine kinase inhibitors1–4 used for extracting common features and calibrating docking protocols Abbreviations: EGFR, epidermal growth factor receptor; IC50, half maximal inhibitory concentration; (S),sinister; (R), rectus; (Z), zusammen; (E), entgegen. Structures of 21 irreversible1–5 and two reversible5 TKIs used for testing the effect of the hybrid strategy to identify irreversible inhibitors Abbreviations: TKI, tyrosine kinase inhibitor; EGFR, epidermal growth factor receptor; IC50, half maximal inhibitory concentration; (R), rectus; (E), entgegen. The structures of 28 potential irreversible TKIs with three classes of novel scaffolds identified by docking molecules from NCI database (release 3 files) to the binding site of T790M EGFR using FRED 2.2.51 (OpenEye Scientific Software Inc., Santa Fe, NM, USA). Abbreviations: TKIs, tyrosine kinase inhibitors; EGFR, epidermal growth factor receptor; NCI, National Cancer Institute; NSC, Nomenclature Standards Committee; IC50, half maximal inhibitory concentration; (S),sinister; (R), rectus; (Z), zusammen; (E), entgegen.
Serial numberStructureEGFR IC50 (μM)
T-001 0.002
T-002 0.002
T-003 0.006
T-004 0.007
T-005 0.008
T-006 0.009
T-007 0.010
T-008 0.011
T-009 0.011
T-010 0.012
T-011 0.014
T-012 0.015
T-013 0.016
T-014 0.020
T-015 0.020
T-016 0.020
T-017 0.023
T-018 0.024
T-019 0.024
T-020 0.026
T-021 0.030
T-022 0.031
T-023 0.031
T-024 0.032
T-025 0.034
T-026 0.034
T-027 0.041
T-028 0.042
T-029 0.045
T-030 0.045
T-031 0.051
T-032 0.053
T-033 0.057
T-034 0.059
T-035 0.062
T-036 0.062
T-037 0.064
T-038 0.065
T-039 0.065
T-040 0.065
T-041 0.068
T-042 0.068
T-043 0.071
T-044 0.072
T-045 0.073
T-046 0.074
T-047 0.074
T-048 0.078
T-049 0.080
T-050 0.085
T-051 0.085
T-052 0.090
T-053 0.090
T-054 0.092
T-055 0.095
T-056 0.097
T-057 0.098
T-058 0.100
T-059 0.106
T-060 0.107
T-061 0.110
T-062 0.111
T-063 0.115
T-064 0.125
T-065 0.130
T-066 0.130
T-067 0.131
T-068 0.140
T-069 0.150
T-070 0.151
T-071 0.165
T-072 0.185
T-073 0.188
T-074 0.194
T-075 0.195
T-076 0.203
T-077 0.210
T-078 0.214
T-079 0.215
T-080 0.221
T-081 0.236
T-082 0.238
T-083 0.247
T-084 0.250
T-085 0.250
T-086 0.250
T-087 0.255
T-088 0.269
T-089 0.277
T-090 0.282
T-091 0.285
T-092 0.310
T-093 0.321
T-094 0.330
T-095 0.339
T-096 0.341
T-097 0.343
T-098 0.345
T-099 0.360
T-100 0.380
T-101 0.400
T-102 0.400
T-103 0.401
T-104 0.403
T-105 0.445
T-106 0.500
T-107 0.519
T-108 0.550
T-109 0.574
T-110 0.590
T-111 0.650
T-112 0.650
T-113 0.719
T-114 0.768
T-115 0.770
T-116 0.770
T-117 0.790
T-118 0.810
T-119 0.894
T-120 0.910
T-121 0.930
T-122 0.940
T-123 0.960
T-124 0.990
T-125 1.124
T-126 1.132
T-127 1.569
T-128 1.620
T-129 1.790
T-130 1.910
T-131 2.290
T-132 2.320
T-133 2.880
T-134 3.830
T-135 5.390
T-136 7.500
T-137 12.700
T-138 13.570

Abbreviations: EGFR, epidermal growth factor receptor; IC50, half maximal inhibitory concentration; (S),sinister; (R), rectus; (Z), zusammen; (E), entgegen.

Serial numberStructureT-790M EGFR IC50 (μM)
V-01 2.700
V-02 0.223
V-03 0.075
V-04 5.220
V-05 4.840
V-06 10.590
V-07 0.080
V-08 1.191
V-09 1.600
V-10 10.320
V-11 13.000
V-12 0.163
V-13 0.362
V-14 0.340
V-15 0.100
V-16 2.640
V-17 0.560
T-001 0.002
wz-3146 0.0295
wz-4002 0.0475
wz-8040 0.0095
Gefitinib >3.35
Erlotinib

Abbreviations: TKI, tyrosine kinase inhibitor; EGFR, epidermal growth factor receptor; IC50, half maximal inhibitory concentration; (R), rectus; (E), entgegen.

Serial noNSC noShapegauss score2StructurePosition among the top 200Classification of scaffold
1621158−500.858 42Class A
2623897−494.190 46Class A
3630912−472.511 65Class A
4637681−469.007 72Class A
5633920−462.811 81Class A
6637635−462.406 84Class A
7637606−459.648 89Class A
8637638−458.330 91Class A
9630913−456.322 93Class A
10623881−452.136 98Class A
11630908−452.072 99Class A
12637630−451.936 100Class A
13637698−450.357 104Class A
14623867−450.046 106Class A
15642576−443.251 111Class B
1625603−440.210 114Class B
17637614−436.258 118Class A
18623882−430.112 122Class A
19637605−429.136 124Class A
20637626−428.069 125Class A
21111135−426.822 126Class B
22623877−421.301 134Class A
23623879−415.805 142Class A
24637706−409.342 152Class A
25637700−408.819 154Class A
26636097−406.973 160Class A
27623875−400.459 170Class A
2857443−381.532 198Class C

Abbreviations: TKIs, tyrosine kinase inhibitors; EGFR, epidermal growth factor receptor; NCI, National Cancer Institute; NSC, Nomenclature Standards Committee; IC50, half maximal inhibitory concentration; (S),sinister; (R), rectus; (Z), zusammen; (E), entgegen.

  55 in total

Review 1.  Cell signaling by receptor tyrosine kinases.

Authors:  J Schlessinger
Journal:  Cell       Date:  2000-10-13       Impact factor: 41.582

Review 2.  Predicting molecular interactions in silico: I. A guide to pharmacophore identification and its applications to drug design.

Authors:  Oranit Dror; Alexandra Shulman-Peleg; Ruth Nussinov; Haim J Wolfson
Journal:  Curr Med Chem       Date:  2004-01       Impact factor: 4.530

3.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes.

Authors:  M D Eldridge; C W Murray; T R Auton; G V Paolini; R P Mee
Journal:  J Comput Aided Mol Des       Date:  1997-09       Impact factor: 3.686

Review 4.  Cell signaling by receptor tyrosine kinases.

Authors:  Mark A Lemmon; Joseph Schlessinger
Journal:  Cell       Date:  2010-06-25       Impact factor: 41.582

Review 5.  EGFR signaling and drug discovery.

Authors:  Georg Lurje; Heinz-Josef Lenz
Journal:  Oncology       Date:  2010-02-02       Impact factor: 2.935

Review 6.  Epidermal growth factor receptor in relation to tumor development: EGFR-targeted anticancer therapy.

Authors:  Isamu Okamoto
Journal:  FEBS J       Date:  2009-11-18       Impact factor: 5.542

Review 7.  Second-generation epidermal growth factor receptor tyrosine kinase inhibitors in non-small cell lung cancer.

Authors:  Gregory J Riely
Journal:  J Thorac Oncol       Date:  2008-06       Impact factor: 15.609

8.  Synthesis and structure-activity relationships of 6,7-disubstituted 4-anilinoquinoline-3-carbonitriles. The design of an orally active, irreversible inhibitor of the tyrosine kinase activity of the epidermal growth factor receptor (EGFR) and the human epidermal growth factor receptor-2 (HER-2).

Authors:  Allan Wissner; Elsebe Overbeek; Marvin F Reich; M Brawner Floyd; Bernard D Johnson; Nellie Mamuya; Edward C Rosfjord; Carolyn Discafani; Rachel Davis; Xiaoqing Shi; Sridhar K Rabindran; Brian C Gruber; Fei Ye; William A Hallett; Ramaswamy Nilakantan; Ru Shen; Yu-Fen Wang; Lee M Greenberger; Hwei-Ru Tsou
Journal:  J Med Chem       Date:  2003-01-02       Impact factor: 7.446

Review 9.  The epidermal growth factor receptor ligands at a glance.

Authors:  Marlon R Schneider; Eckhard Wolf
Journal:  J Cell Physiol       Date:  2009-03       Impact factor: 6.384

10.  Novel mutant-selective EGFR kinase inhibitors against EGFR T790M.

Authors:  Wenjun Zhou; Dalia Ercan; Liang Chen; Cai-Hong Yun; Danan Li; Marzia Capelletti; Alexis B Cortot; Lucian Chirieac; Roxana E Iacob; Robert Padera; John R Engen; Kwok-Kin Wong; Michael J Eck; Nathanael S Gray; Pasi A Jänne
Journal:  Nature       Date:  2009-12-24       Impact factor: 49.962

View more
  6 in total

Review 1.  Cheminfomatic-based Drug Discovery of Human Tyrosine Kinase Inhibitors.

Authors:  Terry-Elinor Reid; Joseph M Fortunak; Anthony Wutoh; Xiang Simon Wang
Journal:  Curr Top Med Chem       Date:  2016       Impact factor: 3.295

2.  The study of homology between tumor progression genes and members of retroviridae as a tool to predict target-directed therapy failure.

Authors:  Janaina Fernandes
Journal:  Front Pharmacol       Date:  2015-05-01       Impact factor: 5.810

3.  A computational method for the identification of candidate drugs for non-small cell lung cancer.

Authors:  Lei Chen; Jing Lu; Tao Huang; Yu-Dong Cai
Journal:  PLoS One       Date:  2017-08-18       Impact factor: 3.240

Review 4.  Role of cytokines in combinatorial immunotherapeutics of non-small cell lung cancer through systems perspective.

Authors:  Pragya Misra; Shailza Singh
Journal:  Cancer Med       Date:  2019-04-17       Impact factor: 4.452

5.  Hybrid Pharmacophore- and Structure-Based Virtual Screening Pipeline to Identify Novel EGFR Inhibitors That Suppress Non-Small Cell Lung Cancer Cell Growth.

Authors:  Chia-Wei Weng; Chi-Hsuan Wei; Jeng-Yuan Tsai; Yi-Hua Lai; Gee-Chen Chang; Jeremy J W Chen
Journal:  Int J Mol Sci       Date:  2022-03-23       Impact factor: 5.923

6.  Combining structure-based pharmacophore modeling, virtual screening, and in silico ADMET analysis to discover novel tetrahydro-quinoline based pyruvate kinase isozyme M2 activators with antitumor activity.

Authors:  Can Chen; Ting Wang; Fengbo Wu; Wei Huang; Gu He; Liang Ouyang; Mingli Xiang; Cheng Peng; Qinglin Jiang
Journal:  Drug Des Devel Ther       Date:  2014-09-02       Impact factor: 4.162

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.