Literature DB >> 30899155

Molecular designing, virtual screening and docking study of novel curcumin analogue as mutation (S769L and K846R) selective inhibitor for EGFR.

Noor Ahmad Shaik1,2, Huda M Al-Kreathy3, Ghada M Ajabnoor4, Prashant Kumar Verma1, Babajan Banaganapalli1,2.   

Abstract

The somatic mutations in ATP binding cleft of the tyrosine kinase binding domain of EGFR are known to occur in 15-40% of non-small cell lung cancer (NSCLC) patients. Although first and second generation anti-EGFR inhibitors are widely used to treat these patients, their therapeutic efficacy is modest and often results in adverse effects or drug resistance. Therefore, there is a need to develop novel as well as safe anti-EGFR drugs. The rapid emergence of computational drug designing provided a great opportunity to both discover and predict the efficacy of novel EGFR inhibitors from plant sources. In the present study, we designed several chemical analogues of edible curcumin (CUCM) compound and assessed their drug likeliness, ADME and toxicity properties using a diverse range of advanced computational methods. We also have examined the structural plasticity and binding characteristics of EGFR wild-type and mutant forms (S769L and K846R) against ligand molecules like Gefitinib, native CUCM, and different CUCM analogues. Through multidimensional experimental approaches, we conclude that CUCM-36 ((1E,4Z,6E)-1-(3,4-Diphenoxyphenyl)-5-hydroxy-7-(4-hydroxy-3-phenoxyphenyl)-1,4,6-heptatrien-3-one) is the best anti-EGFR compound with high drug-likeness, ADME properties, and low toxicity properties. CUCM-36 compound has demonstrated better affinity towards both wild-type (ΔG is -8.5 kcal/Mol) and mutant forms (V769L & K846R; ΔG for both is >-9.20 kcal/Mol) compared to natural CUCM and Gefitinib inhibitor. This study advises the future laboratory assays to develop CUCM-36 as a novel drug compound for treating EGFR positive non-small cell lung cancer patients.

Entities:  

Keywords:  Curcumin analogue; EGFR genetic; Molecular docking; Mutations; Novel compound

Year:  2018        PMID: 30899155      PMCID: PMC6408711          DOI: 10.1016/j.sjbs.2018.05.026

Source DB:  PubMed          Journal:  Saudi J Biol Sci        ISSN: 2213-7106            Impact factor:   4.219


Introduction

The lung carcinoma (LC) is a leading form of lung disease causing huge morbidity and mortality worldwide. This disease starts in the lung as a primary metastatic growth and then spreads to other parts of the body. The two main forms of LC are small cell lung cancer (SCLC) and non-small cell lung carcinoma (NSCLC) (Collins et al., 2007). The classical symptoms of LC are losing weight, breathing difficulties, cough (often with blood) and pain in the chest (Wang et al., 2016). LC has become the 4th leading reason for the hospitalization of respiratory disease patients (Salim et al., 2011). The leading cause of LC associated mortality (up to 85% of LC) is due to non-small cell lung cancer (NS-CLC). LC develops because of genetic as well as epigenetic changes of the cellular genome. The comprehensive molecular dissection of NS-CLC has laid the foundation to develop novel small drug molecules targeting mutations in EGFR, ALK, K-Ras, B-Raf, c-MET, NKX2-1, LKB1 genes, which are critical to the disease progression (Stella et al., 2013). Out of these LC genes, approximately 10–40% NS-CLC patients demonstrate activating mutations in EGFR gene. The EGFR gene encodes a transmembrane epidermal growth factor receptor protein that once activated (by ligand binding), transduces the signals that are important for cellular proliferation, differentiation, migration, and survival (Stewart et al., 2015). Therefore, targeting ATP binding cleft of the tyrosine kinase binding domain of EGFR by potential inhibitors (like Gefitinib and Erlotinib) has become an attractive treatment strategy for treating patients suffering from NS-CLC (Zhang et al., 2012). Interestingly, these EGFR inhibitors show strong binding affinity with mutant forms of EGFR compared to the native form, and they were initially seen to be giving encouraging results for treating NS-CLC patients. However, the emergence of acquired drug resistance in patients limits its usage in clinical settings (Stella et al., 2012). The acquired drug resistance of EGFR is attributed to the threonine to methionine substitution at residue position 790 (Zhang et al., 2012). The underlying molecular cause of this drug resistance is assumed to be due to the mutation led steric interferences in the EGFR and inhibitor binding characteristics. Although some irreversible inhibitors like CL 387–785 and HK Inh-272 are developed to counter the acquired resistance of EGFR molecule, they are found to modify the covalent bonds in EGFR protein structure, thus limiting their practical application (Sato et al., 2012). Therefore, there is a need to search and develop novel as well as safe treatment regimes (for treating NS-CLC patients) which can easily counteract the drug resistance induced by EGFR mutations. The traditional compounds obtained from nature are proven to be a potential source of several anti-cancer lead molecules (Banaganapalli et al., 2013b). Most of the successful anti-cancer drugs currently being used are derived from natural products or their analogues (Mondal et al., 2012). In this context, Curcumin (CUCM) (diferuloylmethane), a plant polyphenol (extracted from turmeric plants) is well known for its potential low toxic anti-cancer activity (see Fig. 1). The effectiveness of CUCM in treating lung, colon, breast and prostate cancers is already well reported (Starok et al., 2015). The CUCM compound is known to act against several molecular targets like EGFR, PKB/Akt, NF-κB, and MAPK inside the cancer cells (Kasi et al.,2016). In breast cancer cell lines, CUCM is demonstrated to inhibit the expression of EGFR and also induces the apoptosis (Sun et al., 2012). The chemically synthesized CUCM is being intensively studied to enhance its properties. However, whether CUCM or its analogues show similar effects to shut down EGFR expression (both in wild and mutant forms) in lung cancer cells is not yet investigated.
Fig. 1

Molecular Structure of native CUCM compound.

Molecular Structure of native CUCM compound. The classical laboratory investigations demand expensive drug compound (analogues) synthesis by series of chemical methods and laboratory investigations involving cellular systems and animal models. In contrary, the rapid development of bioinformatics discipline has provided a great opportunity for designing the anti-EGFR inhibitor compounds with desired specificity and sensitivity. Computational approaches have proven to be highly reliable in predicting the mutation induced drug resistance and also to design resistance evading drugs. The computational approaches built on machine learning and pattern classification methods (decision trees, support vector machine, and neural networks) can potentially classify the pathogenic mutations, create the three-dimensional protein structures, assist in designing therapeutic inhibitors and also predict the resistance of target proteins towards these inhibitor molecules. Owing to the lack of substantial amount of data in this direction, we sought to design novel CUCM analogues which can competitively inhibit the ATP binding cleft of tyrosine kinase domain in both native and mutated forms of EGFR molecules.

Methods

Designing the curcumin analogues library

CUCM consists of two aromatic phenolic groups connected by unsaturated carbonyl moieties. We used two aromatic phenolic moieties (consists of four functional groups -R1, -R2, -R3 and -R4) to design different chemical derivatives of CUCM. The linkers (functional chemical groups), electron donors (—OH, —CH2—CH3, —CH3) and electron acceptors (—NO2) were manually replaced or modified at four functional group of the two aryl rings. Through an exhaustive combination of the two aryl rings and four chemical substituents, we generated a unique library of 50 CUCM analogues. Table 1 reveals the aryl group substitutions of the CUCM compound we generated in the presented study.
Table 1

List of Linkers (electron donor and electron acceptor) used at R1, R2, R3, R4 sites of curcumin compound used in generating multiple curcumin analogues.

CompoundsR1R2R3R4
CURCUMIN—OH—OCH3—OCH3—OH
CUCM-1—OCH3—OH—OH—OCH3
CUCM-2—OCH3—OCH3—OH—OCH3
CUCM-3—OCH3—OCH3—OCH3—OCH3
CUCM-4—OCH3—ONH2—ONH2—OCH3
CUCM-5—OH—ONH2—ONH2—OH
CUCM-6—ONH2—OHONH(CH2CH3)—ONH2
CUCM-7—OH—OH—O(OH)—OH
CUCM-8—OH—ONH2—OCH3—OH
CUCM-9—OH—OH—OH—OH
CUCM-10—OCH2CH2CH3—OCH2CH2CH3—H2CH2CH <svg xmlns="http://www.w3.org/2000/svg" version="1.0" width="20.666667pt" height="16.000000pt" viewBox="0 0 20.666667 16.000000" preserveAspectRatio="xMidYMid meet"><metadata> Created by potrace 1.16, written by Peter Selinger 2001-2019 </metadata><g transform="translate(1.000000,15.000000) scale(0.019444,-0.019444)" fill="currentColor" stroke="none"><path d="M0 440 l0 -40 480 0 480 0 0 40 0 40 -480 0 -480 0 0 -40z M0 280 l0 -40 480 0 480 0 0 40 0 40 -480 0 -480 0 0 -40z"/></g></svg> CH—OCH2CH2CH3
CUCM-11—OCH2CH2C <svg xmlns="http://www.w3.org/2000/svg" version="1.0" width="20.666667pt" height="16.000000pt" viewBox="0 0 20.666667 16.000000" preserveAspectRatio="xMidYMid meet"><metadata> Created by potrace 1.16, written by Peter Selinger 2001-2019 </metadata><g transform="translate(1.000000,15.000000) scale(0.019444,-0.019444)" fill="currentColor" stroke="none"><path d="M0 520 l0 -40 480 0 480 0 0 40 0 40 -480 0 -480 0 0 -40z M0 360 l0 -40 480 0 480 0 0 40 0 40 -480 0 -480 0 0 -40z M0 200 l0 -40 480 0 480 0 0 40 0 40 -480 0 -480 0 0 -40z"/></g></svg> C—OH—OCH2CH2CC—OH
CUCM-12—OCH2CHCHCH3—OCH2CH2CC—OCH2CH2CC—OH
CUCM-13—OH—OCH2CH2CC—OCH2CH2CC—OH
CUCM-14—ONH2—OH—OCH2CH2CC—OH
CUCM-15—OH—OH—OH—ONH2
CUCM-16—OCH2CH2CC—OH—OHONH(OH)
CUCM-17—OCHCH—ONH2—OCHCH—OH
CUCM-18—OH—OCHCH—OH—OCH2(OH)
CUCM-19—OCHCH—OCHCH—OCHCH—OCH2(OH)
CUCM-20—OCHCH—OCHCH—OCH2NH2—OH
CUCM-21—OCH2NH—OH—OH—OH
CUCM-22—OCH2NH2—OCH2NH2—OCH2NH2—OCH2NH2(OH)
CUCM-23—OH—OH—OCH2NH2—OCH2(OH)
CUCM-24—OHONH(OH)—ONH(OH)—OCH2C6H5
CUCM-25—OCH2(OH)—OH—OH—OH
CUCM-26—OCH2(OH)ONH(OH)—OH—OH
CUCM-27—OHONH(OH)—ONH(OH)—O(CH2(OH)
CUCM-28—OH—OH—ONH(OH)—OH
CUCM-29—OCH2(OH)—OCH2(OH)—OCH2(OH)—OCOCH3
CUCM-30—OCH2NH3—OCH2(OH)—OH—OCOCH3
CUCM-31—OCH2NH3—OH—OCH2(OH)—OH
CUCM-32—OCH2NH2(OH)—OH—OCH2(OH)—OCOCH3
CUCM-33—OCH2NH2(OH)—OCH2NH2(OH)—OCH2(OH)—O(OCH3)
CUCM-34—OCH2(OH)—OCH2(OH)—OCH2NH2(OH)—OH
CUCM-35—OCH2C6H5—OCH2C6H5—OCH2C6H5—OH
CUCM-36—OH—OH—OCH2C6H5—OH
CUCM-37—OCH2C6H5—OH—OCH2C6H5—OCH2(OH)
CUCM-38—O(CH2(OH)—O(CH2(OH)—O(CH2(OH)—OCH2(OH)
CUCM-39—O(CH2(OH)—OH—O(CH2(OH)—OH
CUCM-40—OCOCH3—OCOCH3—OH—OH
CUCM-41—OH—OCOCH3—OCOCH3—OCH2(OH)
CUCM-42—OH—OH—OCOCH3—OCH2NH3
CUCM-43—OCOCH3—OCOCH3—OCOCH3—OCH2NH3
CUCM-44—O(OCH3)—O(OCH3)—OH—OCH2NH2(OH)
CUCM-45—OH—OH—O(OCH3)—OCH2NH2(OH)
CUCM-46—O(OCH3)—OH—O(OCH3)—OCH2(OH)
CUCM-47—O(OCH3)—OH—OH—OCH2C6H5
CUCM-48—ONH(OCH3)—ONH(OCH3)—OH—OH
CUCM-49—ONH(OCH3)—OH—ONH(OCH3)—OH
CUCM-50—ONH(OCH3)—ONH(OCH3)—ONH(OCH3)—OCOCH3
List of Linkers (electron donor and electron acceptor) used at R1, R2, R3, R4 sites of curcumin compound used in generating multiple curcumin analogues.

Determination of drug likeliness properties

The 3D topology of energy minimized coordinates of all CUCM analogues was described with the help of PRODRG tool (Schuttelkopf and van Aalten, 2004). Generally, natural bioactive molecules comprise functional chemical groups which possess certain properties that are similar to known drugs. Therefore, calculating the molecular properties of these bioactive compounds is significant drug discovery and development. Herein, we have used Molinspiration program (accessible at http://www.molinspiration.com/) to estimate the drug-likeness of CUCM analogues. This web server has several options to design any chemical compound either manually, or by intaking the query compounds in the format of canonical SMILES for calculating their molecular properties as well as bioactivity scores. Molinspiration web server calculates milogP (partition coefficient), TPSA (topological polar surface area), mass (molecular size), natoms (range of atoms), range of O or N (number of hydrogen bond acceptor), range of OH (number of hydrogen bond donor), nrotb (range of rotatable bonds) and molecular volume (volume of drug distribution) characteristics for any bioactive compound. The Molinspiration server is also used to estimate the drug likeliness of designed analogues by predicting the bioavailability scores of G-protein-coupled receptors (GPCR) ligands, ion channel modulator, enzymes and nuclear receptors (Lipinski 2004). Overall different molecular properties are taken into consideration while screening the potential lead CUCM analogue from a large pool of query molecules we designed in this study.

Determination of ADME-Tox properties

The ADME (absorption, digestion, metabolism, and excretion) and toxicity (mutagenic, tumorigenic, irritant) properties are directly related to the biological effect of drugs and their metabolic fate in an organism. Therefore, we determined ADME properties of potential CUCM analogues and their possible effects on health using Variable Nearest Neighbor ADMET (vNN-ADMET) web server (https://vnnadmet.bhsai.org/vnnadmet/home.xhtml) (Schyman et al., 2017). This web server can process both prebuilt or customized ADMET models by accepting one or more query molecules in canonical SMILES format as an input. This web server calculates the structural distance between molecules to construct ADMET models of potential CUCM analogues. These ADMET models quickly assess some of the important properties like cytotoxicity, mutagenicity, cardiotoxicity, drug-drug interactions, microsomal stability and the likelihood of causing liver injury of any potential drug candidates.

Constructing Gefitinib sensitive and resistance mutations in 3-dimensional structure of EGFR molecule and domain mapping

In this study, we studied the differential interaction of both Gefitinib (N-(3-Chloro-4-fluorophenyl)-7-methoxy-6-(3-morpholinopropoxy)quinazolin-4-amine) (a known EGFR inhibitor) sensitive and resistant mutations of EGFR molecules. For this purpose, we have initially downloaded the EGFR wild-type protein structure from Protein Databank (PDB ID: 5XWD, chain A (Matsuda et al., 2018) and 3GOP, chain A (Red Brewer et al., 2009)). This protein structure served as a template for constructing EGFR mutant forms. For two mutant models, we used homology-based computer modeling tool like Modeller9v11(Webb abd Sali, 2016). The full-length amino acid sequence of EGFR (in FASTA format) extracted from KEGG gene database (entry number KE1956) was used to incorporate mutated against wild-type amino acid residues for providing input to modeler program. The Modeller is an easily accessible web interface, which depends on protein NMR information to satisfy spatial restraints in creating probability density function for determining atomic locations in the protein models. This method aligns the input amino acid sequences and template protein structures. The built protein models, whether native or mutant forms were energy minimized, by using Gromacs program (Pronk et al., 2013). The structure quality of energy-minimized protein models is assessed with the help of validation tools like Procheck (Laskowski et al., 1996) and ProSA (Wiederstein and Sippl, 2007). PyMOL (http://pymol.sourceforge.net) program was used in the visualization and analysis of all the protein structures built. The NCBI conserved domain search (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) web server is used in identifying and mapping the gefitinib-sensitive and resistant mutations in the EGFR molecule.

Analysis of structural drifts in mutated EGFR

To estimate the structural drifts in mutated EGFR molecules, we have superposed the Cα traces and backbone atoms of 3D structures using Yasasra (Krieger and Vriend, 2014). The exact structural fit (in term of Root Mean Square Deviation-RMSD values) between two amino acid residues or whole polypeptide chains of EGFR is measured. RMSD value is a quantitative metric of structural resemblance between two atomic coordinates when superimposed on each other (Banaganapalli et al., 2016).

Protein-drug interaction

In this study, we used AutoDock 4.0 (Morris et al., 2008) to execute a docking simulation of Gefitinib and CUCM analogues against EGFR protein using a Lamarckian Genetic Algorithm (LGA). Throughout the procedure, the ligand molecule was maintained in the flexible form and protein in its rigid form. The equal distribution of polar hydrogens and Gasteiger charges is ensured for both protein and ligand molecules, before initiating the molecular docking procedure. The histidine amino acids (at delta-HD1 or epsilon-HE2 positions) on a protein molecule were neutralized using edit Histidine hydrogens option in Autodock MGL tool. The grid parameter file (calculate the grid map of protein-ligand) was prepared using the default parameters of a grid of 60 × 60 × 60 points in x, y, and z directions and center spacing of the grid is 0.367 Å (approximately 1/4 of the length of c–c covalent bond). Finally, a docking file with different set parameters was prepared with AutoDock tool. The corresponding LGA parameters were set to default settings, which includes 150 runs, 150 conformational possibilities, 50 populations and 2,50,0000 energy evaluations. For docking procedure, translation parameters were set at 0.2 Å; the quaternion to 5.0 Å; the torsion angle to 0.5 Å, and the RMS cluster tolerance level to 0.75 Å (Banaganapalli et al., 2013a). At the end of docking step, ligand molecules which showed the maximum binding energy in the protein-ligand docking complex were selected. The resultant complex structures were explored using Pymol program (Yuan et al., 2016).

Results

The physicochemical screening of Curcumin derivatives

The physiochemical and pharmaceutical properties such as miLogP value, molecular weight, number of hydrogen bond acceptors, number of hydrogen bond donors, and number of rotatable bonds for each CUCM analogue were analyzed. These properties were evaluated against Lipinski's rule of five that predicts drug-likeness of the potential drug compound. Lipinski's rule of five states that most of the molecules with good membrane permeability will have LogP ≤5, molecular weight ≤500, the number of hydrogen bond acceptors ≤10, and the number of hydrogen bond donors ≤5. Hence, all the 50 chemical analogues of CUCM were evaluated for various parameters that would help to adjudge the particular substance to be a probable drug. Accordingly, we observed that 17 CUCM analogues are found to be compliant to the Lipinski's requirement for a potential drug compound (Table 2). The biophysical scores of all those 17 CUCM analogues were as follows, miLogPvalue is <4.63, MW is <423.42 kDa, TPSA is <140 Å, nON is <9, noHNH is <5, RB is <10 and molecular volume is >374.99 g/mol. Rotatable bonds are important for conformational changes of molecules and also determines the binding characteristics between receptors and their ligand molecules. It has been reported that the number of rotatable bonds should be ≤10 for passing oral bioavailability criteria (Priya et al., 2015). The CUCM analogues under investigation had low to high number of rotatable bonds (0–8 in general).
Table 2

Drug-likeness physico-chemical properties of curcumin analogues by Molinspiration.

CompoundmiLogPTPSA (Å)MW (kDa)nONnOHNHnrotbVolume (g/mol)
CUCM-13.0596.22368.38637331.83
CUCM-23.3585.23382.41628349.36
CUCM-33.6674.23396.44619366.89
CUCM-42.83126.28398.42859356.34
CUCM-72.62127.44356.33756305.76
CUCM-82.63122.25369.37757326.56
CUCM-92.43118.21340.33655296.77
CUCM-143.06122.25407.42759365.86
CUCM-163.12128.48423.428510374.99
CUCM-173.54111.25407.427410366.43
CUCM-182.71116.45396.39749351.26
CUCM-252.10127.44370.36757322.56
CUCM-262.36128.48399.40859352.49
CUCM-364.63107.22416.43647369.15
CUCM-391.76136.68400.38859348.35
CUCM-422.42131.48413.438510369.15
CUCM-452.66137.71415.409510361.48
Curcumin3.0596.22368.38637331.83

MiLogP = molinspiration Octonal/water partition coefficient; nON = number of H-bond acceptor; nOHNH = number of H-bond donors; nrotb = number of rotatable bonds; MW = molecular weight; TPSA = total polar surface area and molecular volume.

Drug-likeness physico-chemical properties of curcumin analogues by Molinspiration. MiLogP = molinspiration Octonal/water partition coefficient; nON = number of H-bond acceptor; nOHNH = number of H-bond donors; nrotb = number of rotatable bonds; MW = molecular weight; TPSA = total polar surface area and molecular volume.

Molecular properties of Curcumin analogues obtained from Molinspiration

Fig. 2 reveals the drug-likeliness model scores generated by molsoft program, where blue color refers to drug-like behavioral properties, and green color refers to non-drug-like properties. Drug-like compounds show prediction values in a positive value, and non-drug-like compounds show zero or negative values. The drug-likeness prediction scores for CUCM compound (with a score of −0.66) and most (11/17; 64.70%) of its chemical analogues were negative. Only CUCM-7 (with a score of 0.37), CUCM-14 (with a score of 0.22), CUCM-26 (with a score of 0.21), CUCM-36 (with a score of 0.22), CUCM-42 (with a score of 0.27) and CUCM-45 (with a score of 0.26) analogues showed positive values. The drug-likeliness scores of these CUCM analogues are comparable to the standard anti-EFGR drug Gefitinib (1.26). Therefore, it is assumed that CUCM-7, CUCM-14, CUCM-26, CUCM-36, CUCM-42, CUCM-45, are good bioactive molecules which can potentially act as inhibitors of EGFR.
Fig. 2

Drug-likeness model score of newly designed CUCM analogues, native CUCM compound and Gefitinib, a standard anti-EGFR drug. Positive score for any query compound indicates its drug potential.

Drug-likeness model score of newly designed CUCM analogues, native CUCM compound and Gefitinib, a standard anti-EGFR drug. Positive score for any query compound indicates its drug potential.

ADMET predictions

The pharmacokinetics and safety profile, simply known as ADME-Tox of the CUCM analogues was predicted by the variable nearest neighbor computational method. ADME influences the drug levels and kinetics of drug exposure to the tissues. Of the six drug like CUCM analogues, selected in the previous stage, 5 compounds have shown negative predictions for ADMET endpoints. The compounds CUCM-7 and CUCM-26 showed negative predictions for three endpoints like Human Liver Microsomal (HLM) Stability test, acts as an inhibitor for Cyp2C9 and Cyp2C19 enzymes. However, in addition to HLM, Cyp2C9 and Cyp2C19 enzymes, native CUCM and its CUCM-14, CUCM-42 and CUCM-45 analogues act as inhibitors for matrix metalloproteinases (MMP). Only CUCM-36 ((1E,4Z,6E)-1-(3,4-Diphenoxyphenyl)-5-hydroxy-7-(4-hydroxy-3-phenoxyphenyl)-1,4,6-heptatrien-3-one) compound was predicted to be a potential drug compound as it showed negative predictions for all 15 ADMET endpoints of vNN method. The maximum recommended therapeutic dose of this CUCM-36 compound is 421 mg per day (Table 3).
Table 3

ADME-TOX predictions of curcumin analogues by variable nearest neighbor (vNN) method.

CompoundCyto-toxicityHLMCyp1A2 inhibitorCyp3A4 inhibitorCyp2D6 inhibitorCyp2C9 inhibitorCyp2C19 inhibitorBBBP-gp inhibitor & substrateMMPAMESMRTD (mg/day)
CUCM-7NoYesNoNoNoYesYesNoNoNoNo226
CUCM-14NoYesNoNoNoYesYesNoNoYesNo242
CUCM-26NoYesNoNoNoYesYesNoNoNoNo218
CUCM-36NoNoNoNoNoNoNoNoNoNoNo421
CUCM-42NoYesNoNoNoYesYesNoNoYesNo249
CUCM-45NoNoYesNoNoNoNoNoNoNoYes406
CurcuminNoYesNoNoNoYesYesNoNoYesNo428

*HLM = Human Liver Microsomal Stability, Cyp1A2 = Cytochrome p450 1A2, Cyp3A4 = Cytochrome p450 3A4, Cyp2D6 = Cytochrome p450 2D6, Cyp2C9 = Cytochrome p450 2C9, Cyp2C19 = Cytochrome p450 2C19, BBB = blood brain barrier, P-gp = glycoprotein, MMP = metallo matrix protein, MRTD = maximum recommended therapeutic dose.

ADME-TOX predictions of curcumin analogues by variable nearest neighbor (vNN) method. *HLM = Human Liver Microsomal Stability, Cyp1A2 = Cytochrome p450 1A2, Cyp3A4 = Cytochrome p450 3A4, Cyp2D6 = Cytochrome p450 2D6, Cyp2C9 = Cytochrome p450 2C9, Cyp2C19 = Cytochrome p450 2C19, BBB = blood brain barrier, P-gp = glycoprotein, MMP = metallo matrix protein, MRTD = maximum recommended therapeutic dose.

EGFR molecular modeling and determination of structural divergence

The identification of mutant protein structure is essential to understand how the amino acid substitution can change structural features of proteins. Therefore, we constructed two mutant forms of EGFR protein models (S769L & K846R) by comparative modeling methods using MODELLER sever, which provided approximately 100 models as an output. We selected the best predicted model based on the RMSD and template modeling (TM) scores, which were generated when native and mutant EGFR proteins were aligned against each other. All the EGFR mutant models showed an RMSD value of <2.0 and a TM value in between 0.5 and 1, confirming their good topology. The stereochemical and geometrical parameters of the built EGFR models was assessed through PROCHECK and PROSA servers. The PROCHECK analysis revealed that for both mutated models, 95% their amino acids are located in the allowed region. The Ramachandran plot constructed for EGFR models showed that 99.1% amino acids were present in the allowed regions (Fig. 3), and only 0.9% amino acids in disallowed regions confirming the reliability of built EGFR models. The G-factor value representing the dihedral angles in side chains of EGFR mutated model is found to be 1.0. This value is well within the permitted range to confirm the normality of the protein structure. The ProSA-web analysis of EGFR muted models (S769L and K846R) showed overall model quality (Z-score) values −10.02 and −10.23, respectively. These values are within the range characteristic of native proteins indicating good quality of the built model. The overall quality or Z score of the EGFR and EGFR mutated models reveals a similar correlation of the energy pattern between X-ray structures. The Z-score measures the deviation of total energy of the structure concerning an energy distribution derived from random conformations.
Fig. 3

The stereochemical quality analysis of EGFR wildtype and mutant protein models by Procheck [Ramachandran plot – favored region is red in color; allowed region is yellow in color; disallowed region is white in color] and ProSa [overall model quality (black dots indicate the match between experimentally solved protein structures distinguished by dark blue (X-ray) and light blue (NMR); and residue model quality graphs (amino acid energies, +ve values regions are error part of the structure, whereas −ve value region considered to high structural quality)].

The stereochemical quality analysis of EGFR wildtype and mutant protein models by Procheck [Ramachandran plot – favored region is red in color; allowed region is yellow in color; disallowed region is white in color] and ProSa [overall model quality (black dots indicate the match between experimentally solved protein structures distinguished by dark blue (X-ray) and light blue (NMR); and residue model quality graphs (amino acid energies, +ve values regions are error part of the structure, whereas −ve value region considered to high structural quality)].

Superimposition analysis

The biophysical orientation of a three-dimensional protein structure could determine its stability, ligand binding efficiency, and other associated functional properties. In this study, we analyzed the EGFR structural drifts (in terms of RMSD scores) among Gefitinib resistant (V769L) and sensitive (K846R) mutations of EGFR molecule (Fig. 4). The amino acid sequence similarity among both forms of EGFR molecules is 99.33%. The structurally similar amino acid residues or whole polypeptide chain levels show the RMSD values in between 0 and <2.0 Å. The larger the RMSD value between two query structures indicates their dissimilarity, and zero means they are identical in structure. The RMSD values of EFGR mutants at whole 3-dimensional structures and at amino acid residue levels is found to be 0.38 Å and 2.675 Å for V769L and 0.23 Å and 2.835 Å for K846R, respectively. These findings highlight the subtle structural changes caused by substituted amino acid residues in the structure of native EGFR protein molecule.
Fig. 4

The superimposition of EGFR wildtype and mutant V769L& K846R proteins in PyMOL software (circle zoom view).

The superimposition of EGFR wildtype and mutant V769L& K846R proteins in PyMOL software (circle zoom view).

Domain analysis

The NCBI domain scan has predicted six functional domains in EGFR protein molecule, of which domain 1, Receptor L domain is localized in between 57th and168th amino acids. This single-stranded right-hand beta-helix domain creates the bilobal ligand binding site in EGFR protein. The second one, Furin-like cysteine rich domain localized in between 185th and 355th amino acids functions like protease domain. The third one, Receptor L3 domain performs similar functions like domain 1 of EGFR. The fourth one, growth factor receptor domain IV located in between 505 and 637 amino acids, plays an important role in interacting with furin-like domain of EGFR. The fifth domain is transmembrane domain located in between 634 and 677 amino acids. The sixth one, catalytic domain of the protein tyrosine kinase located in between 704th and 1016th amino acids catalyzes the transfer of the gamma-phosphoryl group from ATP to tyrosine (Tyr) residues in protein substrates. The V769L (gefitinib sensitizing) and K846R (gefitinib resistance) mutations we analyzed in this study are localized in protein tyrosine kinase (6th domain) of EGFR molecule.

Molecular docking

The Gefitinib, CUCM and shortlisted CUCM-36 compound were initially energy minimized by adding partial surface charges using PRODRG web server. Then, these energy minimized structures were used in molecular docking against both native and mutant forms of EGFR to assess their inhibitory properties (Fig. 5). Molecular docking results showed that gefitinib and CUCM- 36 compounds interact with ATP binding cleft of EGFR via non-covalent interactions (hydrogen bonding). The gefitinib was determined to release the binding energy (ΔG) of −7.5 kcal/Mol and forms 3 hydrogen bonds with the LYS745, Gly768 and Glu863 of the EGFR ATP binding pocket. The binding energy of Gefitinib with mutated EGFR (K846R) is −8.1 kcal/Mol and forms 3 hydrogen bonds with Lys745, Arg748 and Glu 762 amino acids. Whereas, with V769L mutant form of EGFR, Gefitinib forms only one hydrogen bond with Leu862 amino acid and releases the binding energy of −6.8 kcal/Mol. The parent CUCM compound has shown a better binding affinity than Gefitinib towards EGFR molecule (Table 4). With native EGFR molecule, CUCM forms hydrogen bonds with Arg23, Ser246, Lys261 and Asn604 and releases the binding energy (ΔG is −7.8 kcal/Mol). The binding affinity of mutated EGFR is seen to be higher for both K846R (ΔG is −8.0 kcal/Mol) and V769L (ΔG is −8.1 kcal/Mol) forms. Interestingly, the CUCM-36 compound shows higher affinity while binding to V769L (ΔG is −9.60 kcal/Mol) compared to the K846R-mutant (ΔG is −9.2 kcal/Mol) and native EGFR (ΔG is −8.5 kcal/Mol) molecules. These results show that the CUCM-36 performs better than CUCM and Gefitinib compounds in effectively inhibiting EGFR molecule both in native, and mutant (gefitinib-sensitive and resistant) forms.
Fig. 5

The visualization of molecular docking analysis of wild type and mutant forms of EGFR molecule against CUCM-36, native CUCM compound and Gefitinib drugs.

Table 4

Molecular Docking Analysis Results of Curcumin, CUCM-36 and Gefitinib Compounds.

DrugProteinBinding energy* (kcal/Mol)No of H bonds (drug-enzyme)Interacting amino acids
GefitinibEGFR−7.53Lys745, Gly768 and Glu863
EGFR(V769L)−6.81Leu862
EGFR(K846R)−8.23Lys745, Arg748 and Glu762



CurcuminEGFR−7.85Arg23, Ser246, Lys261 and Asn604
EGFR(V769L)−8.07Arg23, Ser246, Arg244, Lys261 and Asn604
EGFR(K846R)−8.13Gly696, Arg429 and Arg705



CUCM-36EGFR−8.53Lys745, Lys754, Arg748
EGFR(V769L)−9.604Lys745, Arg748, Gly863, Ala864
EGFR(K846R)−9.24Lys745, Arg748, Val765, Gly863

The change in binding free energy is related to the inhibition constant as per the following the equation: ΔG = RT in Ki, where R is the gas constant 1.987 cal K−1 Mol−1, and T is the absolute temperature assumed to be 298.15 K.

The visualization of molecular docking analysis of wild type and mutant forms of EGFR molecule against CUCM-36, native CUCM compound and Gefitinib drugs. Molecular Docking Analysis Results of Curcumin, CUCM-36 and Gefitinib Compounds. The change in binding free energy is related to the inhibition constant as per the following the equation: ΔG = RT in Ki, where R is the gas constant 1.987 cal K−1 Mol−1, and T is the absolute temperature assumed to be 298.15 K.

Discussion

Targeting EGFR molecule with drugs like Gefitinib and Imatinib remains as a first line therapy for lung cancer treatment (Gridelli et al., 2011). Gefitinib is a small molecule drug that competitively inhibits the binding of ATP at the active site and modulates tyrosine kinase activity of EGFR. The EGFR sequencing of 10% of gefitinib responders of NSCLC revealed the evidence of somatic gain-of-function mutations in its tyrosine kinase domain. The Gefitinib responders’ rate is up to 40% among the patients belonging to Asian ethnic group, non-smokers and those presenting adenocarcinoma histology (Chan and Hughes, 2015). Approximately 77% of these clinical responders to Gefitinib revealed mutations in EGFR gene as compared to the 7% of NSCLC who are refractory to gefitinib (Sharma et al., 2007). Most of the EGFR mutations are found to be clustered around ATP-binding pocket within the tyrosine kinase domain (exons 18–24). However, deletion mutations at exon 19 and point mutation at exon 21 (L858R) represent the most common EGFR mutations (Shigematsu et al., 2005) and their presence indicates sensitivity to gefitinib. Gefitinib treatment sometimes leads to the phenomenon of acquired resistance in around 45–60% of NSCLC tumors, through the accumulation of T90M mutation in exon 20 of EGFR molecule. This EGFR mutation leads to constitutive activation of downstream signaling pathways and promotes cellular growth and proliferation. The high cost, numerous side effects and secondary resistance caused by the acquisition of new mutations due to Gefitinib therapy pose a big challenge for using them in lung cancer treatment (Hong et al., 2016). Owing to the adverse effects caused by chemotherapeutic agents, there has been an increasing interest in using multitargeted, inexpensive, innocuous and readily available phytomedicine or nutraceuticals for treating diseases like lung cancer (Hosseini and Ghorbani, 2015). Especially, if these herbal medicine compounds are derivatives of ethnic food agents, then it makes them more acceptable from perspective of safety and effectiveness. The availability of EGFR-tyrosine kinase structure has provided an opportunity to virtually screen potential active anti-EGFR compounds (Choowongkomon et al., 2010). CUCM, a phenolic compound [1, 7-bis (4-hydroxy-3-methoxyphenyl)-1, 6-heptadien-3, 5-Dione] derived from plant Curcuma longa is traditionally used as an edible agent (in the form of turmeric power) to fight inflammation and microbial infections due to its versatile pharmacological properties (Gupta et al., 2013). CUCM molecule acts against a diverse range of therapeutically important molecular targets of cancer signaling pathways such as EGFR, Ras, p53, AKT, Wnt-β catenin, PI3K, and mTOR, etc. (Kasi et al., 2016). The current literature suggests that CUCM can block proliferation, transformation, and invasion of lung cancer cells both in vitro and in vivo (Aggarwal and Harikumar, 2009). CUCM shows various effects on cancer cells like G1/S arrest and apoptosis induction (Karunagaran et al., 2005). Phase 1 and II clinical trials have shown that CUCM compound is orally well tolerated and have no dose-limiting toxicity (Gupta et al., 2013). However, the relative poor bioavailability due to intestinal absorption, rapid metabolism, and systemic elimination limits the clinical usage of CUCM (Hatcher et al., 2008). Therefore, numerous efforts have been made to enhance the metabolic stability and anti-proliferative activity of CUCM by designing CUCM derivatives, which resemble native compound but possess modified chemical side chains on functional moieties (Vyas et al., 2013). In recent decades, some studies have successfully synthesized new CUCM analogues [symmetrical 1, 5-diarylpentadienone molecules with extra alkoxy substitutions] and demonstrated them to possess 30 times additional growth-suppressive activity compared to their native counterpart (Park et al., 2013). Furthermore, these analogues have down-regulated the expression of beta-catenin, k-ras, cyclin D1, c-myc, at a 1/8th concentration at which normal CUCM shows its effect (Ohori et al., 2006). In this report, we have designed 50 different chemical derivatives of CUCM compound with the aim of identifying the differential affinity of CUCM derivatives against Gefitinib sensitive and resistant forms of EGRF molecule. The presence of a —OH group at the C4, C15 positions and —OCH3 group at C5, C16 positions in C21H20O6 [(1E, 6E)-1, 7–bis (4-hydroxy-3-methoxyphenyl) hepta1,6-diene-3-5-dione] did not favor the bioavailability of CUCM. Hence, the introduction of polar groups, such as hydroxy or methoxy, around the aryl moiety of CUCM is likely to enhance the bioactivity. In the CUCM-36 ((1E, 4Z, 6E)-1-(3,4-Diphenoxyphenyl)-5-hydroxy-7-(4-hydroxy-3-phenoxyphenyl)-1,4,6-heptatrien-3-one) analogue, an additional reactive —OH groups at C4, C5, C15 positions, and an aromatic hydrocarbon ring (—CH2C6H5) at the C16 position were added to the native CUCM (C21H20O6) molecule. This chemical modification might have contributed to increased bioavailability of the modified CUCM analogue compared with native CUCM compound. Additionally, the presence of additional—OH groups at C4, C5 and C16 positions in CUCM-36 appears to have favored the anti-EGFR activity. In the other CUCM analogues, less polar side chains are present at R1 (—OH), R2 (—OCH3), R3 (—OCH3) and R4 (—OH) positions which could have affected their overall molecular activity and bioavailability. In this study, we designed CUCM-36 which can effectively inhibit both Gefitinib sensitive (K846R located in exon 21) and resistant (V769L located in exon 20) mutations of EGFR molecule with higher affinity. In conclusion, this report describes the atomic scale modification of edible CUCM to design CUCM-36 analogue as a probable drug for targeting EGFR mutations. Computational testing showed that this CUCM analogue is the best probable anti-EGFR drug, due to its drug-likeness, ADME properties, and low toxicity properties. When compared to the known inhibitors like native CUCM or Gefitinib, CUCM-36 showed better efficacy in binding ATP binding cleft of EGFR in both native and mutant forms. Our multidimensional drug screening approaches demonstrate the utility of computational tools in designing and rapid preliminary screening of potential anti-EGFR drug compounds from natural compounds. This study confirms that computational protocols are highly efficient in discovering potential anti-EGFR drug compounds with both minimal resources and less technical expertise. However, our prediction approaches cannot fully elucidate the complex drug metabolism reactions taking place inside the human body. Therefore, we recommend future studies to synthesize CUCM-36 compound chemically, and test its EFGR inhibitory action as well as drug metabolism in cell lines and animal models.
  42 in total

Review 1.  Induction of apoptosis by curcumin and its implications for cancer therapy.

Authors:  D Karunagaran; R Rashmi; T R Santhosh Kumar
Journal:  Curr Cancer Drug Targets       Date:  2005-03       Impact factor: 3.428

2.  PyMOL and Inkscape Bridge the Data and the Data Visualization.

Authors:  Shuguang Yuan; H C Stephen Chan; Slawomir Filipek; Horst Vogel
Journal:  Structure       Date:  2016-12-06       Impact factor: 5.006

Review 3.  Therapeutic roles of curcumin: lessons learned from clinical trials.

Authors:  Subash C Gupta; Sridevi Patchva; Bharat B Aggarwal
Journal:  AAPS J       Date:  2012-11-10       Impact factor: 4.009

4.  Comparative Protein Structure Modeling Using MODELLER.

Authors:  Benjamin Webb; Andrej Sali
Journal:  Curr Protoc Bioinformatics       Date:  2016-06-20

5.  EGFR Inhibition by Curcumin in Cancer Cells: A Dual Mode of Action.

Authors:  Marcelina Starok; Pascal Preira; Muriel Vayssade; Karsten Haupt; Laurence Salomé; Claire Rossi
Journal:  Biomacromolecules       Date:  2015-04-24       Impact factor: 6.988

6.  Identification of novel drug-resistant EGFR mutant inhibitors by in silico screening using comprehensive assessments of protein structures.

Authors:  Tomohiro Sato; Hisami Watanabe; Keiko Tsuganezawa; Hitomi Yuki; Junko Mikuni; Seiko Yoshikawa; Mutsuko Kukimoto-Niino; Takako Fujimoto; Yumiko Terazawa; Motoaki Wakiyama; Hirotatsu Kojima; Takayoshi Okabe; Tetsuo Nagano; Mikako Shirouzu; Shigeyuki Yokoyama; Akiko Tanaka; Teruki Honma
Journal:  Bioorg Med Chem       Date:  2012-04-27       Impact factor: 3.641

7.  Experimental and computational studies on newly synthesized resveratrol derivative: a new method for cancer chemoprevention and therapeutics?

Authors:  Babajan Banaganapalli; Chaitanya Mulakayala; Madhusudana Pulaganti; Naveen Mulakayala; C M Anuradha; Chitta Suresh Kumar; Noor Ahmad Shaik; Jumana Yousuf Al-Aama; Dhananjaya Gudla
Journal:  OMICS       Date:  2013-09-17

8.  A Computational Protein Phenotype Prediction Approach to Analyze the Deleterious Mutations of Human MED12 Gene.

Authors:  Babajan Banaganapalli; Kaleemuddin Mohammed; Imran Ali Khan; Jumana Y Al-Aama; Ramu Elango; Noor Ahmad Shaik
Journal:  J Cell Biochem       Date:  2016-02-10       Impact factor: 4.429

9.  Receptor-based virtual screening of EGFR kinase inhibitors from the NCI diversity database.

Authors:  Kiattawee Choowongkomon; Orathai Sawatdichaikul; Napat Songtawee; Jumras Limtrakul
Journal:  Molecules       Date:  2010-06-04       Impact factor: 4.411

Review 10.  Pulmonary Toxicities of Gefitinib in Patients With Advanced Non-Small-Cell Lung Cancer: A Meta-Analysis of Randomized Controlled Trials.

Authors:  Dongsheng Hong; Guobing Zhang; Xingguo Zhang; Xingguang Liang
Journal:  Medicine (Baltimore)       Date:  2016-03       Impact factor: 1.889

View more
  4 in total

1.  Metabolites Profiling, In Vitro, In Vivo, Computational Pharmacokinetics and Biological Predictions of Aloe perryi Resins Methanolic Extract.

Authors:  Rasha Saad Suliman; Sahar Saleh Alghamdi; Rizwan Ali; Dimah A Aljatli; Sarah Huwaizi; Rania Suliman; Ghadeer M Albadrani; Abdulellah Al Tolayyan; Bandar Alghanem
Journal:  Plants (Basel)       Date:  2021-05-30

2.  Unraveling the role of salt-sensitivity genes in obesity with integrated network biology and co-expression analysis.

Authors:  Jamal Sabir M Sabir; Abdelfatteh El Omri; Babajan Banaganapalli; Nada Aljuaid; Abdulkader M Shaikh Omar; Abdulmalik Altaf; Nahid H Hajrah; Houda Zrelli; Leila Arfaoui; Ramu Elango; Mona G Alharbi; Alawiah M Alhebshi; Robert K Jansen; Noor A Shaik; Muhummadh Khan
Journal:  PLoS One       Date:  2020-02-06       Impact factor: 3.240

3.  ZnCl2 catalyzed new coumarinyl-chalcones as cytotoxic agents.

Authors:  Konidala Sathish Kumar; Vijay Kotra; Ch B Praveena Devi; Nutakki Anusha; Bollikolla Hari Babu; Syed Farooq Adil; Mohammed Rafi Shaik; Mujeeb Khan; Abdulrahman Al-Warthan; Osamah Alduhaish; M Mujahid Alam
Journal:  Saudi J Biol Sci       Date:  2020-10-22       Impact factor: 4.219

4.  Serum Platelet-Derived Growth Factor Is Significantly Lower in Patients with Lung Cancer and Continued to Decrease After Platinum-Based Chemotherapy.

Authors:  Rong Ma; Qing Yang; Shengya Cao; Siwen Liu; Haixia Cao; Heng Xu; Jianzhong Wu; Jifeng Feng
Journal:  Onco Targets Ther       Date:  2020-03-04       Impact factor: 4.147

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.