| Literature DB >> 33075469 |
Yazan Haddad1, Marek Remes1, Vojtech Adam1, Zbynek Heger2.
Abstract
Most of the available crystal structures of epidermal growth factor receptor (EGFR) kinase domain, bound to drug inhibitors, originated from ligand-based drug design studies. Here, we used variations in 110 crystal structures to assemble eight distinct families highlighting the C-helix orientation in the N-lobe of the EGFR kinase domain. The families shared similar mutational profiles and similarity in the ligand R-groups (chemical composition, geometry, and charge) facing the C-helix, mutation sites, and DFG domain. For structure-based drug design, we recommend a systematic decision-making process for choice of template, guided by appropriate pairwise fitting and clustering before the molecular docking step. Alternatively, the binding site shape/volume can be used to filter and select the compound libraries.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33075469 PMCID: PMC7567673 DOI: 10.1016/j.drudis.2020.10.007
Source DB: PubMed Journal: Drug Discov Today ISSN: 1359-6446 Impact factor: 7.851
Figure 1N-lobe structural variations in the epidermal growth factor receptor (EGFR) kinase domain. (a) 3D structure of the EGFR kinase domain complexed with third-generation drug osimertinib showing the C-lobe (blue) and N-lobe (green). Distinct mutation sites are highlighted in orange, whereas the DFG domain is highlighted in red. (b) Dendrogram showing weighted pair-group average clustering of the N-lobe of EGFR based on distance matrix. Structure–structure alignment was used to compare four wild-type apo EGFR with representative structures from classified families of co-complexed EGFR with ligands. Method: The structure–structure multiple-alignment using aln.malign3d() function in Modeller (9.17, r10881) was used to perform throughout (N−1) dynamic programming where each new structure is compared to the average of the previous structures in each new cycle. Default settings were used [off diagonal = 100, gap penalties 3d = (0.0, 1.75), fit atoms = ‘CA’]. Dendrogram was built in Modeller with a cluster cutoff of −1.0.
Figure 2Dendrogram showing weighted pair-group average clustering of the N-lobe of epidermal growth factor receptor (EGFR) based on a distance matrix. Structure–structure alignment was used to compare 110 3D structures and resulted in two distinct clans that can be divided further into two to three families each. The largest clan of 78 structures was highly similar and divided into family A (22 structures with one or no mutations in EGFR), family B (31 structures with mostly double EGFR mutations), and family C (25 structures with one or two EGFR mutations, divided into subfamily 1 with 21 structures and subfamily 2 with four structures). The other clan was more divergent and was divided into family D (17 structures divided into subfamily 1 with seven wild-type EGFR structures and subfamily 2 with ten triple-mutated EGFR structures) and family E (15 highly divergent structures divided into two subfamilies of ten and five structures). Method: Using the keyword ‘EGFR’, a RCSB protein databank (www.rcsb.org) database search resulted in 260 structures. Structures were ordered according to best resolution (Å), and then entries that did not cover the kinase domain of EGFR were excluded. Thirteen entries were without mutations and also without inhibitor compounds: 2RFD, 2RF9, 4R3R, 2J5E, 2ITX, 3GT8, 4R3P, 2RFE, 4ZJV, 3VJO, 2GS6, 2GS7, and 4WRG. Fifteen entries were with mutations and without inhibitor compounds: 2ITN, 3UG1, 3VJN, 5CNN, 2EB3, 2ITV, 5CZH, 5CZI, 4I21, 5SX5, 4RIW, 4RIX, 4RIY, 4ZSE, and 5CNO. Only chain A was retrieved (the number of chains ignored were 28 from all structures). The 50% viability, dissociation, and inhibition constants (IC50, Kd, and Ki) were acquired from RCSB database links to the binding databases: PDBBind (www.pdbbind-cn.org), BindingDB (www.bindingdb.org), and BindingMOAD (bindingmoad.org) databases. *Data for the same ligand but the sequence identity of crystal structure was <100% indicating other mutant variants. The structure–structure alignment was done by the same method described in Fig. 1 in the main text.
Figure 33D superposition of N-lobe of epidermal growth factor receptor (EGFR) kinase domain (gray ribbons) and complexed ligands (green wire with HETATOM coloring). Residues within a 3-Å distance are shown as gray sticks with HETATOM coloring. Subfamilies are colored accordingly. (a) Family A (22 structures with one or no mutations in EGFR) showing the C-helix in proximity to ligands (arrow) and contacts with Glu762. (b) Family B (31 structures with mostly double EGFR mutations) showing T790M mutants and the C-helix in proximity to ligands (arrow) by contacts with Glu762. (c) Family C1 in gray (21 structures with one or two EGFR mutations) showing T790M mutants and the C-helix in proximity to ligands by contacts with Glu762. Family C2 in purple (four structures) showing shifting of the C-helix away from ligands because of a steric effect. (d) Family D1 in brown (seven wild-type EGFR structures) showing steric effect of ligands (brown wire inside the red circle) and the C-helix is shifting away with no contacts against Glu762. Family D2 in gray (ten triple-mutated EGFR structures) showing shifting of the C-helix away from ligands possibly because of effects of mutations. (e) Family E1 in gray (ten structures) showing the steric effect of ligands and the C-helix shifting away with no contacts against Glu762. (F) Family E1 and E2 superposed (in gray and gold, respectively). Family E2 (five highly divergent structures) showing a back-shift in the C-helix with ligand contacts against Glu762 and Ile759. Visualization of protein and ligand 3D Structures was performed using UCSF Chimera (version 1.10.2). The matchmaker plugin was used for superposition of all heavy atoms via the BLOSUM-62 scoring matrix and Needleman-Wunsch alignment algorithm.