| Literature DB >> 35480330 |
Aylin Del Moral-Morales1, Marisol Salgado-Albarrán1,2, Elizabeth Ortiz-Gutiérrez1, Gerardo Pérez-Hernández1, Ernesto Soto-Reyes1.
Abstract
KDM4 proteins are a subfamily of histone demethylases that target the trimethylation of lysines 9 and 36 of histone H3, which are associated with transcriptional repression and elongation respectively. Their deregulation in cancer may lead to chromatin structure alteration and transcriptional defects that could promote malignancy. Despite that KDM4 proteins are promising drug targets in cancer therapy, only a few drugs have been described as inhibitors of these enzymes, while studies on natural compounds as possible inhibitors are still needed. Natural compounds are a major source of biologically active substances and many are known to target epigenetic processes such as DNA methylation and histone deacetylation, making them a rich source for the discovery of new histone demethylase inhibitors. Here, using transcriptomic analyses we determined that the KDM4 family is deregulated and associated with a poor prognosis in multiple neoplastic tissues. Also, by molecular docking and molecular dynamics approaches, we screened the COCONUT database to search for inhibitors of natural origin compared to FDA-approved drugs and DrugBank databases. We found that molecules from natural products presented the best scores in the FRED docking analysis. Molecules with sugars, aromatic rings, and the presence of OH or O- groups favor the interaction with the active site of KDM4 subfamily proteins. Finally, we integrated a protein-protein interaction network to correlate data from transcriptomic analysis and docking screenings to propose FDA-approved drugs that could be used as multitarget therapies or in combination with the potential natural inhibitors of KDM4 enzymes. This study highlights the relevance of the KDM4 family in cancer and proposes natural compounds that could be used as potential therapies.Entities:
Keywords: KDM4 inhibitor; cancer; drug discovery; epigenetics (chromatin remodeling); natural compounds; structural biology
Year: 2022 PMID: 35480330 PMCID: PMC9036480 DOI: 10.3389/fgene.2022.860924
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Structures used for the molecular docking screenings.
| Enzime | PDB accession | References |
|---|---|---|
| KDM4A | 5F32 |
|
| KDM4B | 4LXL |
|
| KDM4C | 2XML |
|
| KDM4D | 4HON |
|
| KDM4E | 4HOO |
|
| 5F5C |
| |
| 5FP4 |
| |
| 5FP7 | ||
| 5FP8 | ||
| 5FPA | ||
| 5FPB | ||
| 6H10 |
| |
| 2W2I |
|
FIGURE 1KDM4 family expression in cancer. (A) The number of samples used for the transcriptomic and survival analysis. Samples were obtained from TCGA, TARGET and GTEx databases. (B) Gene expression and survival analysis for each KDM4 protein. The first panel shows the differential expression analysis of the tumor samples vs. the non-neoplastic tissue. The second panel shows the CoxPH and Kaplan Meier survival analysis as adjacent columns for each KDM4 protein. For CoxPH analysis (first column), the tile color indicates if high levels of the KDM4 are of bad or good prognosis (p value <0.05). For the Kaplan-Meier analysis (second column), tumor samples were divided into two groups according to their KDM expression: Low-KDM and patients with High-KDM (p value <0.05). White tiles represent non significant association. (C) Significant Kaplan Meier curves of the KDM4 protein overexpressed in the cancer type where only a bad prognosis relationship was found with p value <0.05.
FIGURE 2Differential expression and enrichment analysis of the KDM4 family. (A) The left panel of squares represents the 12 types of tumors where the deregulation of the KDM4 subfamily is of bad prognosis. The differential expression analysis was performed comparing High-KDM vs. Low-KDM samples. Color intensity is related to the log2(FC). The right panel represents the number of differentially expressed genes (DEG) for each comparison. (B) Hallmarks of Cancer enrichment analysis for the DEG in each sample. Color intensity represents the pvalue and size of the intersection size.
FIGURE 3Molecular docking against the KDM4 subfamily. (A) Structural 3D alignment (upper panel) and root mean square fluctuation (RMSF, lower panel) for the aminoacid residues of the KDM4 subfamily structures used in this work. The red arrow indicates the location of the residues with the highest RMSF. (B) FRED/Chemgauss4 score distribution for the top 100 compounds from each database (COCONUT, FDA, and DrugBank) that were predicted to bind to each of the KDM4 family members. The fill indicates the enzyme system used, APO (without metal cofactors), and HOLO (with all metallic ions). Size is proportional to each compound’s number of targets according to our docking analysis. (C) FRED/Chemgauss4 score distribution for each of the three databases evaluated. Outlier points are shown in gray.
FIGURE 4Similarity flexophores map for COCONUT, DrugBank, and FDA top hits. (A) Similarity flexophores map. Each node represents a compound, the node color depicts its FRED/Chemgauss4 score. The node shape indicates which KDM4 the ligand binds. The network edges indicate a relationship of at least 95% of flexophore similarity between compound pairs (neighbors). Black dots represent the compounds from Baby et al., 2021. (B) Node distribution for the compounds belonging to each of the databases evaluated (COCONUT, DrugBank, and FDA). (C) 2D structure for a representative compound from each of the chosen clusters. Clusters were selected based on their size, and edge number.
FIGURE 5Molecular dynamics simulations and absolute binding energy calculation. (A) FRED/Chemgauss4 vs ΔGPBSA correlation for the best scoring ligands from each database. Ligands that showed a favorable binding energy (<0 kcal/mol) and negative FRED/Chemgauss4 score were considered as successful (red dots). The success percentage represents the proportion of successful molecules for each database. (B) Upper panel: Graphical representation (20 frames) of the molecular dynamics simulation for KDM4A (PDB ID: 5F32) in complex with the CNP0371131 ligand from COCONUT. Lower panel: RMSF value for each residue. (C) Electrostatic potential for the KDM4A-CNP0371131 complex. (D) Graphical representation of the CNP0371131 molecule (green) bound to KDM4A’s catalytic site. The residue numbers correspond to PDB structure 5F32. (E) Average per residue MM-PBSA binding free energy contribution for the KDM4A-CNP0371131 complex.
List of the top molecules with potential inhibitory activity of KDM4 subfamily proteins determined with molecular docking using COCONUT, DrugBank and FDA databases.
| Target | Database | Ligand |
|---|---|---|
| KDM4A | COCONUT | CNP0058667, CNP0150788, CNP0216191, CNP0002425, CNP0371131, Pulchellidin 3-Glucoside (CNP0359043), CNP0223133, CNP0258703 (Epigallocatechin gallate) |
| DrugBank | 6-O-capryloylsucrose, Zanamivir, Acteoside, DB04211, DB03249, DB07719, DB12116 | |
| FDA | Glucosamine, Glucosamine sulfate, Doripenem, Neohesperidin, Sulisobenzone, Verbascoside | |
| Wedelolactone, Epigallocatechin gallate | ||
| KDM4B | COCONUT | CNP0322725, CNP0216191, CNP0098686, CNP0316754, CNP0107391, CNP0239128, Crispine D (CNP0119105) |
| DrugBank | Carba-glucotropaeolin, Ascorbyl glucoside, Zanamivir, Iodo-Willardiine, beta-D-arabinofuranose 5-phosphate, DB03250,DB02488 | |
| FDA | Methazolamide, Sulisobenzone, Baricitinib, Lanraplenib, Pentostatin | |
| KDM4C | COCONUT | CNP0187735, CNP0417860, CNP0226084, CNP0298305, CNP0289146, CNP0350449, CNP0106665 |
| DrugBank | Peramivir, DB03717, Edotecarin, 3′-Uridine Monophosphate | |
| FDA | Cynarin, Quercitrin, Chlorogenic acid, (-)-Epigallocatechin gallate, Hyperoside, Gastrodin, Polydatin | |
| KDM4D | COCONUT | 6-C-Glucosylorobol (CNP0299696), CNP0002425, CNP0362352, CNP0243580, CNP0216191, Isovolubilin (CNP0151675), CNP0397301 |
| DrugBank | 6-O-Capryloylsucrose, Balanol, 10-hydroxycamptothecin, DB07102, 2′-Deoxycytidine-5′-Monophosphate, Cidofovir, Levoglucose | |
| FDA | Glucosamine, Glucosamine Sulfate, Oleuropein, Sulpiride, Sulisobenzone, Levosulpiride (Levogastrol), Hydroxycamptothecin | |
| KDM4E | COCONUT | CNP0131606, CNP0186792, CNP0125603, 4-hydroxy-2-ketoarginine (CNP0433705), CNP0295348, Quercetin 5-Glucuronide (CNP0081446), CNP0249133 |
| DrugBank | Azacitidine, Meglumine, Balanol, Levoglucose, Ascorbic acid, L-Xylulose 5-Phosphate, 5-phospho-D-arabinohydroxamic acid | |
| FDA | Glucosamine, Glucosamine Sulfate, Minoxidil Sulphate, Sulfamonomethoxine, Sulpiride, Xylitol, Orotic Acid (6-Carboxyuracil) |
*For long compound names only the database ID is provided.
FIGURE 6Top potential inhibitors of the KDM4A family. The network represents the Drug-Protein and Disease-Protein relationship between the members of the KDM4 family. The Drug-Protein edge width and color intensity represents the FRED/Chemgauss4 score. For long compound names only the database ID is provided.
FIGURE 7Integrative network analysis of KDM4 potential inhibitors in different cancer types. A network enriched with differentially expressed genes obtained from each cancer type selected is shown. The circular nodes represent proteins, and the edges the interactions between them. The color of the circular nodes represents the fold change in gene expression between tumors with high and low KDM4 expression. Drugs targeting the proteins are represented by diamond nodes, where dark green is used for Drugbank drugs and light green for natural compounds (Coconut database). The Drugbank drug and protein target interactions were retrieved from curated databases (NeDRex platform), while the natural compound interactions with proteins are predicted by the in silico analysis performed previously. The colored shadow highlights the proteins that participate in a cellular process according to g.Profiler enrichment. Overall, the network depicts the KDM4 proteins, their protein interaction context and shared interactions with known drugs and natural compounds.