Literature DB >> 35492478

Exploring the therapeutic mechanisms of Cassia glauca in diabetes mellitus through network pharmacology, molecular docking and molecular dynamics.

Prarambh S R Dwivedi¹, Rajesh Patil², Pukar Khanal³, Nilambari S Gurav⁴, Vaishali D Murade⁵, Dinesh P Hase⁶, Mohan G Kalaskar⁷, Muniappan Ayyanar⁸, Rupesh V Chikhale⁹, Shailendra S Gurav¹⁰.

Abstract

Cassia glauca is reported as anti-diabetic medicinal plant and is also used as an ethnomedicine. However, its mode of action as an anti-diabetic agent has not been clearly elucidated. Hence, the present study investigated the probable mechanism of action of C. glauca to manage diabetes mellitus via network pharmacology and molecular docking and simulations studies. The reported bioactives from C. glauca were retrieved from an open-source database, i.e. ChEBI, and their targets were predicted using SwissTargetPrediction. The proteins involved in the pathogenesis of diabetes were identified from the therapeutic target database. The targets involved in diabetes were enriched in STRING, and the pathways involved in diabetes were identified concerning the KEGG. Cytoscape was used to construct the network among bioactives, proteins, and probably regulated pathways, which were analyzed based on edge count. Similarly, molecular docking was performed using the Glide module of the Schrodinger suite against majorly targeted proteins with their respective ligands. Additionally, the drug-likeness score and ADMET profile of the individual bioactives were predicted using MolSoft and admetSAR2.0 respectively. The stability of these complexes were further studied via molecular dynamics simulations and binding energy calculations. Twenty-three bio-actives were retrieved from the ChEBI database in which cassiarin B was predicted to modulate the highest number of proteins involved in diabetes mellitus. Similarly, GO analysis identified the PI3K-Akt signaling pathway to be primarily regulated by modulating the highest number of gene. Likewise, aldose reductase (AKR1B1) was majorly targeted via the bioactives of C. glauca. Similarly, docking study revealed methyl-3,5-di-O-caffeoylquinate (docking score -9.209) to possess the highest binding affinity with AKR1B1. Additionally, drug-likeness prediction identified cassiaoccidentalin B to possess the highest drug-likeness score, i.e. 0.84. The molecular dynamics simulations and the MMGBSA indicate high stability and greater binding energy for the methyl-3,5-di-O-caffeoylquinate (ΔG bind = -40.33 ± 6.69 kcal mol-1) with AKR1B1, thus complementing results from other experiments. The study identified cassiarin B, cassiaoccidentalin B, and cinnamtannin A2 as lead hits for the anti-diabetic activity of C. glauca. Further, the PI3K-Akt and AKR1B1 were traced as majorly modulated pathway and target, respectively. This journal is © The Royal Society of Chemistry.

Entities: Chemical

Year: 2021 PMID： 35492478 PMCID： PMC9044434 DOI： 10.1039/d1ra07661b

Source DB: PubMed Journal: RSC Adv ISSN： 2046-2069 Impact factor: 4.036

Introduction

Diabetes mellitus is characterized by hyperglycemia which may occur due to impaired insulin secretion, insulin action, or a combination of both.[1] Increased risk for diabetes is primarily associated with age, ethnicity, and family history of diabetes, smoking, obesity, and physical inactivity which may lead to cardiovascular complications.[2] In 2005, it was estimated that more than 20 million people had diabetes in the United States and 30% of the cases were undiagnosed. Moreover, it has been estimated that these numbers may rise to 48 million by 2050.[3] Also, this number has been estimated to increase to 578 million, representing 10.2% of the total world adult population by 2030, and further increase to 700 million by the year 2045. Additionally, in 2019, diabetes prevalence among men and women has been estimated to be 9.6% and 9.0%, respectively, of the total respective gender world population.[4] Diabetes mellitus is a complex pathogenesis and varied presentation; thus, any classification is arbitrary;[5] majorly categorized into type 1 diabetes mellitus (T1DM) and type 2 diabetes mellitus (T2DM).[6,7] T1DM is an autoimmune disorder characterized by T-cell-mediated destruction of pancreatic β-cells, which results in insulin deficiency and ultimately hyperglycemia[8] whereas, the T2DM is characterized by two main insulin-related anomalies i.e. insulin resistance and pancreatic β-cell dysfunction.[9] Chronic hyperglycemia may result in dysfunctioning, and failure of multiple organs like kidneys, eyes, nerves, blood vessels, and heart;[6] often cauterized for diabetic complications. The current pharmacotherapy of T2DMs utilizes synthetic single-target oral hypoglycaemic agents; includes sulphonylureas, biguanides, thiazolidinediones, and α-glucosidase inhibitors;[10,11] however, these agents are concerned with multiple side effects like heartburn, stomach discomfort, ketoacidosis, pancreatitis, and nephropathy risk.[12] Since ancient times, various medicinal plants were the first choice to treat diabetes[13,14] as they are concerned with minimum side effects.[15] However, no clear mechanism has been established for their potential therapeutic action including C. glauca against diabetes; in addition, traditional medicines are explained as "hard to swallow" agents due to unproposed mode of action which is one of the prime reason to limit their utility. Moreover, in the present study, an approach has been made to propose the probable mechanism of action of C. glauca via the concept of multiple bioactives-multiple targets interaction using the concept of network pharmacology[10] by extending the theory of “lock and key” to “a master key may unlock multiple locks”. Cassia glauca is an ornamental tree with a height of 10 m belonging to the sub-family Caesalpiniaceae and family Fabaceae.[16] It has been reported for its various pharmacological activities like anti-inflammatory, anti-oxidant, and anti-diabetic properties.[17] Traditionally, the bark and leaves of C. glauca are used to treat diabetes and gonorrhea.[18] Further, it has been reported to lower the elevated blood glucose level in streptozotocin-induced hyperglycemia.[19] Additionally, C. glauca has been reported to inhibit α-glucosidase and α-amylase; may contribute in managing the post-prandial hyperglycemia.[20] In addition, C. glauca acetone fraction has been reported to cause a significant decrease in the levels of hepatic enzyme aspartate transaminase, alanine transaminase, creatine kinase, and lactate dehydrogenase in streptozotocin-induced diabetic rats.[21] Further, multiple bioactives from C. glauca have been traced to regulate the PPAR-γ to deal with the pathogenesis of diabetes using computational approaches.[22] However, the complete interaction of the secondary metabolites from C. glauca to regulate the multiple pathways and associated proteins in diabetes has not been reported yet. In this regard, the present study aimed to predict the mode of action of C. glauca against diabetes via the implementation of network pharmacology, molecular docking, and molecular dynamics simulation.

Materials and methods

Mining of phytoconstituents and targets

A list of reported phytoconstituents was retrieved from the Chemical Entities of Biological Interest (ChEBI; https://www.ebi.ac.uk/chebi/) database. The molecular formula, molecular weight, PubChem CID, InChI key, and canonical SMILES were obtained from the PubChem (https://pubchem.ncbi.nlm.nih.gov/) database. The targets involved in diabetes were retrieved from the therapeutic target database (http://db.idrblab.net/ttd/) using the keyword “Diabetes mellitus” and cross-matched with the proteins regulated by the bioactives of C. glauca with reference to the bioactives-regulated proteins from SwissTargetPrediction (http://www.swisstargetprediction.ch/).[23]

Drug-likeness and ADMET profile of bio-actives

The drug-likeness score of individual bioactive was retrieved from MolSoft (https://molsoft.com/mprop/) by querying their SMILES. Further, absorption, distribution, metabolism, excretion, and toxicity (ADMET) profile of each bio-active was predicted using admetSAR 2.0 [24] (http://lmmd.ecust.edu.cn/admetsar2/) to predict human intestinal absorption, Caco-2 permeability, blood–brain barrier permeability, P-gp inhibition, P-gp substrate, CYP3A4/CYP2C9/CYP2D6 substrate, CYP3A4/CYP2C9/CYP2C19/CYP2D6/CYP1A2 inhibition, Ames mutagenesis, human ether-a-go-go inhibition, micronuclear toxicity, hepatotoxicity, plasma protein binding, acute oral toxicity.

Gene expression and enrichment analysis

UniProt database (https://www.uniprot.org/) was used to identify the gene code of regulated proteins which were queried in STRING[25] (https://string-db.org/) for GO analysis. Likewise, the pathways involved in diabetes were identified by concerning the KEGG database (https://www.genome.jp/kegg/). In addition, network among bioactives, proteins, and pathways was constructed using Cytoscape[26]ver. 3.7.2 (https://cytoscape.org/); any duplication was avoided. The whole network was treated as directed, and node size and color were set as “low values to small sizes” and “low values to bright colors” concerning edge count.

Molecular docking

Ligand preparation

The 2D structure of ligands were retrieved from the PubChem database (https://pubchem.ncbi.nlm.nih.gov) in .sdf format and uploaded to the workspace. Then LigPrep module (https://www.schrodinger.com/products/ligprep) was used to generate the 3D structure, and the OPLS-2005 force field was used to generate the different conformations of each ligand. The stable conformer of each ligand with minimum energy was further processed for molecular docking.

Preparation of macromolecule

The docking simulation study of bioactives were performed on the AKR1B1 (PDB ID: 4JIR) retrieved from a protein data bank (https://www.rcsb.org). The protein preparation wizard of Schrodinger Maestro (Schrödinger, LLC, New York, NY, 2021) was used to prepare the structure by adding hydrogen, deleting water molecules, and assigning partial charges by using the OPLS-2005 force field, then assigning protonation states and determining restrained, and further partial energy was minimized with a 0.3 Å RMSD limit. The binding sites were defined after removing the ligand and the grid was generated using a grid box volume of 10_10_10 Å.

Ligand–protein docking

The glide module of the Schrodinger suite was used to dock the individual bioactive into the identified binding site of the grid, and the lowest binding pose of each docking run was chosen to visualize the ligand–protein interaction glide XP visualizer. The important active site interactions were analyzed along with the scoring functions.

Molecular dynamics (MD) simulation and molecular mechanics-generalized born solvent accessibility (MM-GBSA) analysis

The AMBER18 molecular dynamics simulation package was used to perform all the simulation calculations. All the ligand molecules were parameterized with the ANTECHAMBER tool implemented with AMBER18. The protein–ligand complexes were prepared in xLeap to obtain an explicit solvent model. These complexes were subjected to the minimization cycles, simulated annealing, equilibration, and finally, a production run of 100 ns on Nvidia V100-SXM2-16GB Graphic Processing Unit using the PMEMD.CUDA module. Further, the 100 ns trajectories were stripped of water molecules and salt ions with CPPTRAJ and subjected to MM-GBSA analysis using Amber18 and Amber18 tools on all the 10 000 frames (ESI file†).

Results

Mining of phytoconstituents and targets involved in diabetes

Twenty-three bio-actives were identified in the plant C. glauca from the ChEBI database in which all bioactives modulated proteins involved in diabetes. Two hundred and sixteen proteins were found to be involved in diabetes mellitus (as recorded by the therapeutic target database); among them, eighty-four were modulated by the bio-actives from C. glauca. Table 1 summarizes the list of bioactives, their PubChem CID, molecular formula, and molecular weight.

Reported bioactives from C. glauca

Bioactives	PubChem CID	Molecular formula	Molecular weight
[Epicatechin-(4beta->8)]5-epicatechin	16129623	C₉₀H₇₄O₃₆	1731.5
1-Tetracosanoylglycerol	14542801	C₂₇H₅₄O₄	442.7
3-O-Methylcalopocarpin	467497	C₂₁H₂₂O₄	338.4
3-Phenylpropanal	7707	C₉H₁₀O	134.17
Cassiaoccidentalin B	70698280	C₂₇H₂₈O₁₄	576.5
Cassiarin B	17756116	C₁₈H₁₉NO₄	313.3
Cinnamic acid	444539	C₉H₈O₂	148.16
Sandwicensin	467498	C₂₁H₂₂O₄	338.4
Sennoside	656822	C₄₂H₃₈O₂₀	862.7
Torososide B	9988405	C₄₀H₅₂O₂₅	932.8
Cassiaside C2	10350826	C₃₉H₅₂O₂₅	920.8
Methyl 3,5-di-O-caffeoyl quinate	10075681	C₂₆H₂₆O₁₂	530.5
Cassiarin A	17756117	C₁₃H₁₁NO₂	213.23
Cassiatannin A	44610605	C₆₀H₄₈O₂₄	1153
Cinnamtannin A2	16130899	C₆₀H₅₀O₂₄	1155
Cinnamtannin B-1	475277	C₄₅H₃₆O₁₈	864.8
Cinnamtannin D-1	46173958	C₄₅H₃₆O₁₈	864.8
Diphenyl sulfone	31386	C₁₂H₁₀OS	218.27
Kaempferol 3-beta-gentiobioside	9960512	C₂₇H₃₀O₁₆	610.5
Norrubrofusarin	1.35 × 10⁸	C₁₄H₁₀O₅	258.23
Parameritannin A-1	16146159	C₆₀H₄₈O₂₄	1153
Pentadecanal	17697	C₁₅H₃₀O	226.4
3,5-Di-O-caffeoyl quinic acid	6474310	C₂₅H₂₄O₁₂	516.4

Drug-likeness score and ADMET profile of bioactives

Fourteen bioactives scored a positive drug-likeness score in which cassiaoccidentalin B scored the highest, i.e. 0.84 (Table 2). Similarly, the pharmacokinetic profile of each bioactive is represented in a heat map to present human intestinal absorption (HIA), Caco-2 permeability (Caco-2p), blood–brain barrier (BBB) permeability, P-glycoprotein (p-gp) inhibitor/substrate, CYP3A4, CYP2C9, CYP2D6, CYP3A4, CYP2C9, CYP2C19, CYP2D6, CYP1A2 inhibitor/substrate, Ames mutagenesis, human ether-a-go-go inhibition, hepatotoxicity, water-solubility, plasma protein binding, and acute oral toxicity (Fig. 1) in which 3-phenylpropanal and cassiatannin A showed highest HIA, Caco-2p by sennoside, and BBB permeability by 1-tetracosanoylglycerol. Additionally, cassiarin A and cassiarin B were predicted to possess the highest p-gp inhibition. Further, 3-phenylpropanal and cassiatannin A were predicted to possess the highest affinity towards p-pg as a substrate. Likewise, 3,5-di-O-caffeoylquinic acid, 3-O-methylcalopocarpin, and cinnamtannin D-1 and sandwicensin were predicted for their highest affinity towards hERG inhibition, hepatotoxicity, and plasma protein binding respectively (Fig. 1).

Drug likeness score of bioactivesa

Bioactive	Number of HBA	Number of HBD	MolLogP	MolLogS		DLS
Bioactive	Number of HBA	Number of HBD	MolLogP	log(mol L⁻¹)	mg L⁻¹	DLS
[Epicatechin-(4beta->8)]5-epicatechin	3	1	2.68	−2.84	375.19	0.29
1-Tetracosanoylglycerol	4	2	10.2	−5.98	0.46	−0.11
3-O-Methylcalopocarpin	4	1	5.14	−4.96	3.75	−0.42
3-Phenylpropanal	1	0	1.62	−1.31	6623.72	−1.38
Cassiaoccidentalin B	14	8	0.04	−1.59	14 846.5	0.84
Cassiarin B	4	0	1.66	−1.49	10 022.7	−0.96
Cinnamic acid	2	1	2.22	−2.45	522.96	−1.17
Sandwicensin	4	1	5.06	−4.86	4.64	0.05
Sennoside	20	12	1.08	−2.02	8246.81	0.71
Torososide B	25	14	−4.46	−1.52	27 878.3	0.63
3,5-Di-O-caffeoyl quinic acid	12	7	1.73	−2.03	4764.38	0.74
Cassiaside C2	25	14	−4.95	−1.56	25 553.5	0.52
Methyl 3,5-di-O-caffeoyl quinate	12	6	2.34	−2.65	1174.72	0.78
Cassiarin A	2	1	1.27	−1.79	3435.66	−1.53
Cassiatannin A	24	19	3.47	−3.15	815.43	0.75
Cinnamtannin A2	24	20	3.01	−2.86	1602.46	0.73
Cinnamtannin B-1	18	14	2.5	−2.7	1740.12	0.75
Cinnamtannin D-1	18	14	2.5	−2.7	1740.12	0.75
Diphenyl sulfone	2	0	2.96	−3.1	172.89	−1.54
Kaempferol 3-beta- gentiobioside	16	10	−1.99	−1.79	9833.52	0.64
Norrubrofusarin	5	3	1.23	−1.49	8329.69	−1.3
Parameritannin A-1	24	19	3.47	−3.15	815.43	0.75
Pentadecanal	1	0	6.58	−5.9	0.29	−1.19

HBA: hydrogen bond acceptor; HBD: hydrogen bond donor; DLS: drug-likeness score.

Fig. 1

ADMET profile of bio-actives via heat map presenting (A) human intestinal absorption, (B) Caco-2 permeability, (C) blood–brain barrier permeability, (D) P-glycoprotein inhibitior, (E) P-glycoprotein substrate, (F) CYP3A4 substrate, (G) CYP2C9 substrate, (H) CYP2D6 substrate, (I) CYP3A4 inhibition, (J) CYP2C9 inhibition, (K) CYP2C19 inhibition, (L) CYP2D6 inhibition, (M) CYP1A2 inhibition, (N) Ames mutagenesis, (O) human ether-a-go-go inhibition, (P) micronuclear, (Q) hepatotoxicity, (R) water solubility, (S) plasma protein binding, (T) acute oral toxicity of (1) [epicatechin-(4beta->8)]5-epicatechin, (2) 1-tetracosanoylglycerol, (3) 3,5-di-O-caffeoyl quinic acid, (4) 3-O-methylcalopocarpin, (5) 3-phenylpropanal, (6) cassiaoccidentalin B, (7) cassiarin A, (8) cassiarin B, (9) cassiaside C2, (10) cassiatannin A, (11) cinnamic acid, (12) cinnamtannin A2, (13) cinnamtannin B-1, (14) cinnamtannin D-1, (15) diphenyl sulfone, (16) kaempferol 3-beta-gentiobioside, (17) methyl 3,5-di-O-caffeoyl quinate, (18) norrubrofusarin, (19) parameritannin A-1, (20) pentadecanal, (21) sandwicensin, (22) sennoside, and (23) torososide B.

HBA: hydrogen bond acceptor; HBD: hydrogen bond donor; DLS: drug-likeness score.

Enrichment and network analysis

Cassiarin B was predicted to modulate the highest number of genes, i.e. 24 (Table 3). Similarly, Fig. 2 represents the protein–protein interaction of all the regulated proteins. In the constructed network of bioactives, targets, and pathways, AKR1B1 was majorly targeted by the highest number of bio-actives. Similarly, the PI3K-Akt pathway was primarily regulated by triggering the highest number of genes, i.e. 13 (Table 4).

Number of genes modulated by bioactives of C. glauca

Bioactive	Gene count	Bioactive	Gene count
[Epicatechin-(4beta-8)]5-epicatechin	16	Cinnamtannin A2	19
1-Tetracosanoylglycerol	12	Cinnamtannin B-1	12
3,5-Di-O-caffeoyl quinic acid	12	Cinnamtannin D-1	11
3-O-Methylcalopocarpin	18	Diphenyl sulfone	17
3-Phenylpropanal	10	Kaempferol 3-beta-gentiobioside	12
Cassiaoccidentalin B	17	Methyl 3,5-di-O-caffeoyl quinate	14
Cassiarin A	15	Norrubrofusarin	12
Cassiarin B	24	Parameritannin A-1	14
Cassiaside C2	12	Pentadecanal	19
Cassiatannin A	17	Sandwicensin	15
Cinnamic acid	16	Sennoside	19
Torososide B	17	Sennoside	19
Grand total		350

Fig. 2

Protein–proteins interaction of the targets regulated by the bio-actives of C. glauca.

Number of genes modulated by respective pathways

Pathway	Gene count	Pathway	Gene count
Adherens junction	3	Jak-STAT signaling pathway	4
Adipocytokine signaling pathway	5	Longevity regulating pathway	3
AGE-RAGE signaling pathway in diabetic complications	3	Longevity regulating pathway – multiple species	4
Aldosterone-regulated sodium reabsorption	3	mTOR signaling pathway	4
AMPK signaling pathway	5	Neurotrophin signaling pathway	4
Bile secretion	3	Notch signaling pathway	2
Cytokine–cytokine receptor interaction	9	PI3K-Akt signaling pathway	13
FoxO signaling pathway	5	PPAR signaling pathway	5
Galactose metabolism	2	Progesterone-mediated oocyte maturation	4
Glucagon signaling pathway	5	T Cell receptor signaling pathway	4
Insulin resistance	7	Type II diabetes mellitus	4
Insulin signaling pathway	7	Wnt signaling pathway	3
Grand total		111

Likewise, gene ontology of protein interaction (Fig. 3) analysis identified 1179 biological processes in which response to an organic substance (GO:00110033) scored the lowest false discovery rate via the modulation of 58 genes, i.e. (SIRT1, MIF, GCK, NR3C1, RAF1, PDGFRA, PDGFRB, KDR, REN, AKR1B1, CSF1R, PPARG, CCR5, CCR1, AVPR1A, NOD2, TACR1, INSR, S1PR1, CXCL8, ADAM17, PPARD, HSD11B2, GSK3B, P2RX7, CCR4, HDAC6, HSP90AA1, ACACB, NR3C2, CASP8, AVPR2, PIK3CG, RXRG, TLR9, COMT, PARP1, MAPKAPK2, PTGS2, CTSS, CNR1, PTPN1, TLR4, CNR2, HSPA1A, SYK, MAPK8, ITGA4, GCGR, PPARA, JAK3, FGFR1, TNF, ESR1, HDAC9, CASP1, HTR2A, NR1H4) against 2815 background genes at a strength of 0.68. Similarly, 146 molecular functions were identified in which signaling receptor activity (GO:0038023) scored the lowest false discovery rate via modulation of 39 genes (NR3C1, PDGFRA, PDGFRB, KDR, CSF1R, PPARG, CCR5, CCR1, AVPR1A, TACR1, INSR, ADORA2B, S1PR1, PPARD, P2RX7, NPY2R, CCR4, CCKBR, NR3C2, AVPR2, RXRG, TLR9, ADORA1, ADRB1, CNR1, TLR4, NPY4R, CNR2, ADRA1D, HCAR2, ITGA4, GCGR, PPARA, ADRA2C, FGFR1, ESR1, GPBAR1, HTR2A, NR1H4) against 1429 background genes at a strength of 0.8. Likewise, 53 cellular components were identified in which intrinsic component of the plasma membrane (GO:0031226) scored the minimum false discovery rate by modulation of 33 genes (SLC10A2, PDGFRA, PDGFRB, KDR, SLC5A1, CSF1R, CCR5, CCR1, AVPR1A, TACR1, INSR, ADORA2B, S1PR1, ADAM17, BACE1, SLC5A2, P2RX7, NPY2R, CCR4, AVPR2, ADORA1, ADRB1, CNR1, TLR4, NPY4R, CNR2, ADRA1D, ITGA4, GCGR, ADRA2C, FGFR1, TNF, HTR2A) against 1641 background genes at a strength of 0.67. Fig. 3 represents the top 20 biological processes, molecular function, and cellular components. Additionally, the network analysis identified the cassiacin B to regulate the highest number of proteins; AKR1B1 was primarily a targeted protein, and the PI3K-Akt signaling pathway was primarily modulated signaling pathway. Similarly, the interaction of individual bioactives with the respected modulated proteins and regulated pathways is presented in Fig. 4. In addition, Fig. 5 summarizes the association of gene ontology terms linked genes/targets with the KEGG-triggered genes; 76.6% of the top 20 gene ontology genes were identified to be common with KEGG-identified proteins.

Fig. 3

Gene ontology of proteins regulated by bioactives of C. glauca for (a) molecular function, (b) cellular components, and (c) biological process.

Fig. 4

Network interaction of bio-actives from C. glauca, their regulated targets, and modulated pathways.

Fig. 5

Interaction of genes regulated in CC; cellular components, MF; molecular function, and BP; biological process with KEGG; KEGG pathways.

It was observed that interaction between cassiarin B and MAPK8 showed the highest edge betweenness i.e. 299.94 (Table S1; ESI file†). Further, node notch signaling pathway, cassiarin B, PTPN1, CFD, and FFAR1 had the average shortest path length i.e. 3.85, betweenness centrality i.e. 0.11, closeness centrality i.e. 0.43, neighborhood connectivity i.e. 24, and topological coefficient i.e. 0.79 respectively (Table S2; ESI file†).

In silico molecular docking

In silico molecular docking revealed that methyl 3,5-di-O-caffeoyl quinate to possess the highest docking score (−9.209) with AKR1B1 compared to the rest of the bioactives. Similarly, three hydrogen bond interactions of methyl 3,5-di-O-caffeoyl quinate were observed with three amino acid residues, i.e. O with Leu 300 and Hie100 and –OH with Asp 43. Further, one of the aromatic rings of methyl 3,5-di-O-caffeoyl quinate was involved in pi–pi stacking with Trp219 (Fig. 6a). Similarly, kaempferol 3-beta-gentiobioside had a docking score of −9.123 with six hydrogen bond interactions, i.e. OH with Hie110, Ser302, Lys21, Tyr48, and Val47 with aldo-ketoreductase. Similarly, two aromatic rings from flavonoid moiety of kaempferol 3-beta-gentiobioside were involved in pi–pi stacking with Trp20 and Phe122 (Fig. 6b). Further, 3,5-di-O-caffeoyl quinic acid showed the highest docking score of −9.117 with AKR1B1 via three hydrogen bond interactions, i.e. two O group interacted with Leu300 and Hie110 and one –OH group with Asp43. Similarly, one of the aromatic rings of 3,5-di-O-caffeoyl quinic acid had a pi–pi stacking with Trp219 (Fig. 6c). Similarly, sennoside was predicted to possess the docking score of −8.108 with AKR1B1 via –OH group of tetrahydropyran ring via one hydrogen bond, i.e. –OH with Tyr48. Likewise, pi–pi stacking was observed with one of the aromatic rings of the sennoside with Phe122 (Fig. 6d). Likewise, torososide B was predicted to possess the docking score of −7.945 with AKR1B1 via four hydrogen bond interactions by –OH groups of tetrahydropyran rings with Tyr48, Gln49, and Lys119 (Fig. 6e). In addition, epalrestat, a positive control as AKR1B1 inhibitor, possessed a docking score of −8.646 with four hydrogen interactions in which two O groups interacted with Cys298 and Trp111. Similarly, O− of (–COOH) of epalrestat possessed two hydrogen bond interactions with Hie110 and Tyr48 (Fig. 6f); the docking score of individual bioactives with AKR1B1 is summarized in Table 5.

Fig. 6

Interaction of (a) methyl 3,5-di-O-caffeoyl quinate, (b) kaempferol 3-beta-gentiobioside, (c) 3,5-di-O-caffeoyl quinic acid, (d) sennoside, (e) torososide B, and (f) epalrestat with AKR1B1.

Docking score of bioactives with AKR1B1

Title	Docking score
Methyl 3,5-di-O-caffeoyl quinatea	−9.209
Kaempferol 3-beta-gentiobiosidea	−9.123
3,5-Di-O-caffeoyl quinic acida	−9.117
Sennoside	−8.108
Torososide Ba	−7.945
Cassiarin Ba	−7.903
Cassiaoccidentalin Ba	−7.839
Cassiaside C2a	−7.646
Cassiarin Aa	−7.298
Norrubrofusarina	−7.211
3-Phenylpropanal	−6.753
Cinnamtannin B-1	−6.708
Sandwicensin	−6.46
Diphenyl sulfone	−6.192
Cinnamic acida	−5.984
1-Tetracosanoylglycerol	−5.776
3-O-Methylcalopocarpin	−5.753
Cinnamtannin A2a	−2.464
Pentadecanal	−0.869
[Epicatechin-(4beta->8)]5-epicatechina	0
Cassiatannin A	0
Cinnamtannin D-1	0
Parameritannin A-1a	0
Epalrestatb	−8.646

Compounds identified from network interactions.

Gold standard as aldo-keto reductase inhibitor.

Compounds identified from network interactions. Gold standard as aldo-keto reductase inhibitor. Methyl 3-5-di-O-caffeoyl quinate, kaempferol-3-beta-gentiobioside, and 3-5-di-O-caffeoyl quinic acid complexed with AKR1B1 and a standard drug epalrestat were studied for their comparative stability in the active site of the receptor and their respective binding free energies. The trajectory analysis of methyl 3-5-di-O-caffeoyl quinate complexed with AKR1B1 is presented in Fig. 7. The protein RMSD during the 100 ns simulation shows a slight and gradual rise in the fluctuations for the protein residues in three stages, first 0–30 ns, second 30–60 ns, and the final one with more fluctuation between 60–100 ns (Fig. 7A). During the last 10 ns, it stabilizes between 1.7 to 2 Å. The ligand RMSD showed higher fluctuations throughout the simulation, with RMSD fluctuations between 1.5 to 4 Å (Fig. 7B). The RMSF for protein amino acid residues was between 0.5 to 1.5 Å for almost all the residues except for the residues between 210 to 250; showed high RMSF of up to 4 Å (Fig. 7C). The initial frame (Fig. 7D) of this complex showed hydrogen bonds with Asp44, His111, and Cys299. In contrast, the end conformation from the simulations showed the formation of some new hydrogen bonds between Trp112 and Ile261 and loss of H-bonds with Cys299 (Fig. 7E).

Fig. 7

MD results for the methyl 3-5-di-O-caffeoyl quinate-AKR1B1 complex; (A) protein RMSD, (B) ligand RMSD, (C) protein RMSF, (D) initial state of complex pre-MD production, (E) final state of the complex post-MD production.

The trajectory analysis of the kaempferol-3-beta-gentiobioside complexed with AKR1B1 is presented in Fig. 8. The protein RMSD during the 100 ns simulation showed a quick transition after 25 ns. The rise in the fluctuations for the protein residues occured between 25–30 ns, and then the trajectory converged towards a stable RMSD between 2–3 Å. The overall fluctuation was 1 Å from 30 to 100 ns of MD simulation (Fig. 8A). The ligand RMSD showed a similar trend; initially, the RMSD fluctuated between 0–15 Å. Then an abrupt fluctuation of about 10 Å was observed. Later, the RMSD stabilized between 12.5 to 17.5 Å during the remaining 25–100 ns, which accounted for an average RMSD of 5 Å (Fig. 8B). The RMSF for protein amino acid residues was high for almost all the regions of the protein, between 0.5 to 3 Å. The residues from 210 to 225 showed higher RMSF of up to 4.5 Å (Fig. 8C). The initial frame of this complex showed the formation of no hydrogen bonds between the ligand and the receptor (Fig. 8D). The end confirmation from the simulations showed the formation of some new hydrogen bonds between Trp220 and Leu302 (Fig. 8E).

Fig. 8

MD results for the kaempferol-3-beta-gentiobioside-AKR1B1 complex; (A) protein RMSD, (B) ligand RMSD, (C) protein RMSF, (D) initial state of complex pre-MD production, (E) final state of the complex post-MD production.

The trajectory analysis of 3-5-di-O-caffeoyl quinic acid complexed with AKR1B1 is presented in Fig. 9. The protein RMSD during the 100 ns simulation showed a stable conformational picture with a low fluctuation in the trajectory RMSD between 1–2 Å. The overall fluctuation was 1 Å for almost all of 100 ns MD simulations (Fig. 9A). The ligand RMSD showed larger fluctuations between 0.5–3.5 Å. Further, an abrupt fluctuation of about 10 Å was also observed. Later, the RMSD stabilized between 12.5 to 17.5 Å during the remaining 25–100 ns, which accounted for an average RMSD of 5 Å's (Fig. 9B). The RMSF for protein amino acid residues was high for almost all the regions of the protein, between 0.5 to 3 Å. The residues within 210 to 225 reflected higher RMSF of up to 4.5 Å (Fig. 9C). The initial frame of this complex showed the formation of hydrogen bonds between the ligand and the receptor residues Trp112, Asp44, Cyc299, and Trp220 (Fig. 9D). The end confirmation from the simulations showed the formation of new hydrogen bonds between Leu301 and the loss of other hydrogen bonds observed previously (Fig. 9E).

Fig. 9

MD results for the 3-5-di-O-caffeoyl quinic acid-AKR1B1 complex; (A) protein RMSD, (B) ligand RMSD, (C) protein RMSF, (D) initial state of complex pre-MD production, (E) final state of the complex post-MD production.

The trajectory analysis of epalrestat complexed with AKR1B1 is presented in Fig. 10. The protein RMSD during the 100 ns simulation showed a gradual rise in RMSD for 60 ns of the trajectory, after which was a high rate of fluctuation levelling up to 6.5 Å (Fig. 10A). The ligand RMSD had lower RMSD i.e. 5 Å, but later the fluctuations rose significantly, suggesting a breakdown of the complex (Fig. 10B). The RMSF for protein amino acid residues was low between 0.5 to 2 Å (Fig. 10C). The initial frame of this complex showed the formation of hydrogen bonds between the ligand and the receptor residue Ser303 (Fig. 10D). The end confirmation from the simulations shows a complex breakdown, and the ligand had fallen out of the receptor binding site (Fig. 10E).

Fig. 10

MD results for the epalrestat-AKR1B1 complex; (A) protein RMSD, (B) ligand RMSD, (C) protein RMSF, (D) initial state of complex pre-MD production, (E) final state of the complex post-MD production.

The MMGBSA calculations were performed on the complete trajectories to study the binding energy of the complexes; mentioned in Table 6. The contribution from various parameters are summarized in the table for the four complexes studied in the MD simulations.

Binding free energy components for the protein–ligand complexes calculated by MM-GBSA analysis, all energies are in kcal mol−1 with standard deviation in parenthesisa

Compounds	MM-GBSA
Compounds	ΔE_VDW	ΔE_ELE	ΔG_GB	ΔG_Surf	ΔG_gas	ΔG_Sol	ΔG_bind
Quinate	−49.66 (4.94)	−47.01 (12.56)	62.60 (7.91)	−7.54 (0.45)	−95.38 (12.97)	55.05 (7.64)	−40.33 (6.69)
Gentiobioside	−29.82 (8.44)	−18.84 (10.91)	35.17 (12.18)	−3.91 (1.25)	−63.24 (16.70)	31.25 (11.14)	−31.98 (7.28)
Quinic acid	−40.05 (3.73)	33.99 (16.35)	−4.51 (10.90)	−5.80 (0.44)	−3.34 (15.30)	−10.32 (10.79)	−13.66 (7.04)
Epalarestat	−12.41 (8.46)	−24.88 (31.74)	33.73 (32.82)	−1.62 (1.08)	−39.83 (35.02)	32.10 (32.49)	−7.72 (4.87)

ΔEVDW = van der Waals contribution from MM; ΔEELE = electrostatic energy as calculated by the MM force field; ΔGGB = the electrostatic contribution to the solvation free energy calculated by GB; ΔGSurf = solvent-accessible surface area; ΔGSol = solvation free energy; ΔGgas = gas phase interaction energy; ΔGbind = binding free energy.

Discussion

Traditional medicinal plants have played a vital role in drug discovery from ancient times. Many well-known bioactives have been derived from the plant origin either based on the traditional claim or independent of traditional knowledge; they are analogous to the core structure or secondary metabolites.[27]C. glauca is reported to lower the blood glucose in streptozotocin-induced diabetes mellitus.[18] Further, a record has been made to report its efficacy to inhibit enzymes involved in post-prandial hyperglycemia i.e., α-amylase and α-glucosidase enzymes proposing evidence of its anti-diabetic property.[20] The current drug discovery proceedings involve using in silico tools to predict possible molecular mechanisms, which further adds-on support to in vitro and in vivo models.[28] Network pharmacology provides an overview of predicting the medicinal plant's possible molecular mechanism via multi-compounds and multi proteins interaction.[29] In the present study, among two hundred and sixteen targets of diabetes mellitus (as recorded by the therapeutic target database), twenty-three bioactives of C. glauca were identified to regulate eighty-four proteins accountable for hyperglycemia. Previously it has been demonstrated that traditional medicinal plants may possess its anti-diabetic activity via their action over multiple proteins involved in pathogenesis but may not rely on one pathway or one protein, as explained previously.[30] Similarly, during the query of targets of bioactives, cassiarin B was predicted to modulate the maximum number of genes i.e. 24 (HSD11B1, SYK, AKR1B1, PARP1, ADAM17, P2RX7, HSP90AA1, ESR1, PTGS2, AURKA, AURKB, MAPKAPK2, PTPN1, CCR5, KDR, HDAC6, CFD, MAPK8, HDAC1, SCD, CSF1R, PIK3CG, CTSS, HSD11B2) in which maximum bio-actives majorly targeted AKR1B1; AKR1B1 is associated with the progression of the vascular complications such as diabetic nephropathy, diabetic retinopathy, and stroke.[31] Similarly, the PI3K-Akt pathway was modulated based on the gene counts, i.e. 14 (RAF1, PDGFRA, PDGFRB, KDR, CSF1R, INSR, GSK3B, HSP90AA1, PIK3CG, TLR4, SYK, ITGA4, JAK3, FGFR1). Phosphatidyl inositol 3-kinase (PI3K) plays a crucial role in mediating insulin's metabolic effects. AKT/protein kinase B is involved in regulating the activity of numerous targets, including transcription factors and kinases, and other regulatory molecules.[32] This report suggests that PI3K-Akt regulators are efficacious in managing cases of diabetic retinopathy.[33] Likewise, in the present study, cytokine–cytokine receptor interaction was the second major pathway which was modulated with the regulation of 9 genes, i.e. PDGFRA, PDGFRB, KDR, CSF1R, CCR5, CCR1, CXCL8, CCR4, TNF; reports have been made to point the increase in inflammatory markers such as C-reactive protein, plasminogen activator inhibitor-1, and other cytokines in insulin-resistant states without diabetes.[34,35] Further, the drug-likeness of the molecule provides an idea of how the molecule is likely to behave in vivo, including the characters of molecular weight, the number of hydrogen bond donors, acceptors, and lipophilicity for its oral bioavailability.[36] Among the twenty-three bio-actives, fourteen were predicted to possess a positive drug-likeness score indicating their probability for oral intestinal absorption. Likewise, the ADMET profile predicts the pharmacokinetic and pharmacodynamic parameters of the drug molecule such as intestinal absorption, BBB permeability, water-solubility, plasma protein binding, drug metabolism, hepatotoxicity, and other parameters,[37] which were also predicted for individual bioactives. The target AKR1B1 was chosen for docking as it was linked with the majority of bioactives in the combined bioactive-protein-pathway network. Among the bio-actives modulating AKR1B1, four molecules, i.e. cassiarin B, cinnamic acid, cassiarin A, and norrubrofusarin were with the molecular weight of <500 daltons in which cassiarin B possessed the docking score of −7.903. Similarly, cassiaoccidentalin B was predicted to have the highest drug-likeness score of 0.84 and possessed a docking score of −7.839. Earlier studies have reported the binding affinity of cassiaoccidentalin B as α-glucosidase and α-amylase inhibitor,[12] which may also contribute to managing post-prandial hyperglycemia. The molecular mechanism of any plant may not be justified by a single compound or a single target but may be implicated via multiple bio-actives and their target interactions.[38] In this regard, among the multiple bioactives of C. glauca, cassiarin B modulated 24 different proteins in which PTPN1 was primarily regulated. Further, PTPN1 negatively regulates insulin signaling by dephosphorylating the phosphotyrosine residues of the insulin receptor kinase activation segment. Moreover, PTPN1 gets activated through 3 pathways, i.e. insulin resistance, insulin signaling pathway, and adherens junction pathway, which could be regulated and may play an important role in manipulating the pathogenesis of hyperglycemia. Although, it would be complex to define the exact molecular mechanism of C. glauca due to the combined effect of multiple bio-actives by acting on various proteins to maintain the glucose homeostasis, from the present findings it can be speculated that, the PI3K-Akt signaling pathway could be regulated as all the bio-actives majorly modulated it, and cassiarin B and cinnamtannin A2 could be the lead hits for this action. The molecular dynamics simulation of the three natural bioactives and one standard drug molecule gives an elaborating insight into the binding of these molecules with the active site of the receptor AKR1B1. These molecules were selected based on their docking profile, scoring, and interaction with the residues in the active site. The compound methyl 3-5-di-O-caffeoyl quinate complexed with AKR1B1 showed the best profile concerning binding energy at −40.33 ± 6.69 kcal mol−1. This compound showed better stability during the simulation with low RMSD and higher stability of the complex compared to kaempferol-3-beta-gentiobioside, and 3-5-di-O-caffeoyl quinic acid. These ligands formed several hydrogen bonds with the receptor residues, but these were not the same in each case; this could be attributed to the large volume of the receptor surface and its shallow nature. The compound methyl 3-5-di-O-caffeoyl quinate retained its interaction with Ile261 and His111, suggesting that these are important residues in the active site or form stronger hydrogen bonds which is also dependent on the conformation and the flexibility this molecule possess. The compound kaempferol-3-beta-gentiobioside formed hydrogen bonds with the Trp220 and Leu302 towards the end of the simulation; this was the reason for its stable ligand RMSD and protein RMSD. The compound 3-5-di-O-caffeoyl quinic acid assumes a linear structure but is highly flexible, and thus it exploreed high conformation space, which is evident from its ligand RMSD plot. The more surprising findings were from the epalrestat-AKR1B1 complex. It showed very low binding energy at −7.72 ± 4.87 kcal mol−1. The reason could be the instability of the complex. The ligand "fell off" from the complex at about 70 ns giving rise to high RMSD for ligand; the visualization of MD trajectory showed that the ligand ends up at the box boundaries (Fig. 10E). The MD studies suggest methyl 3-5-di-O-caffeoyl quinate–AKR1B1 complex as the best complex from the set of molecules under investigation with better binding and energy profiles. In addition, the progression of T2DM is directly concerned with insulin resistance. In this regard, we mapped the KEGG regulated proteins within the KEGG insulin resistance pathway (entry hsa04931) in which we identified compounds that could act over PI3K; involved in insulin sensitivity, GSK3; involved in glycogenesis, TNF-α; reduces oxidative stress, PPAR; reduced fatty acid efflux, ACC-β; controls fatty acid oxidation, and IRS-1, and Akt; improves insulin sensitivity. These points are directly concerned with the development of insulin resistance-mediated diabetes mellitus. Since most of the compounds-mediated proteins interactions identified these pathways, it can be assumed that C. glauca can act over the upregulation of insulin sensitivity, promotes glycogenesis, reduces oxidative stress, and attenuates the free fatty acid metabolism (Fig. 11).

Fig. 11

Checkpoints affected by the bioactives of C. glauca in insulin resistance (KEGG entry: hsa04931) in liver and skeletal muscle. Targeted points by the bioactives.

Conclusion

The anti-diabetic effect of C. glauca may be due to bio-actives like cassiarin B, cassiaoccidentalin B, 3-5-di-O-caffeoyl quinate, and cinnamtannin A2, which were predicted to be involved in the regulation of the majority of proteins associated with the pathogenesis of diabetes mellitus. Further, the PI3K-Akt pathway was majorly modulated, and AKR1B1 was the majorly modulated protein. However, the present findings are based on the computational approaches that need to be further validated using in vitro and in vivo protocols.

Ethical statement

This work did not include any human or animal participation.

Funding

This work did not receive any funding from any national or international agencies to declare.

Conflicts of interest

All the authors of this manuscript have no conflict of interest in any financial and non-financial means.

31 in total

1. Computational methods for the prediction of 'drug-likeness'

Authors:
Journal: Drug Discov Today Date: 2000-02 Impact factor: 7.851

Review 2. ADMET modeling approaches in drug discovery.

Authors: Leonardo L G Ferreira; Adriano D Andricopulo
Journal: Drug Discov Today Date: 2019-03-16 Impact factor: 7.851

Review 3. Pathogenesis of type 2 diabetes mellitus.

Authors: Jack L Leahy
Journal: Arch Med Res Date: 2005 May-Jun Impact factor: 2.235

4. Association of aldose reductase gene (AKR1B1) polymorphism with diabetic retinopathy.

Authors: Navdeep Kaur; Vanita Vanita
Journal: Diabetes Res Clin Pract Date: 2016-09-08 Impact factor: 5.602

Review 5. Epidemiology of Diabetes Mellitus and Cardiovascular Disease.

Authors: Diana Glovaci; Wenjun Fan; Nathan D Wong
Journal: Curr Cardiol Rep Date: 2019-03-04 Impact factor: 2.931

6. Identification of α-amylase inhibitors from flavonoid fraction of Feronia elephantum and its integration with in-silico studies.

Authors: Saikiran Kulkarni; Prarambh Dwivedi; Aravinda Nagaraj Danappanvar; Belgaonkar Ashwini Subhash; B M Patil
Journal: In Silico Pharmacol Date: 2021-08-10

7. Protective effect of Cassia glauca Linn. on the serum glucose and hepatic enzymes level in streptozotocin induced NIDDM in rats.

Authors: Mamta Farswan; Papiya Mitra Mazumder; V Percha
Journal: Indian J Pharmacol Date: 2009-02 Impact factor: 1.200